Gene Saro_2028 details

Gene Information

Locus tag: Saro_2028
Symbol:
ID: 3917349
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2161451
End bp: 2162656
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 70%
IMG OID: 640444780
Product: molybdopterin molybdochelatase
Protein accession: YP_497301
Protein GI: 87200044
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0303] Molybdopterin biosynthesis enzyme
TIGRFAM ID: [TIGR00177] molybdenum cofactor synthesis domain
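
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2161451
END_BP = 2162656
GENE_LENGTH_BP = 1206
PROTEIN_LENGTH_AA = 401


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count, assuming the last codon of the CDS is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2162656 - 2161451 + 1 gives 1206 bp, and 1206 / 3 - 1 gives 401 aa, which agrees with the listed lengths.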


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAAACGC CTCCCCTTCA GCTGGCCGAA GCGCAGGCTC GCCTGCTGGC GCTCGCGCCC 
AGTCTCTCGG TCGAACATCG CGCGGTCGCG GAATGCCTCG GCCACTACAT CGCCACCCCG
CTTCACGCCC GCCGAACCCA ACCCGCCGCG CCCCTTTCGG CGATGGATGG CCATGCCATG
CGCGCCGCCG ATCTTCCGGG CCCCTGGCGT GTCATTGGCG AGAGCGCGGC CGGCCATCCG
TTCGGCGGCA CCGTCGGTCG CGGCGAAGCG GTCAGGATCA GCACGGGCGC GATGCTGCCG
GCGGGCGCCG ACATGGTCCT CCTCCAGGAA GACAGCGCAC GCGACGGCGA AACGTTGACC
CTGACTGGCG AGCCGCCCGC GCCTCCGGGC CGCCATATCC GCCCCGCCGG CATGGACTTC
ACCGCCGACA CCACGCTGAT CGAAGCCGGC ACCCGCATCG GGCCCGCGCA GATCGCGCTG
GCGATCGCGG CAGGGCACAG CCACCTCGCG GTGCGCCGCC CGTTGCGCCT CACGGTCATC
GACAGCGGAG ACGAACTGGT CCGGCCCGGC AACACGACGC GTCTGCACCA GCTTCCCGCC
AGCAATGGCC CCATGCTCTG CGCCATGGCT TCGGCGCTGC CCTGCGACAT CAGCCACATC
GGCCCGATCG CGGACCGGAT CGAAGATCTC GCCGCCGCGC TCGATGGCGC GCATGAGGCC
GATGTCGTCG TGACCAGCGG CGGGGCTTCG GTCGGTGACC ACGACCTCGT CCGTCCCGCA
CTCGAGGCGG TGGGCGCGAA GATCGATTTC TGGCGCGTGG GCATCAAGCC GGGCAAGCCG
CTGCTCGTCG CGACAAGAGG CAACCAGGTC ATCATCGGCC TGCCCGGCAA CCCAGCCTCG
GCCTTCGTCA CCGCGTTCCT GTTCCTCTTG CCTCTGCTGC GCGCCAGCCT CGGCGCGGCG
AGCCCGCTCC CCCGCACCAT CCCGGCGAGG CTGGCTGGCC CGATGGGGCC GGGCGGCAGC
CGCATGGAAT TCCTCAGGGC CCACTGGGAT GGCAACGGCG TCACACTTGA TGAACTCCAG
GATTCCGGGG CGCTCTCGCC GCTCGCCCGG GCGAACGCGC TTGTCGTGCG CGAGGCGGGA
AGCGAGCGGA AGGAGGGCGG AACGGATGTT CCGATCTACC TGCTCGAAAA TGGCGGAATT
GCTTGA
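
The gene sequence above is printed in blocks of ten bases, so it needs to be joined before any per-base statistic is computed. The sketch below shows one way the GC content field could be recomputed from such text; the helper names are hypothetical, and the exact rounding used for the reported 70% is not stated in this record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)
```

Passing the full gene sequence text to `gc_percent` gives a value that can be compared with the GC content field.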
 
Protein sequence
MKTPPLQLAE AQARLLALAP SLSVEHRAVA ECLGHYIATP LHARRTQPAA PLSAMDGHAM 
RAADLPGPWR VIGESAAGHP FGGTVGRGEA VRISTGAMLP AGADMVLLQE DSARDGETLT
LTGEPPAPPG RHIRPAGMDF TADTTLIEAG TRIGPAQIAL AIAAGHSHLA VRRPLRLTVI
DSGDELVRPG NTTRLHQLPA SNGPMLCAMA SALPCDISHI GPIADRIEDL AAALDGAHEA
DVVVTSGGAS VGDHDLVRPA LEAVGAKIDF WRVGIKPGKP LLVATRGNQV IIGLPGNPAS
AFVTAFLFLL PLLRASLGAA SPLPRTIPAR LAGPMGPGGS RMEFLRAHWD GNGVTLDELQ
DSGALSPLAR ANALVVREAG SERKEGGTDV PIYLLENGGI A
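
The protein sequence can be related to the gene sequence by translating codon by codon. The following is a minimal sketch, not the database's own procedure: it uses the standard amino acid assignments (which translation table 11 shares with the standard code) and assumes the first codon is the initiator and is therefore read as methionine, which is why the leading GTG corresponds to M in the protein. The names `CODON_TABLE` and `translate_cds` are illustrative.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(text: str) -> str:
    """Translate a complete CDS, reading the first codon as Met (assumption)."""
    seq = "".join(text.split()).upper()
    if len(seq) == 0 or len(seq) % 3 != 0:
        raise ValueError("CDS length must be a non-zero multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    residues[0] = "M"  # initiator codon (e.g. GTG) is read as methionine
    protein = "".join(residues)
    # Drop a single trailing stop codon if present.
    if protein.endswith("*"):
        protein = protein[:-1]
    return protein
```

Applied to the gene sequence text, the result can be compared directly with the protein sequence listed above after removing its spaces and line breaks.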