Gene GM21_2072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2072 
Symbol 
ID8137408 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2402794 
End bp2404023 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content63% 
IMG OID644869687 
Productmolybdenum cofactor synthesis domain protein 
Protein accessionYP_003021882 
Protein GI253700693 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0303] Molybdopterin biosynthesis enzyme 
TIGRFAM ID[TIGR00177] molybdenum cofactor synthesis domain 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.00000000000000505148 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATCAGCA TCGAGCAAGC GCAACGCACC GTCTTCGAGC AGATAGTCCC GCTGGAAACG 
GTGACTGTTC CGGTCGCCCA GGGGTTGAAC CGGATCTCCC CCGACAACCA CGTCGCCCCG
TGGGACATCC CCGCTGCCGA TAATTCCGCC ATGGACGGGT TCGCCTTCCG CTACCAAAGC
GCTAACCCGG TCGAGCTGAA GATTATAGGT TTTCTCCCTG CCGGGGAGGT CATAAACGAT
CCGGTTGCCG AGGGGTGCGC GATCAGGATC ATGACAGGAG CCCCCATCCC CCCCGGATGC
GACACGGTGG TTCCCATCGA GGACGCGGAG GTCGAAGGCG ACCTGCTCAG ACTTAAGTTC
CCGGTGAAGG CAGGAAACCA CGTCCGCAGG CGCGGTGAGG ACATAGCCCG AGGCAATGTC
GTGATACCAG CCGGCTCCGT GCTCCGCCCC CAGGAGATCG GCATGCTCTG CGCCATGGGA
AAAACTACCC TGTCGCTTTA CCGCAAAGCA AGAGTCGCCA TCCTCGCCAC CGGCGATGAA
CTCCTCGAAC CCGGATCGCC CCCCTCCCCC GGCAAACTCA TCAATAGCAA CAGTTACAGC
CTCGCCGCGC AGGTTTTGGA TGCCGGCGGC GACCCAATCG TCCTCGGCAT CGCCGCCGAT
ACGCTGGAAG ACACCTGCGA AAGGATCAGG GCCGGCCTTG ACGCGGACAT GCTGGTCATC
ACCGGGGGGG TATCGGTCGG GGACAGGGAC TACGTCAAGG CGGCAATAGA GCGTTTGGGA
GGCGAGATCC AGTTCTGGAA GGTCAACATG AAGCCAGGTA AGCCCTTGGC CTTCGCCTCC
CTGCAGGGAA AGCCGATTTT CGCACTTCCC GGCAACCCCG TTGCCGCCAT GGTCTCCTTC
GAGCTCTTCG TGCGCCCGTC GATACTGAAG GCGATGGGGC ACCGGCGCAT CCTCCGCCCT
GTCGTGAACG CTATCTTGCA GGAACGCGCG GCCAACAAGG GGGAGCGGCC GCACCTGGTG
CGGGGGATCG TCTCCCGGCG CGGAGGCGGA TACCTCGTTT CCACTACGGG CAATCAAAGC
TCGGGAAGGC TTTCCTCTCT GACGCTGGGC AATGGCTTGA TGAAGCTCGC GCCGGAGTCG
ATCCTGGAAG CAGGGAGCGA GGTGGAGGTC GTTCTCCTGG ACCGATGGTT CGAACAGGGA
GATGTGGAAG AGGGGCTGGT GCGCCAGTGA
 
Protein sequence
MISIEQAQRT VFEQIVPLET VTVPVAQGLN RISPDNHVAP WDIPAADNSA MDGFAFRYQS 
ANPVELKIIG FLPAGEVIND PVAEGCAIRI MTGAPIPPGC DTVVPIEDAE VEGDLLRLKF
PVKAGNHVRR RGEDIARGNV VIPAGSVLRP QEIGMLCAMG KTTLSLYRKA RVAILATGDE
LLEPGSPPSP GKLINSNSYS LAAQVLDAGG DPIVLGIAAD TLEDTCERIR AGLDADMLVI
TGGVSVGDRD YVKAAIERLG GEIQFWKVNM KPGKPLAFAS LQGKPIFALP GNPVAAMVSF
ELFVRPSILK AMGHRRILRP VVNAILQERA ANKGERPHLV RGIVSRRGGG YLVSTTGNQS
SGRLSSLTLG NGLMKLAPES ILEAGSEVEV VLLDRWFEQG DVEEGLVRQ