Gene GM21_2119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2119 
Symbol 
ID8137455 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2476859 
End bp2477851 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content67% 
IMG OID644869734 
Product3-beta hydroxysteroid dehydrogenase/isomerase 
Protein accessionYP_003021929 
Protein GI253700740 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones81 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGCTC TGGTCACCGG CGGTGGAGGC TTCTTAGGCT CCGCCATAGT GCGCCAATTG 
CTGGCCCGGG GGGACCAGGC GGTCAGCTTC TCGCGCGGCG AGTACCCGGA GTTGGCCGCG
CTCGGCGTGG AACAGCGCCG GGGCGACCTG TCGGATCTGG AAGCGGTGGC GGAGGCGGCC
CGGGGGTGCG ACGTCGTTTT CCATGTCGCC GGGAAGGCTG GGATCTGGGG GAAGTTCGAG
GAATACTACC TAGCCAACGT GACCGGCACC GAAAACGTCA TCGAGGCGTG CCGCAGGCTC
GGCATCGAGA GGCTCGTCCA TACCAGCTCC CCGAGTGTGG TATTCGACGG CTCCGACGTC
GAGGGAGGGA ACGAATCGCT CCCCTACCCT GCGCATTTCG AGGCGCATTA CCCCCACACC
AAGGCCCTGG CCGAACAGGC GGTGCTCGCG GCGAATACCC CTACGCTGGC GACGGTATCG
CTGCGCCCCC ACCTGATCTG GGGCCCAGGC GACAACCACC TGGTGCCGCG CATCGTGGCG
AAGGCGCGCT CGGGCGCCCT GAAGCGGATC GGCAACCACC CCTGCCTGGT CGACACCGTC
TACGTCGATA ACGCCGCCGA GGCGCACCTG AATGCCGCCG ACCGGCTGCA ACCGGGGAGC
GCACCGGCAG GAAAGGCGTA CTTCATCTCC AATGGCGAGC CGATCCCGCT CTGGGAGATG
GTGAACCGGA TCCTCGCGGC CGCAGGAGTT CCCCCGGTGA CGCGCCAGGT TTCCCCTGGC
CTTGCCTATG GCGCCGGCGT GATCTGCGAA ACCCTCTGGA GGGTGCTGCG CCTCTCCGGC
GAGCCCCCGA TGACCCGTTT CGTCGCCAAG GAACTCGCCA CGGCGCACTG GTTCGACCTC
TCCGCTGCGC GCACCGATCT CGGTTACCAT CCCCGCATAT CCATCGATGA AGGGCTTGAG
CTGCTGCAAG CATCCCTGAG GCAAGGGCGG TGA
 
Protein sequence
MKALVTGGGG FLGSAIVRQL LARGDQAVSF SRGEYPELAA LGVEQRRGDL SDLEAVAEAA 
RGCDVVFHVA GKAGIWGKFE EYYLANVTGT ENVIEACRRL GIERLVHTSS PSVVFDGSDV
EGGNESLPYP AHFEAHYPHT KALAEQAVLA ANTPTLATVS LRPHLIWGPG DNHLVPRIVA
KARSGALKRI GNHPCLVDTV YVDNAAEAHL NAADRLQPGS APAGKAYFIS NGEPIPLWEM
VNRILAAAGV PPVTRQVSPG LAYGAGVICE TLWRVLRLSG EPPMTRFVAK ELATAHWFDL
SAARTDLGYH PRISIDEGLE LLQASLRQGR