Gene GM21_3781 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3781 
Symbol 
ID8139155 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4354073 
End bp4355023 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content62% 
IMG OID644871400 
Productthioredoxin reductase 
Protein accessionYP_003023558 
Protein GI253702369 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0492] Thioredoxin reductase 
TIGRFAM ID[TIGR01292] thioredoxin-disulfide reductase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones104 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGTGA CTCACCACCG TCTCATTATT CTAGGCTCGG GCCCTGCCGG TTACACCGCT 
GCGGTCTACG CCGCGCGCGC CAACCTGAAC CCGGTCCTGA TCGCGGGGCT GCAGCCGGGG
GGGCAGCTGA TGACGACGAC CGAGGTGGAC AACTGGCCGG GCGACCCGGA AGGGGTGCTG
GGCCCGGATC TGATGGAGCG CATGCGCCTG CATGCCGAGC GTTTCGGCAC CCAGTTCATC
TACGACCATA TCAGCAAGGC TCAGGTCACC AAGCCACCGT TCGTACTGGA AGGGGATAAC
GGCAGCTACA GCTGCGACGC CCTGATCATC GCTACGGGCG CGTCCGCTAA ATACCTGGGG
CTTCCCTCGG AGCACGCGTT CAAGGGAAAA GGCGTTTCCG CCTGCGCCAC CTGCGACGGA
TTTTTCTATC GCGGCAAGCC GGTGGTCGTC ATCGGCGGCG GCAGCACCGC AGTCGAAGAG
GCGCTTTATC TCTCCAACAT TGCGAGCCAC GTCACCGTGG TGCATCGAAG GGACAAGTTC
CGCGCCGAGA AGATCCTGGC CGACAAGCTG ATCGAGAAGA CCAAAAACGG CAACGTCACC
ATCGAATGGA ATCACCACCT GGAAGAAGTG CTGGGAGACG AGTCCGGCGT CACCGGGGTC
AGGCTGAGGC ACACCAGCGG CTCGGACAAG GTGATCAGCG CGCACGGTTG CTTCATCGCC
ATAGGGCACC AGCCCAACAC GCACATCTTC GACGGGCAAT TGGAGATGGA CGAAGGATAC
ATCCGCACCA ACTGCGGCTA TGAAGGGAAC TCCACTTCCA CCAACATCCC CGGGGTCTTC
GCTGCGGGAG ACGTGCAGGA CAGAAACTAC AAGCAGGCGA TCACCTCCGC CGGGACCGGC
TGCATGGCCG CCCTCGATGC GGACCGCTAC CTGGAAATGC TCAAGGCCTA G
 
Protein sequence
MEVTHHRLII LGSGPAGYTA AVYAARANLN PVLIAGLQPG GQLMTTTEVD NWPGDPEGVL 
GPDLMERMRL HAERFGTQFI YDHISKAQVT KPPFVLEGDN GSYSCDALII ATGASAKYLG
LPSEHAFKGK GVSACATCDG FFYRGKPVVV IGGGSTAVEE ALYLSNIASH VTVVHRRDKF
RAEKILADKL IEKTKNGNVT IEWNHHLEEV LGDESGVTGV RLRHTSGSDK VISAHGCFIA
IGHQPNTHIF DGQLEMDEGY IRTNCGYEGN STSTNIPGVF AAGDVQDRNY KQAITSAGTG
CMAALDADRY LEMLKA