Gene GM21_3038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3038 
Symbol 
ID8138384 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3527480 
End bp3529129 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content63% 
IMG OID644870639 
Producthydroxylamine reductase 
Protein accessionYP_003022825 
Protein GI253701636 
COG category[C] Energy production and conversion 
COG ID[COG1151] 6Fe-6S prismane cluster-containing protein 
TIGRFAM ID[TIGR01703] hydroxylamine reductase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones161 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCATGT TTTGCCGTCA ATGTGAGCAG GCCGCCAAGG GGACCGGATG CGACGTCATG 
GGTGTCTGCG GGAAAAGTCC CGAGGTAGCC GCCCTGCTGG ACCTGTTGCT CCACGGTCTC
AAGGGGCTCG CCATCTACGC GGACAAGGCC CGTGCCCTGG ATGCACGCAA CACCGTCGCC
GACATGTATC TCATCGAAGG GCTTTTCACC ACGGTCACCA ACGTCGATTT CGACGCGGTG
CAGCTCGCCG GCAAGCTGAG AAGGTGCTAC GACCTGAAGG AACAGGTCAA AGCGCTGTAC
GAGGGGGCCT GGCGCGAGAA GCACGGGGCG CCCGCCGCAG CCATCACCGA CGGTCCCGCC
GCCTGGGTGA TAGCGGATAC CCTGGAAGGG CTCGTGGCGC AGGGGAAAAA CTGCGGCGTC
AGGAGCCAGC ACAGCGACCC CGACATCCTC TCCTCCATCG AAATCATCAT CTACGGCCTG
AAGGGTATGG CCGCCTACGC GAATCACGCC TGCATCCTCG GCAAGACCGA CGAGGAGGTC
TTCGCCTTCT TCCACAGCGC GCTCGCAGCC ACCACCGACC CGAACAGGGG GCTCATGGAC
TTCGTCGGCA TCGCCATGGA GTGCGGCAAG CTCAACATCA AGGTGATGGG GATGCTCAAC
GAAGGGCACG TCGAGCGTTA CGGCCACCCG GTCCCGACCA AGGTGCAGCT CGGCACCCGC
AAGAACAAGG GGATCCTGGT CTCCGGCCAC GACCTGCGCA TGCTGGAGGA GATCCTCAAG
CAGACCGAAG GGAAGGGGAT CGACATCTAC ACCCACGGCG AGATGATCCC CGCCCACGGC
TACCCGGCGC TGAAGAAATA CCCGCACCTC TACGCCAACT TCGGCGGCGC CTGGCAGGAC
CAGCACAAGG AGTTCCAGGC TTTCCCGGGC GCCATCATCT TCAACACCAA CTGCATCCAG
CGTCCCGCCG ACAGCTACAA AGACCGCCTC TTCACCTGGG GCGAGGTGGG GTGGCCGGGC
GTCAAGCACA TCGCCGGCTG GCACTTCGAC GAGGTGATCA ACAAGGCGCT TGGGTGCCCG
GATCTCCCCG ACGCTCCCGG CAAGGAGATC CTCACCGGCT TCGGGCACAA CGCCGTCCTT
GGCGTGGCGG ACAAGGTCAT CGAGGCGGTG AAGGGGGGAG CGGTCAAGCA CTTCTTCCTG
ATCGGCGGCT GCGACGGTGC GAAGAGCGGC CGCAACTACT ACACCGAGTT CGCCGAAAAG
GTCCCCAAGG ACTGCGTCAT CCTGACCCTT GCCTGCGGCA AGTACCGTTT CAACAAGCTC
GAATTCGGCG ACATCGGCGG CATCCCGCGC CTTCTGGATG TGGGGCAGTG CAACGACGCC
TACTCCGCGG TACAGATCGC GCTGGCCCTC GCCGGAGCCT TCAACTGCGG CGTGAACGAT
CTGCCGCTTT CCTTCATCCT TTCCTGGTAC GAACAGAAGG CGCACGTCAT CCTGCTCTCG
CTTTTGTACC TGGGGATAAG GGACATCAAG CTCGGTCCCA TGCTTCCCGC CTATCTCTCG
CCGAACGTGC TGCAGTTTTT GGTCAGCAAC TTCAACATCA GCCAGATCGG CACTGTCGAC
GAGGACCTCA AGGCGAGCCT CGGGCAATAG
 
Protein sequence
MSMFCRQCEQ AAKGTGCDVM GVCGKSPEVA ALLDLLLHGL KGLAIYADKA RALDARNTVA 
DMYLIEGLFT TVTNVDFDAV QLAGKLRRCY DLKEQVKALY EGAWREKHGA PAAAITDGPA
AWVIADTLEG LVAQGKNCGV RSQHSDPDIL SSIEIIIYGL KGMAAYANHA CILGKTDEEV
FAFFHSALAA TTDPNRGLMD FVGIAMECGK LNIKVMGMLN EGHVERYGHP VPTKVQLGTR
KNKGILVSGH DLRMLEEILK QTEGKGIDIY THGEMIPAHG YPALKKYPHL YANFGGAWQD
QHKEFQAFPG AIIFNTNCIQ RPADSYKDRL FTWGEVGWPG VKHIAGWHFD EVINKALGCP
DLPDAPGKEI LTGFGHNAVL GVADKVIEAV KGGAVKHFFL IGGCDGAKSG RNYYTEFAEK
VPKDCVILTL ACGKYRFNKL EFGDIGGIPR LLDVGQCNDA YSAVQIALAL AGAFNCGVND
LPLSFILSWY EQKAHVILLS LLYLGIRDIK LGPMLPAYLS PNVLQFLVSN FNISQIGTVD
EDLKASLGQ