Gene GM21_0256 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0256 
Symbol 
ID8135563 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp306957 
End bp308237 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content69% 
IMG OID644867877 
Productnitrite and sulphite reductase 4Fe-4S region 
Protein accessionYP_003020099 
Protein GI253698910 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0155] Sulfite reductase, beta subunit (hemoprotein) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.000133948 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCATGG AGAAGAACCT GCGCCTGGAG GGGATCTACC CCCAGCGCCA GAAGGAATTT 
CATATGCAGC GGGTGAAGCT TCCCGCGGGG ATCATATCCG CCGAGCAGGC GCTCAAAGTG
GCGCAACTGG CGGAGCGCTT CGCCCGCGGG GTGGTGCACC TGACCACCAG GGGGAGCATC
GAGCTGCATT GGCTCGCCGA AGGGAATCTT GAGTACGTGG CGGGCCAGCT GGCGATGGTG
GGGCTTTACA ACCGGGGCGC CTGCGGCGGC GCGGTGCGCG GCGTGGTCTG CGGCAGCCTG
GGGGCGGCCG GAGCCCCTGC CTTGGAGGCC TTGGTGCGCA GGATCCACCG GCACTTCACC
GGCAACGCGC GCTTCGAAAA GCTCCCCAAG AAGTTCAAGG TGGGGGTGGA GGCGGACGTC
TCAAGCGGGC GGCACCTGAT CCAGGACCTG GGGCTCGTGC CGGCGGCCTC CGGGGAGCCG
TCGCGCTTCG ACGTCTGGGC GGCCGGCGGC CTTGGGCGCG AGCCGATCCC GGGCTTCCTG
CTGGCGCGGG ACGTGGCCGA GGACGCGTTG ATCCCGCTGA TCGAGAGGGT GGCCCGGGTT
TACCAGGCCA ATACCCCGGC GGGGAAGCGT CTGAAGCACC TGATACGCGA GATCGGGCAG
GACGAGTTCC GGCTGAGGGT GTTGGGAGAC GCCGAGGAAG TTCCCAGCGC GCCGACCCTG
ACCGGGAGCC TCGTCCCCGT GCCGGCGGAC GAGTCGGCGC GCCTGGAGGC ACATGTCTTC
GCAGGCGAGC TTTTCTGCGA AGGTCTCGTC GCGCTGGCGG AGATCGCCCG GGAGTACTGC
GGCGGGATAT TGATGATAAC CGGCGACCAG AACGTGAAGA TGCATCTGTC CGAGGGAGGG
GAGCGCGACA AGGCAGCGGC CGCACTGGCC GCGGCGGGAT TTGCCGGGGA GAGCGCCAGG
GAGAGAGTGA TCTTCCGGGT CTGCCCGGGG ACCCACGAGT GCATCATGGG GCTCTCCGCC
ACCCGCGAGA TCGCGGCGGC GGTGGTGGAG CAGATGGGTG AGGAGGCGCT GGGGCTTACC
TGGGCCATTT CCGGCTGCCC CAACTGCTGC GCGCAGCCCC AGCTCGCGCA GGCGGGGATC
GTCTCCTCCC GCCTGGTGAC CGATCCGGCC GGGCGCGCGC CGCGTTTCGA TTTGTACCGC
TCAGGCACGG GTCCGTTCGC CGAGCCGGTG CAGCAGGGGC TCACCCTGCC GCAGCTTTTG
GCCGAGGTGA AGAAAATCTA A
 
Protein sequence
MTMEKNLRLE GIYPQRQKEF HMQRVKLPAG IISAEQALKV AQLAERFARG VVHLTTRGSI 
ELHWLAEGNL EYVAGQLAMV GLYNRGACGG AVRGVVCGSL GAAGAPALEA LVRRIHRHFT
GNARFEKLPK KFKVGVEADV SSGRHLIQDL GLVPAASGEP SRFDVWAAGG LGREPIPGFL
LARDVAEDAL IPLIERVARV YQANTPAGKR LKHLIREIGQ DEFRLRVLGD AEEVPSAPTL
TGSLVPVPAD ESARLEAHVF AGELFCEGLV ALAEIAREYC GGILMITGDQ NVKMHLSEGG
ERDKAAAALA AAGFAGESAR ERVIFRVCPG THECIMGLSA TREIAAAVVE QMGEEALGLT
WAISGCPNCC AQPQLAQAGI VSSRLVTDPA GRAPRFDLYR SGTGPFAEPV QQGLTLPQLL
AEVKKI