Gene GSU2203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU2203 
Symbol 
ID2687015 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp2417245 
End bp2418336 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content62% 
IMG OID637126896 
Productcytochrome c family protein 
Protein accessionNP_953252 
Protein GI39997301 
COG category 
COG ID 
TIGRFAM ID[TIGR01905] doubled CXXCH domain
[TIGR03508] decaheme c-type cytochrome, DmsE family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAAACA AGCTGTATGG CGTGAAAGCG CTAAAAATGG CCGGAATGGT CGCGATTTCG 
CTGCTTGTCG GTGCCTGCGC CACCGGGAAG ATCAGGGAAC GGGTCCTGAC ATTGCCGACC
ATCGAAGGCG CTTCGTACGT CGGCGACACA GACTGTGCCC AGTGCCATGA CAAGATCGCG
GCACCCTTCG TCCGGAGTAT TCACGGCCGG ATCGCCGACT TCGAAGTGAT GGGGGGAACC
AGGGGATGCG AGTCATGCCA CGGTGCCGCC AGCCTTCACA CCACGGAAGG AGATTCCGCC
AAGATTCTTT CGTTCGCAAG CCTGTCGTCG GACCAGGCAT CTGCCGTCTG CCTCAAATGC
CACTCGGCCG GCACCCACAT GGAGTGGGCA GGCAACGAAC ATGCCCTCAA CGATGTGGCC
TGCACCGACT GTCATAAAAT CCACCAGGAC AAGACCGTTG CCCCGCACAG CCTGAAAATG
GCACAGCCCG AGCTCTGCTA CTCCTGCCAT CAGGAGTATC GCGCAAAGGC CAACTTCCCT
TCCCACCATC CCATCCGAGA GGGGAAAATG ACCTGCACGA GCTGCCACGA AGCCCACGGT
TCCGGCCAGA AGAACCTGAA GACCGAGGAG CGGGTGAACG ACCTCTGCCT TAACTGCCAC
AGCCGCTACC AGGGGCCCTT CGTCTTCGAG CATGCGCCGG TCCAGGAAGA CTGCACCATC
TGTCATGACG CACACGGCAC CGTAGCCAAC AACCTGCTTC GCCAGAGCGA GCCCTTCCTC
TGCCTCCAGT GCCACGAGAG CCACTTCCAC ATCACCCGCG AGGGAGCCAC GATCCCGGCG
GGCAATGTCG CCCTTGACGC CAACAAGGAC GGGGGCACCA CCTACCCTGC CGGCCAGACG
ATCAGCTCAT CCAACAGCGT AACCATGCCG AACAATCTCG GCGCTGAAGG ATGGCGGATG
GCCTTCGGCA CCAAGTGCAG CGTATGCCAC ACCCAGGTGC ACGGCAGCGA CCTTCCCTCC
CAGACCGCAC CCACGGTCAA CAGCACCGGC ACCAGGGGAT GGCCGTCCGG CGGCAAGGGA
TTGACCCGCT AA
 
Protein sequence
MRNKLYGVKA LKMAGMVAIS LLVGACATGK IRERVLTLPT IEGASYVGDT DCAQCHDKIA 
APFVRSIHGR IADFEVMGGT RGCESCHGAA SLHTTEGDSA KILSFASLSS DQASAVCLKC
HSAGTHMEWA GNEHALNDVA CTDCHKIHQD KTVAPHSLKM AQPELCYSCH QEYRAKANFP
SHHPIREGKM TCTSCHEAHG SGQKNLKTEE RVNDLCLNCH SRYQGPFVFE HAPVQEDCTI
CHDAHGTVAN NLLRQSEPFL CLQCHESHFH ITREGATIPA GNVALDANKD GGTTYPAGQT
ISSSNSVTMP NNLGAEGWRM AFGTKCSVCH TQVHGSDLPS QTAPTVNSTG TRGWPSGGKG
LTR