Gene GM21_3158 details

Gene Information

Locus tag: GM21_3158
Symbol:
ID: 8138510
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3667337
End bp: 3669130
Gene Length: 1794 bp
Protein Length: 597 aa
Translation table: 11
GC content: 63%
IMG OID: 644870763
Product: cytochrome C family protein
Protein accession: YP_003022943
Protein GI: 253701754
COG category:
COG ID:
TIGRFAM ID: [TIGR01905] doubled CXXCH domain
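
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
# Values copied from the record above (1-based, inclusive coordinates assumed).
START_BP = 3667337
END_BP = 3669130


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1794 597
```

Both results agree with the Gene Length and Protein Length fields of the record.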


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 118
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAG TCGAATCTGG CATATACGTA CTTCTCGCGC TGGCGGCAGC CTTCATGGCA 
GTCCCCTTTG CGTCTGCCGT CGAGAAGCCG CACAGCAGGG ACATCATCCA GACCCCGCAC
AACCTCTCCA TCACCGGTGG AGGAGGGGCG CACGACATAA AGTCCGGCAC GGAGGCCGAG
GTCTGCATCT TCTGCCACGC CCCGCACCAT GCCTCGACCG TCACCCCGCT TTGGAGCCGG
GAGATATCCC CGCTGACCAT CTACGTCACC TACAAGTCGC CCACCCTCAA GGCGAACCCG
CAGCAGCCGC GGGGGGCCTC GCGCCTGTGC CTCTCCTGCC ACGACGGCAC CATCGCGCTC
GGGCACCTAA CCGGGGACAG GATCCTCGAC GCCTCTTTGC CTGCATTCAA GGACATGCCC
CAGGAGACCG ACCCCCGCAA AAACCCGAAC CTGGGGACCG ACCTCTCCAA CGATCATCCT
ATTTCCTTCT TGTACTCCGA GGCCGGCAAC CTGGAACTGC ACGACGCAAC AGCGGTTCAG
GCCAAGGGAG TCAGGTTGTC CCAGGATCAG TACGTCGAGT GCACCTCCTG CCACGATGCG
CACAACAACC AGTACGGCAA TTTCCTGGTG CAGGACGTGA CCTTGCAGCA GGACGCCCTG
TGCACCACTT GCCACAACAA GCAGGGGTGG AGCGAGCCGG ACAGCACGCA CCGCACCGGC
GGCAGCCGCT ATGACACCGT CACTGCCGGT GTCGCGGCAT CGGGCTGCAT CAACTGCCAT
TTGCCGCACA ACGCGCAAAG AGGCGAGCAC CTGCTGAGAC TTTCGGGGGT GGGGGCCGGA
GAGGAAACCA ACTGCTACAC CTCCTGCCAC CAAAACGTCC CGTACTCGAA CGTATGGAGC
CAGTTCAACA CCTCCCTCTA CACCCATCGT GTCCAGAACT ACAACGGCGT CCATGTCGAC
AACGAGAGTC TGCCGGTGGT CGGCGGCAAA AAGCACGTCG AGTGCACGGA CTGCCACAAC
CCGCACTTCG CCGGGGCCCA GGGTCTGCCG CTTGGCAGTT CCACCCCCCT GGTCCCGCCT
GCCTCCGCCG CGCCCGACAT CAACGGCGCG CTGCGCGGAG TGCGCGGTGT CGACCTCACG
GGCGCCGCGG TGGTTTCCCC TGCGCGTTAT GAGTATGAGG TCTGCTACCG CTGCCATGCC
GGACCCAGTG CCGACCAGTA CACCAGCCTG GCCCAAATGC TCCCCAATCG CCTCTTCAAG
GATTACGACG AGAGCAACCG GTTCAATTCT TCCAATGCGG CATACCATCC GGTGTCGGCG
GATCGCCGTC CGGGTCCCAA CGGCCGCAGT CTGCGCAGCC AGTACCAGAG CACGATGTTC
CGCATCTATT GCAACGACTG CCATGATTCC CACGGCACCA ATGAGCCGCA CATGCTGCGT
TACCTGAACC AGGACACCTT CCCGGCCACG GGAGGCACCA ACTACCCGCT TTGCTTCCGT
TGCCACGACC CCGATTACCT GCTCAACCCG GTGGGGGCTC CTAGCTCGGA TACCGCTGTC
CTGCACCAGA GACACGTACT GGGCCAGCAC CTGAACGGTG ACACGCGGCA AACCCCGTGC
TCCGTCTGCC ACGACCCCCA CGGCGTTCCG GCTACTCGCG GCGCGCTATC CAGCAACGCC
GCGCACTTGG TGAACTTCGA CGTGCGTTAT GCAGGAGAAA CGGCAGTATA CGACGCTGTT
GCCAGGACCT GCGCCGTAAT ATGCCACACC AGCAACCCCA AGTCGTACCC ATAG
 
Protein sequence
MKKVESGIYV LLALAAAFMA VPFASAVEKP HSRDIIQTPH NLSITGGGGA HDIKSGTEAE 
VCIFCHAPHH ASTVTPLWSR EISPLTIYVT YKSPTLKANP QQPRGASRLC LSCHDGTIAL
GHLTGDRILD ASLPAFKDMP QETDPRKNPN LGTDLSNDHP ISFLYSEAGN LELHDATAVQ
AKGVRLSQDQ YVECTSCHDA HNNQYGNFLV QDVTLQQDAL CTTCHNKQGW SEPDSTHRTG
GSRYDTVTAG VAASGCINCH LPHNAQRGEH LLRLSGVGAG EETNCYTSCH QNVPYSNVWS
QFNTSLYTHR VQNYNGVHVD NESLPVVGGK KHVECTDCHN PHFAGAQGLP LGSSTPLVPP
ASAAPDINGA LRGVRGVDLT GAAVVSPARY EYEVCYRCHA GPSADQYTSL AQMLPNRLFK
DYDESNRFNS SNAAYHPVSA DRRPGPNGRS LRSQYQSTMF RIYCNDCHDS HGTNEPHMLR
YLNQDTFPAT GGTNYPLCFR CHDPDYLLNP VGAPSSDTAV LHQRHVLGQH LNGDTRQTPC
SVCHDPHGVP ATRGALSSNA AHLVNFDVRY AGETAVYDAV ARTCAVICHT SNPKSYP
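
The record annotates this protein with a CXXCH-containing TIGRFAM domain. As a rough illustration only, the sketch below scans a protein string for the literal heme-binding pattern C-X-X-C-H using a regular expression. This is a plain pattern match and not the TIGRFAM model itself; the function name `find_cxxch` is hypothetical, and the example input is the first two lines of the protein sequence above.

```python
import re

# Lookahead so that overlapping occurrences would also be reported.
CXXCH = re.compile(r"(?=(C..CH))")


def find_cxxch(protein: str) -> list[tuple[int, str]]:
    """Return (1-based position, motif) for every C-X-X-C-H match.

    Whitespace is removed first, so sequences copied in blocks of ten
    residues (as in the record above) can be passed in unchanged.
    """
    seq = "".join(protein.split()).upper()
    return [(m.start() + 1, m.group(1)) for m in CXXCH.finditer(seq)]


# First two lines of the protein sequence from the record.
excerpt = """
MKKVESGIYV LLALAAAFMA VPFASAVEKP HSRDIIQTPH NLSITGGGGA HDIKSGTEAE
VCIFCHAPHH ASTVTPLWSR EISPLTIYVT YKSPTLKANP QQPRGASRLC LSCHDGTIAL
"""

for position, motif in find_cxxch(excerpt):
    print(position, motif)
```

Passing the full protein sequence instead of the excerpt lists every candidate motif in the protein; confirming that they form the annotated domain would still require a proper profile search.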