Gene GM21_3108 details

Gene Information

Locus tag: GM21_3108
Symbol:
ID: 8138458
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3603447
End bp: 3605108
Gene Length: 1662 bp
Protein Length: 553 aa
Translation table: 11
GC content: 65%
IMG OID: 644870712
Product: cytochrome c family protein
Protein accession: YP_003022894
Protein GI: 253701705
COG category:
COG ID:
TIGRFAM ID: [TIGR01905] doubled CXXCH domain
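
The start, end, and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes 1-based inclusive coordinates and a gene length that includes one untranslated stop codon, and the function names are hypothetical rather than part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming one trailing stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3603447, 3605108)
print(length)                  # 1662
print(protein_length(length))  # 553
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.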


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 149
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGAAAA TGAAGCAAGG GGTCTTGTTG CTGGTCTTTG GGGCGCTCCT CGTCTCAGGG 
GGGGGGGAGG CTGCCCACGG CGCCAACAGC GTGCTCGCCA CCAAACACAA CCTCTCGGTC
TCCGGCCCCG GCGAGATCAA GGCGGTCAGC GAAGAGCGCG TCTGCATCTT CTGCCACGCC
CCGCATCACG CCAACCCCTC CACGCCGCTT TGGAGCCGCG ACACCAGCTC ATCGGAGTAC
GACCTGTACG ATTCCACCAC GCTCGTGGCG AAGCCCGGGC AGCCCAGCGG CTCGGCGCGC
CTTTGCCTTA GCTGCCACGA CGGGACCATC GCGCTCGGGG CCCTTTTCGG CAGCGACCGG
AACGTCAACG CCATCAGCAT GGCGGGAGGG TTCGTGAAAC TCCCGGCGGG GAGGAGCAGC
AACCTCGCCG GGACGACGGG AAGGGACCTC GCCAACGATC ACCCGATCTC CTTTCCCTAC
ACCAACGAGC TGGCGCAGCT AAACGGCCAG TTGAACCTTC CCGGTTCGCT CCCGTCCCAG
GTCCGGCTGG AACAGGGGAA CACGCTCCAG TGCACCGCCT GCCACAATCC GCACAAGAAC
CCTTACGGCA AGTTCCTGGT GGTCGACAAC TCCCGTTCGC AGCTTTGCAT CTTCTGCCAC
AACATCGCCG GATACGCATC CTCGCGGCAC GCGACCACGG CGAGCCTTAC CGCGGGTTGC
AACCTCTGCC ATGCCACCCA CAACGCGGGG GGGAAAAAGC GGCTTCTGGG ACATGCCGCC
GAGGAGGAGA ACTGCTACCA GTGCCATAGC GACCAGGGGG GGGCGAAGGA CGTGAGGGTC
CCCGCCGCCA AGTTCTACAG CCACCCCATG AGCGCGACTA CAGGGGTGCA CGATCCCAAG
GAAGATCCGC TCACCGCCGA AAAGCACGTG GAGTGCTCCG ACTGCCACAA CCCGCACAAG
GTTGTCGCGA CCGCCGCCTC CGCGCCTGTG GCCTCAGGGG TTAATGCAGG CGTGGGCGGG
GTCTCCATCT CGGGGGAGGT GCTCCCGGGC GACGCCACCT ACCAGTACCA GATCTGCTTT
CGCTGCCATG CCGAAAACAA TTTCAGCGGG AGCCAGACCG TGATCAGGCA GATCCAGGAC
GTCAACACCA GGCTCGACTT CGACCCGGCC AACCCTTCCT ACCACCCCGT GGCGGCGATA
GGGAAGGGGA ACAGCGTCCC AAGCCTGCGC ACCAATTACA GCACGGCCAG CATGATCTAC
TGCACCGACT GCCACGGCAA CGACGACGCC ACCCAGGCCC GCGGCCCCCA CGGTTCCAAC
CTGAAGCACA TCCTCGTCGC ACGCTACGAG AGCGACACCT ACCCGCTCAC CTACAACGAG
GAGAGTTACG CCCTTTGCTA CCGCTGCCAC GACCAGCTCG TGCTCCTGGA TCCCTTGAGG
TCGTCCTTCG CGCCGCATGC ACGGCACGTG GTCGACAACA GGGTCCCCTG CTCGGTCTGC
CACGATCCGC ACGGCGTCTC CGCGACCCGG GGCGCAGGCA CCACCGCCAA CGCCCACCTG
ATCAACTTCG ACATCCGCTT CGTCGCCGCC GGGGGGAGCT ACAACTCCGT GGGGAGGAGC
TGCACGGTGA GCTGCCACGC CGTGAACCCC CGCCTCTACT GA
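
A GC content figure like the one in the record can be recomputed from a sequence printed in 10-base groups. This is a minimal sketch, assuming the value is simply the fraction of G and C bases over the whole sequence; `gc_content` is a hypothetical helper, not a function of any particular library.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace between base groups."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence above, used only as a small demo input.
first_line = "ATGTCGAAAA TGAAGCAAGG GGTCTTGTTG CTGGTCTTTG GGGCGCTCCT CGTCTCAGGG"
print(f"{gc_content(first_line):.0%}")
```

Passing the full gene sequence instead of the first line gives the value for the whole gene.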
 
Protein sequence
MSKMKQGVLL LVFGALLVSG GGEAAHGANS VLATKHNLSV SGPGEIKAVS EERVCIFCHA 
PHHANPSTPL WSRDTSSSEY DLYDSTTLVA KPGQPSGSAR LCLSCHDGTI ALGALFGSDR
NVNAISMAGG FVKLPAGRSS NLAGTTGRDL ANDHPISFPY TNELAQLNGQ LNLPGSLPSQ
VRLEQGNTLQ CTACHNPHKN PYGKFLVVDN SRSQLCIFCH NIAGYASSRH ATTASLTAGC
NLCHATHNAG GKKRLLGHAA EEENCYQCHS DQGGAKDVRV PAAKFYSHPM SATTGVHDPK
EDPLTAEKHV ECSDCHNPHK VVATAASAPV ASGVNAGVGG VSISGEVLPG DATYQYQICF
RCHAENNFSG SQTVIRQIQD VNTRLDFDPA NPSYHPVAAI GKGNSVPSLR TNYSTASMIY
CTDCHGNDDA TQARGPHGSN LKHILVARYE SDTYPLTYNE ESYALCYRCH DQLVLLDPLR
SSFAPHARHV VDNRVPCSVC HDPHGVSATR GAGTTANAHL INFDIRFVAA GGSYNSVGRS
CTVSCHAVNP RLY
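
The TIGRFAM annotation refers to CXXCH, the heme-binding motif of c-type cytochromes (Cys, any two residues, Cys, His). The following sketch shows one way to locate such motifs in a protein sequence laid out as above. The helper name `find_cxxch` is hypothetical, and a plain pattern match is only a rough stand-in for the profile-based TIGRFAM model.

```python
import re

# C, any two residues, C, H. The lookahead lets overlapping matches be found.
CXXCH = re.compile(r"(?=(C..CH))")


def find_cxxch(protein: str) -> list[tuple[int, str]]:
    """Return (1-based position, motif) pairs for every CXXCH occurrence."""
    residues = "".join(protein.split()).upper()
    return [(m.start() + 1, m.group(1)) for m in CXXCH.finditer(residues)]


# First line of the protein sequence above.
first_line = "MSKMKQGVLL LVFGALLVSG GGEAAHGANS VLATKHNLSV SGPGEIKAVS EERVCIFCHA"
print(find_cxxch(first_line))  # [(55, 'CIFCH')]
```

Running the same function on the full protein sequence lists every candidate site with its residue position.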