Gene GM21_3163 details


Gene Information

Locus tag: GM21_3163
Symbol:
ID: 8138515
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3674220
End bp: 3675548
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 62%
IMG OID: 644870768
Product: cytochrome C family protein
Protein accession: YP_003022948
Protein GI: 253701759
COG category:
COG ID:
TIGRFAM ID: [TIGR01905] doubled CXXCH domain
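
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3674220
END_BP = 3675548
GENE_LENGTH_BP = 1329
PROTEIN_LENGTH_AA = 442


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming an unspliced CDS with one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1329
print(expected_protein_length(GENE_LENGTH_BP))  # 442
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields listed in the record.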


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 121
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAAAGT TCGGCTTAAT GACTATCTTT TCCCTGAGCG CGCTGCTTAC GGCATGGTCG 
CCGGCTTTCG CCGACAGTTG CCTTACCGCA GGGTGCCACC AGCCGATATC GGGAATGAAA
GCGCAGCATG ACCCGGTCAA GGGAGGCGAC TGTCTTTCCT GCCATGTCAG CCAATCGGCG
AACCACCCCA CCCCCGGGGC GAAAGGGTTC AAGCTCACCG CCTCCGGCGC CGCCCTCTGC
AGCCAGTGCC ACACCCCTTA CGGGAAGAAG AAGACAGTGC ATGCGCCGGT CAAGGAGGGT
GAGTGCACCG CCTGCCACAA CCCGCATGGC GCCGACGGCC GCTACCTCAT CAACGCTAGC
GACGACCAGA CCGGGCTCTG CATGGCATGC CACGATTCCG CCATGATCAA GCACAAATAC
ATGCACGGCC CCGTAGCGGT GGGCGCCTGC ACCAAATGCC ACGATCCGCA CGAGTCCAAC
GGCAAGGGCC TGATCAAAGG GAGCGTGCGC GAAAGCTGCC TCGGCTGCCA TGCCGACTTT
GCCGCCTCCT TCCAGACCGC ACAGGTGGTG CACCCGCCGG TCAAAAACGA TCCCTGCACC
CTTTGCCACG ATCCTCATGG CTCTGCCGTT CCCTTCATGC TCAGCAAGAA GATGCCGGAT
CTCTGCATCG GCTGCCACAG CGGGCTGGCG AAGAAACTCA CCGCCAAGGT CACCCACAAG
CCGCTGCTGC AGGAGGCCGG GTGCGGCAGT TGCCACTCCG CCCACTTCGC CAAGGCCAAG
GGGCTTCTCC CCTTCGACGA GGTGACTACC TGCCTCTCCT GTCACGACAA GGACAACCTC
GGCAAGCCGG CTCTGCGCAA CATCAAGAAG GAGATGGCCG GCAAGAAGTA CCTGCACGGT
CCGGTCGCAA AGGGGGAGTG CAAGGCCTGC CACGACCCGC ACGGCTCCGA CAACTTCCGT
CTGCTCAAAG GGGCATACCC GTCGACACTG TACGTGCCGT ACCAGGAAGG GATCTACGAT
GCCTGCCTCA ACTGCCACGA GAAGAACCTG CTCCGTTTCG CGGACACCAC GATCTATACC
AACTTCAGGA ACGGGAACCG GAACCTCCAC TACGTCCATG TGGTTAACAA CCGCAAAGGG
CGTAGCTGCC GCATCTGCCA CGACGTCCAC GCAAGCGACG GCCAGAAGTT GATCACCAAG
ACCGGGGCCA AGTTCGGAGA CTGGAAGATT CCGACCAACT TCAAAATGAC CGAAACCGGA
GGGAGTTGCG CTCCGGGATG CCACCGCGAA CTCTCGTACG ACCGCAAGAG CGCGGTGTCT
TACAAGTGA
 
Protein sequence
MRKFGLMTIF SLSALLTAWS PAFADSCLTA GCHQPISGMK AQHDPVKGGD CLSCHVSQSA 
NHPTPGAKGF KLTASGAALC SQCHTPYGKK KTVHAPVKEG ECTACHNPHG ADGRYLINAS
DDQTGLCMAC HDSAMIKHKY MHGPVAVGAC TKCHDPHESN GKGLIKGSVR ESCLGCHADF
AASFQTAQVV HPPVKNDPCT LCHDPHGSAV PFMLSKKMPD LCIGCHSGLA KKLTAKVTHK
PLLQEAGCGS CHSAHFAKAK GLLPFDEVTT CLSCHDKDNL GKPALRNIKK EMAGKKYLHG
PVAKGECKAC HDPHGSDNFR LLKGAYPSTL YVPYQEGIYD ACLNCHEKNL LRFADTTIYT
NFRNGNRNLH YVHVVNNRKG RSCRICHDVH ASDGQKLITK TGAKFGDWKI PTNFKMTETG
GSCAPGCHRE LSYDRKSAVS YK
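
The TIGRFAM annotation for this gene refers to a doubled CXXCH domain, and CXXCH is the classic heme c binding motif of c-type cytochromes. As a rough illustration, the sketch below scans the protein sequence above for C-x-x-C-H patterns with a regular expression. It is a plain pattern match, not a domain prediction, and the helper names `clean_sequence` and `find_cxxch` are hypothetical.

```python
import re

# Protein sequence copied from this record, kept in its blocks of ten residues.
PROTEIN_BLOCKS = """
MRKFGLMTIF SLSALLTAWS PAFADSCLTA GCHQPISGMK AQHDPVKGGD CLSCHVSQSA
NHPTPGAKGF KLTASGAALC SQCHTPYGKK KTVHAPVKEG ECTACHNPHG ADGRYLINAS
DDQTGLCMAC HDSAMIKHKY MHGPVAVGAC TKCHDPHESN GKGLIKGSVR ESCLGCHADF
AASFQTAQVV HPPVKNDPCT LCHDPHGSAV PFMLSKKMPD LCIGCHSGLA KKLTAKVTHK
PLLQEAGCGS CHSAHFAKAK GLLPFDEVTT CLSCHDKDNL GKPALRNIKK EMAGKKYLHG
PVAKGECKAC HDPHGSDNFR LLKGAYPSTL YVPYQEGIYD ACLNCHEKNL LRFADTTIYT
NFRNGNRNLH YVHVVNNRKG RSCRICHDVH ASDGQKLITK TGAKFGDWKI PTNFKMTETG
GSCAPGCHRE LSYDRKSAVS YK
"""


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display."""
    return "".join(text.split())


def find_cxxch(protein: str) -> list[tuple[int, str]]:
    """Return (1-based start position, matched residues) for each C-x-x-C-H hit."""
    # The lookahead lets overlapping hits be reported as well.
    pattern = re.compile(r"(?=(C..CH))")
    return [(m.start() + 1, m.group(1)) for m in pattern.finditer(protein)]


protein = clean_sequence(PROTEIN_BLOCKS)
motifs = find_cxxch(protein)

print(len(protein))  # 442, matching the Protein Length field
print(len(motifs))   # 13
for position, residues in motifs:
    print(position, residues)
```

The strict five-residue pattern does not match spacing variants such as C-x-x-x-C-H, so the count reflects only the pattern as written here.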