Gene GM21_2216 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2216 
Symbol 
ID8137552 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2584980 
End bp2586851 
Gene Length1872 bp 
Protein Length623 aa 
Translation table11 
GC content59% 
IMG OID644869829 
Productcytochrome c family protein 
Protein accessionYP_003022024 
Protein GI253700835 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.000000000696852 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAACGC TGCTCACGAT GTTGATGTTG CTGTTGACGG CGCAGGCTTG GGCATTCGAT 
GCCAAATCGC AGTGCGTAAT CTGCCATGGA GACAAGGGTA AGATGGAGTC TCTTGGCGCC
GCGTCGATGT ATCTGGATCC GGCCCAGGTT GATCGTGAAG TGGGTATGGA CGGCGCCACT
TGTGTCGATT GTCACCTGGG GGACCCTTCC CAGCCGTCGA AGGAAGCATC CCACAAGGAC
ATGCTGCGTC CTTTCGTGGT CGGGGTGGGG CCAAAGGTCA AGGGGCAGGC GGTTTCCCGC
GCCGATGCAG GGGCTCTGAA ACCCATCGTT CCTGACGGAG ACGGCATGGA CCGGATGCTA
CCGGAGGGTG ACCCCAAGAA GCTGGAGGGC CTCGGGGTAA AGATACTGAC GGGAATCGAG
TGGCACGACC GCGATCCCGA AACGCTCGCG TATGCTCCCA AGGTTGCCGA GCAGACCTGC
GGCAGGTGTC ATGCAAAAGA GGTGAAGGAT TACAACAGTT CCGCCAAGGG GCTGTTGAAA
CATCAGCGCG CCTATCGGAA GTGGGCCGAG ACCCTTCCGG GACCGCAGAA CTGCGGCATG
TGGTTCGGGC AGAACTACGA GAATCTGAAG AGCCAGACTT CGGTACCTTT CAGCGCCGCA
CAAAATGCTG CCACGGATCG CAGCTGCAAC ACGTGCCATC CCGGTTGTAA TGACTGCCAC
TACAAGCCCT TCACAGGCAA GGGGCGGCAC TCATACGGCA AACCGGACAC CGATAGCTGC
TACGGCGGAG GGAGAGCGAG CATCTGTCAT GCGGGACCCA TGGACCGGAG GCGCGGCGCG
GGATACGTCC GCGGCGAATA CGCCTTTCCG AGCAACCTGC CGCGAGGCGC CCACGTAAAG
GCCGGCGTAC AGTGCCTTGA TTGCCACAAG CCTGTGAACC ATCAATTCGG CCATCTCGCC
GCCGACGACG CGAGGGGGGC TTGCGCCAAG TGCCACGCCG ACATCGTGAA AGCCGTGCAG
ACTTCGGCGC ACAAAAAGGT TGATTGCGGC GCCTGCCACA TAACCGTCTC CGGCGCCTAC
CAGTACACCT TCTGGGGTAA GGGAAACTTT GCCGGCGTGG AAACGCCTTA CGGAAAACAT
AAGGAATACT ACGGCATTCG CGACCTCCCG ACCCTGATCA AGAACGCCTC CGGCCGCTGG
ATTCCGGTTA AGCCGTATCC TATGGCGGTG CTGAACCAGA CAATGGAAGT AGGCCCCACC
GGGCTTCTGT TCCGCTCGAT CCCAAAGAGA AGCGTTCCAG GCAACGTCAG GATAGGTGAG
CCTCCCGCAT TCGAAGTCTC CCGTGCCGCC ACCGATGTCA ATGACGCCTT CATCATCGTC
GGCACCCGTA ACGATCTACC CTCCGGCAAC AAGGCGATCC TTTGGGTGCA GATGGACAAG
CTAAGCCATG CCCTGGGTAA GCCGAGAGGA TGCGCGACCT GTCATGACTC CCACGCGCAG
GTCGGAAAGT CCGAGTGGAG CTATTTCGAA TCAAAGGACG TAACCAAACG GTTCAAAGGG
AGCTACACGG TGACGGCGGA CAAGAACGGG ATCAGGTTCA GCGACGCAGT GTGGGAGACC
CCGATCATGG CAGCTAATCG GAAGGTCGAG GACATAGCGC CGTTTGCCGT GCTGCCCAAA
GACGCCTGGG ATGTGAAGCG GATAAACCTC TCCATCCCCT TCGACGAGAA GAAGACGGGG
AAGGAGAGGG GAGAGCTCGA CAAGTTCCTG GCCGAGCTTG GCAAGCGGAA GGGAGGCGAT
GAACTGCGAA AGATAAGGGT GATCGCCTAC CATAACCTTG CCATGGCGAA AAAGATGCTG
AAGGCACTTT AG
 
Protein sequence
MKTLLTMLML LLTAQAWAFD AKSQCVICHG DKGKMESLGA ASMYLDPAQV DREVGMDGAT 
CVDCHLGDPS QPSKEASHKD MLRPFVVGVG PKVKGQAVSR ADAGALKPIV PDGDGMDRML
PEGDPKKLEG LGVKILTGIE WHDRDPETLA YAPKVAEQTC GRCHAKEVKD YNSSAKGLLK
HQRAYRKWAE TLPGPQNCGM WFGQNYENLK SQTSVPFSAA QNAATDRSCN TCHPGCNDCH
YKPFTGKGRH SYGKPDTDSC YGGGRASICH AGPMDRRRGA GYVRGEYAFP SNLPRGAHVK
AGVQCLDCHK PVNHQFGHLA ADDARGACAK CHADIVKAVQ TSAHKKVDCG ACHITVSGAY
QYTFWGKGNF AGVETPYGKH KEYYGIRDLP TLIKNASGRW IPVKPYPMAV LNQTMEVGPT
GLLFRSIPKR SVPGNVRIGE PPAFEVSRAA TDVNDAFIIV GTRNDLPSGN KAILWVQMDK
LSHALGKPRG CATCHDSHAQ VGKSEWSYFE SKDVTKRFKG SYTVTADKNG IRFSDAVWET
PIMAANRKVE DIAPFAVLPK DAWDVKRINL SIPFDEKKTG KERGELDKFL AELGKRKGGD
ELRKIRVIAY HNLAMAKKML KAL