Gene GM21_0019 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0019 
Symbol 
ID8135318 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp26549 
End bp27586 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content62% 
IMG OID644867636 
ProductCytochrome-c peroxidase 
Protein accessionYP_003019864 
Protein GI253698675 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1858] Cytochrome c peroxidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones85 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGCGA AATGGTTCAT ACTGCTCTTG CCGCTGCTTC CCGGATGCGT CGATGCCACC 
GCCAAGGAAA CCATGTCCAA GGCCCAGGCC ACCTTCAAGC CCATCCCGGC GCAGGCACCA
GCCATCAAGG GGAACGAGGC GACCAAGGCC AAGGTCGACT TGGGCAAGAA ACTCTTCTTC
GATCCCCGTC TCTCCACCTC GCAACTCATC AGCTGCAACA CCTGCCATGA CGTAGGCCTC
GGGGGCGCCG ACCTCCAGGA AACCTCCGTC GGCCATGGGT GGCAGAGAGG CCCCCGCAAC
GCCCCTACCG TTTTCAACGC CGTCTACAAC GTCGCCCAGT TCTGGGACGG CAGGGCCAAG
GACCTGCAGA CCCAGGCCAA GGGGCCGGTG CAGGCGTCCG TGGAGATGAA CAGCAACCCC
GAGCTGGTGG TGAGAACCCT GAAGAGCATC CCGGGATACC CAGCTCTCTT CGAGGCGGCC
TTCCCCGGGT ACAGCGATCC AGTCACCTTC GACAACATGG CGAAGGCGAT TGAGGTGTTC
GAGGCGACAC TGGTGACTCC GGATGCCCCG TTCGACCGCT TCCTCAACGG AGAGGCCAGC
GCTCTAAGCG CGCGGGAACA GGCTGGTTTG GGTGTCTTCA TGGAGAAGGG TTGCGCCGCC
TGCCACGGGG GAATCAACAT CGGCGGTGCC GCCTACTACC CCTTCGGCGT CCGTGAGGTC
CCGGCTGCAG AGATCCGCCC CGAGAGCGAC ACGGGTCGTT TCAAGGTGAC CAATACCGCC
AGCGACAAGT ACGTTTTCCG GGCGCCGTCG CTCAGGAACG TCGCGATCAC CCAGCCTTAT
TTCCATTCCG GAAAGGTGTG GAGCCTCAGG GAGTCGGTGG TGGTGATGGG GTCCGCGCAA
CTGGGAATGA AACTGAACGA GACGGAAGTG AACGACACGG TCGCATTCAT GAAGAGCCTG
ACGGGAAGAC AGCCGAATAT GGATTACCCC CTGCTTCCGC CGAGTTCGGA CCAGACCCCG
CATCCGCAGC TAAAGTGA
 
Protein sequence
MKAKWFILLL PLLPGCVDAT AKETMSKAQA TFKPIPAQAP AIKGNEATKA KVDLGKKLFF 
DPRLSTSQLI SCNTCHDVGL GGADLQETSV GHGWQRGPRN APTVFNAVYN VAQFWDGRAK
DLQTQAKGPV QASVEMNSNP ELVVRTLKSI PGYPALFEAA FPGYSDPVTF DNMAKAIEVF
EATLVTPDAP FDRFLNGEAS ALSAREQAGL GVFMEKGCAA CHGGINIGGA AYYPFGVREV
PAAEIRPESD TGRFKVTNTA SDKYVFRAPS LRNVAITQPY FHSGKVWSLR ESVVVMGSAQ
LGMKLNETEV NDTVAFMKSL TGRQPNMDYP LLPPSSDQTP HPQLK