Gene GM21_3443 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3443 
Symbol 
ID8138810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3979161 
End bp3980702 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content62% 
IMG OID644871059 
Productcytochrome c peroxidase 
Protein accessionYP_003023224 
Protein GI253702035 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1858] Cytochrome c peroxidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value8.41254e-25 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGAAAAG ATGCCGCGCT CGTTTCCTCA CTTTTTCTTG CAGCAGCCCT GATCATCCCG 
GCGACCAAGT CGTGGACGCA GCCACAGGGA GAGACCGCCA GGAAGGCGCC CTATGACGCG
GTCCAGGATG CGCAGACGCC CAGGGAGGTC GCAAGGACCC GCCCCGAGCA GCAAAAGGAC
CAGCACGACA ACTCGGACCT GTTCACGGCC ACGAAGGCGG AACCCTCCTC GACGGCTTTT
AAGAACCAGC CCGACGAGGG AAAGATCCTC GGTTTCGATT TCTACCGCGA TCCCCTCAAC
GCCAAGAAAC CGATGACAAC CTTTCAGGAG GTCTACCAGA AGGACGTGGC CGAGAAGCCG
AAGGTGATGG CGACCCAAAG GCGCCTTCTG GAGATGAGGT ACAACCTGAA GCCGAACCTC
TCACCTGACG TGAAGATGAC CCGGGGCAAG CCTATCGCGG TAGGACCGAC CGCGCTCCTT
GCCCAAGGAA CTACCTGGGA GAAGTTGTCG GCGATGCCGC CCGAGCAGAT CAGGTCGGGG
AACCTCTTCC CTTACCCCCC TCTGCCCCAT CCCAAGCAGG TCAACGGCGG GCAGGTCTTC
CCCCAGATCC AGATCGACAT GTTCCCGAGG CTGCAGCGCT TCGACGTCGA TTTCGACCTA
CCCGATGCCT TCCTCCCCGA GTTCCCCCCC GCCATCTTCC TGCAGAACCG CCCAGAACTG
GGAGACGTTT CCCGCGGCGA GGTGGTCAGC ATCAACAACT TCTACCGCCT CTTCAAGGAC
CTCCTCACCC CGGTGCAGTT GGACGGCTTG CGGATGCTGG TGACCCCCTT CCCGCAGGAA
GAGTTCAACC CCACCGACGA CCGCAAATCC CCGCAGGCCA GCCTCGGGGT CGCCTGCCTC
GACTGCCACG TCAACGGGCA CACCACCGCT CAGTTCCACC TGAGCCCCGA CATCCGTCCC
CAGGAGCGCC GCTTCCGGCT CGACACGACC AGCCTGAGGG GGCTATATAA CCAGCAGATC
CACTCCTCCA AGCGCAGCCT GCGCTCGGTC GAGGATTTTA CCGAATTCGA GCAGCGCACC
GCCTACTTCA ACGGGGACGA AATCCACGCC GCCAAAAAAG GGATGAACAT CCTGAGCCGG
GTCCAGGTCA GCCACATGGC CCAGATGCAG AACATGTTCG ACGTACCTCC CGCACCCAAG
CTCGACCCTG CCGGTTACCT GGCCCCCATG AAGGCCACCC CGGCGGAAAT AGCGGGTCAG
AAGATCTTCT TCGGGAAGGG TAGATGCGGC ACCTGCCACC CCGCCCCGTT CTACCTGGAT
CACCAGATGC ACGACCTGCA GATGGAGCGC TTCACCCGCG AGCCGGGCGA CGGCCCCATC
AAAACCTTCA CCCTAAGGGG GATCAAGGAA AGCCCCCCGT ACATGCATGA CGGTCGTTGC
CTCACCCTGG AGGACACGGT GAAGTTCTTC AACCTGGTGC TCGGGCTCAA ACTTTCCGCG
GAGGAGGAGA CCAACCTGGT CGCCTTCCTG CGGGTGCTCT AG
 
Protein sequence
MRKDAALVSS LFLAAALIIP ATKSWTQPQG ETARKAPYDA VQDAQTPREV ARTRPEQQKD 
QHDNSDLFTA TKAEPSSTAF KNQPDEGKIL GFDFYRDPLN AKKPMTTFQE VYQKDVAEKP
KVMATQRRLL EMRYNLKPNL SPDVKMTRGK PIAVGPTALL AQGTTWEKLS AMPPEQIRSG
NLFPYPPLPH PKQVNGGQVF PQIQIDMFPR LQRFDVDFDL PDAFLPEFPP AIFLQNRPEL
GDVSRGEVVS INNFYRLFKD LLTPVQLDGL RMLVTPFPQE EFNPTDDRKS PQASLGVACL
DCHVNGHTTA QFHLSPDIRP QERRFRLDTT SLRGLYNQQI HSSKRSLRSV EDFTEFEQRT
AYFNGDEIHA AKKGMNILSR VQVSHMAQMQ NMFDVPPAPK LDPAGYLAPM KATPAEIAGQ
KIFFGKGRCG TCHPAPFYLD HQMHDLQMER FTREPGDGPI KTFTLRGIKE SPPYMHDGRC
LTLEDTVKFF NLVLGLKLSA EEETNLVAFL RVL