Gene GM21_1291 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1291 
Symbol 
ID8136618 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1511356 
End bp1512819 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content61% 
IMG OID644868905 
ProductDi-heme cytochrome c peroxidase 
Protein accessionYP_003021109 
Protein GI253699920 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1858] Cytochrome c peroxidase 
TIGRFAM ID[TIGR02953] pentapeptide MXKDX repeat protein 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value1.5340600000000002e-32 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATTTCAA AACTCGTACT GACGACGCTG TCTGTAGTGT TGGTAGCTGT ACCGGTAGCG 
GCGGCGGAAC TTACCCCGCT GGAAAGCTTG GGCAAGAATC TCTTTTTCGA CCAGTCTCTT
TCCAATCCTC CAGGTCAGGC CTGTGTAGAT TGCCATAGCC CTGAAACTGG GTGGACCGGA
CCGGATTCGG AGATCAACGC CGCAGGTGCG GCTATTCCTG GAGCGGTCCA CACGAGGGCG
GGCAACAGGA AACCTCCGAC CGCGGCCTAT GCCGGATACA ACCCGGTACT GCACAAAGCC
GGCTCCATGG GCGGCGGTGG CATGGGCGGT GGTGGCATGG GCGGTGGTGG CATGGGCGGC
GGCGGCATGG GCGGAGGCGG TATGGGCGGA GGCGGTATGG GCGGTGGCGG CATGGGCGGT
GGCGGCATGG GCGGTGGCGG CATGGGTGGT GGCGGCATGG GTGGCAACAT GCAGGATGTA
TTCGTGGGCG GGATGTTCTG GGATGGCCGG GGCACGGGTT GGGAGATGGG GGATCCGCTT
GCCGAACAGG CGATGGGTCC TTTCCTGAAC CCGCTGGAAC AGAACAATCC CAACGCGAAG
CATGTCTGTC TCAACGTTCT GAGGACCGGG TACGCGACGC AGTTCGAAGA GGTTTGGGGA
GCAGGTTCCC TGGACTGCGT GAAGGACGTT GACGGCACCT ACCAGCGCAT TGGCCGCTCT
ATCGCTGCCT ACGAGCGTTC CGCCGAGGTT AGCGCGTTCA ACTCCAAGTT CGATACCTTC
TGGAAGAACT CGGAGGGGAA GATGCCGCCG GTTCCCATGA TCAACATGAT GAACTGGACC
CGGTTCAAAA AACGCGGCCT GACGGACATG GAGCTGCAGG GTCTGATGAT CTTCAACACC
AAGGGGAAGT GCTCCACCTG TCACTTCTTG CAACCGATGA ACGGAAGCCG GTTCCCGCTT
TTCACCGATT TCAGGTACCA CAACCTGGGC GTGCCCGCCA ATCCTGAGAA CCCGTACTAC
GACATGCCGC GCCAGTGGAA CCCGAAAGGG GAGAACTGGG TCGATCAGGG GCTGGGCGGA
TTTTTGGCCA AGACGGCGGT GATGACGGAC AGCGCCGGCG TTTCCATGGA CTACAGCGCG
CTGGCGGCTC AGAACATGGG CAAACAGAGG ACGCCGACGC TGCGCAACGT GGACAAGCGT
CCGGGACCGG ACTTCGTCAA GGCTTTCGGC CACAACGGCC ATTTCAAAAC TCTGCAGGAG
ATCGTGCACT TCTATAACTT GAGGGACGTC CTTCCGATTT GCGACACCCC CAACCCGCCC
AAGGACGCCA TGGGTGGCGC AACCTGCTTC CCTCCTCCCG AAGTGGCGGA AAACATCAAC
AGGGTAGACA TGGGGAACCT CGGGCTGACC CCCCAAGAGG GAATGGCGCT GATCCAGTTT
ATGAAGACCC TGAACGATCT GTAA
 
Protein sequence
MISKLVLTTL SVVLVAVPVA AAELTPLESL GKNLFFDQSL SNPPGQACVD CHSPETGWTG 
PDSEINAAGA AIPGAVHTRA GNRKPPTAAY AGYNPVLHKA GSMGGGGMGG GGMGGGGMGG
GGMGGGGMGG GGMGGGGMGG GGMGGGGMGG GGMGGNMQDV FVGGMFWDGR GTGWEMGDPL
AEQAMGPFLN PLEQNNPNAK HVCLNVLRTG YATQFEEVWG AGSLDCVKDV DGTYQRIGRS
IAAYERSAEV SAFNSKFDTF WKNSEGKMPP VPMINMMNWT RFKKRGLTDM ELQGLMIFNT
KGKCSTCHFL QPMNGSRFPL FTDFRYHNLG VPANPENPYY DMPRQWNPKG ENWVDQGLGG
FLAKTAVMTD SAGVSMDYSA LAAQNMGKQR TPTLRNVDKR PGPDFVKAFG HNGHFKTLQE
IVHFYNLRDV LPICDTPNPP KDAMGGATCF PPPEVAENIN RVDMGNLGLT PQEGMALIQF
MKTLNDL