Gene Cag_1187 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1187 
Symbol 
ID3748221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1580886 
End bp1581944 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content47% 
IMG OID637773721 
Productcytochrome c551 peroxidase 
Protein accessionYP_379492 
Protein GI78189154 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1858] Cytochrome c peroxidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0823128 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAC ATCAAATGCT TGCGGCTACT TTGTTAGCTG TTGGGGTTCT TACTTCATGC 
TCCAAAAAGT CGGAGCCTGT GCCTGAACCA GCCCCCGTTC CACCACCAGC CCCCGTGCTT
CAACCACGCA CTGGCGAACC AGCGCAACCT ATTGAAGCTC CAACCGTTGC TGATGCGGGA
ATGGTTGAGC TTGGCAAAAA GCTCTTTTTT GATCCACGTC TTTCCAAGTC AGGTTTTATT
TCTTGTAACT CCTGCCATAA TTTGAGCATG GGCGGCAGTG ATAACCTGAA AAGTTCAATT
GGGCATAAGT GGCAGCAAGG ACCTATTAAT TCGCCAACCG TGCTTAATTC AAGCATGAAT
TTGGCACAAT TTTGGGATGG ACGCGCTAAA GATTTAAAAG AGCAAGCAGG TGGTCCAATA
GCTAATCCGG GTGAAATGGC ATTTACGCAC GAATTGGCAA TTAATGTACT AAAAACAGTA
CCCGGTTATG TTGATGAGTT TAAAAAAGTC TTTAAAAGCG ATTCCCTCTC AATTGATCAA
GTAACGCAAG CTATTGCCTC TTTTGAAGAG ACGTTAGTAA CGCCTAACTC CCGTTTTGAT
AAGTGGTTAA AGGGAGATGA TGCGGCTCTT ACCGCTGAAG AGCTTGCGGG TTACCAGCTC
TTTAAAAGCA GTGGATGTAC GGCGTGCCAT AATGGTGTGG CGCTTGGCGG TAACTCCTTC
CAAAAGATGG GAGTAGTCCA GCCCTATCGT TCCACTAACA AAGCAGCAGG TCGTTTTGCG
GTAACGAAGG ATAACGCTGA CCGTTTTGCC TTTAAAGTGC CAACATTGCG TAACGTCGAG
CTAACCTATC CATACTTCCA TGATGGAGCG GCACCAACTC TTGCAAAAGC AGTGGAAATT
ATGGGTCAAG TGCAGCTTGG GCGCACCTTT ACGCCTGAAG AAAATGGTTC GATTGTGGCA
TTCTTGAAAA CCTTAACGGG CGATCAACCA AGCTTTAGCC TACCACAATT ACCACCATCA
TCCGACACAA CGCCTGCACC TCAGCCATTT GGTAAGTAG
 
Protein sequence
MKKHQMLAAT LLAVGVLTSC SKKSEPVPEP APVPPPAPVL QPRTGEPAQP IEAPTVADAG 
MVELGKKLFF DPRLSKSGFI SCNSCHNLSM GGSDNLKSSI GHKWQQGPIN SPTVLNSSMN
LAQFWDGRAK DLKEQAGGPI ANPGEMAFTH ELAINVLKTV PGYVDEFKKV FKSDSLSIDQ
VTQAIASFEE TLVTPNSRFD KWLKGDDAAL TAEELAGYQL FKSSGCTACH NGVALGGNSF
QKMGVVQPYR STNKAAGRFA VTKDNADRFA FKVPTLRNVE LTYPYFHDGA APTLAKAVEI
MGQVQLGRTF TPEENGSIVA FLKTLTGDQP SFSLPQLPPS SDTTPAPQPF GK