Gene Clim_1506 details


Gene Information

Locus tag: Clim_1506
Symbol:
ID: 6354822
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 1621221
End bp: 1622279
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 51%
IMG OID: 642669111
Product: Cytochrome-c peroxidase
Protein accession: YP_001943536
Protein GI: 189347007
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1858] Cytochrome c peroxidase
TIGRFAM ID:
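The coordinates and lengths listed in this record are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Length in aa, assuming the last codon is a stop and is not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(1621221, 1622279)
print(length_bp, protein_length(length_bp))
```

With the values from this record, the sketch reproduces the listed gene length of 1059 bp and protein length of 352 aa.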


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.558747
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCACCTCA GGCAGATTAT TTCAGGAATT GCCGTACTTT CGGTAACCGC ATCATGCGCG 
CCGACACCCC AGGTGGAAAA GAAAAGTGAA CCGGCTGAAA CCCTGAACGT GGAAGTGCCA
AAACCAAGAG CCGGTGAGCC GGTATCACCG ATCAGTGCAG CAACGGTTTC CAATGCCGAA
ATGGTCGAGC TTGGTAAAAA GCTTTTTTTT GATCCCCGGC TTTCAAAATC AGGTTTTATC
TCCTGCAACT CCTGCCACAA TCTCAGTATG GGAGGCAGTG ACAACCTGAA GTCCTCTATT
GGTCACAAAT GGAACAAGGG TCCGATCAAT TCGCCAACAG TTCTGAACTC CTCCATGAAT
CTTGCGCAGT TCTGGGATGG CAGAGCAAAA GATCTGAAGG AACAGGCAGG GGGCCCTATT
GCCAATCCCG GCGAAATGGC CTTTACCCAT GAGCTTGCGG TAGGTGTACT GCAGTCAATT
CCCGGCTATG TCGATGAATT TAAAAAAGTG TTCGGATCCG ATCAGATCAC CATCGATCAG
ATTACCCAGG CGATAGCTGC ATTTGAAGAG ACGCTTGTGA CGCCCGGCTC ACGTTTTGAC
AAATGGCTGC TGGGAGATGA TAATGCCATA ACGAAAGATG AACGTGAAGG GTATGAGCTT
TTCAAATCGA GCGGATGTAC TGCCTGTCAT AACGGCCCGG CACTTGGGGG CAATTCCTAT
CAGAAAATGG GCGTTGTTGA ACCTTACAAA GCTGCCAGCA AGGTCGAAGG GAGATCTGCC
GTTACCGGAA AAGATGCCGA CCGCTTCAAT TTCAAGGTTC CTGCTCTCCG CAATGTTGCT
TTGACCTATC CATATTTCCA TGATGGCGAA GCGGCAACCC TTGCCAAAGC GATCGATGTG
ATGGGGCAGA TACAGCTCGG CAAACGGTTC ACTCCTGAAG AGAATGCAAA GATTGTGGCG
TTCATGAAGA CCCTGACCGG CAAGCAGCCG GTATTTGAGC TTCCCGTTCT TCCGCCTTCT
TCCGATACGA CACCGGCTCC GGAGCCTTTC GGGAAGTAA
 
Protein sequence
MHLRQIISGI AVLSVTASCA PTPQVEKKSE PAETLNVEVP KPRAGEPVSP ISAATVSNAE 
MVELGKKLFF DPRLSKSGFI SCNSCHNLSM GGSDNLKSSI GHKWNKGPIN SPTVLNSSMN
LAQFWDGRAK DLKEQAGGPI ANPGEMAFTH ELAVGVLQSI PGYVDEFKKV FGSDQITIDQ
ITQAIAAFEE TLVTPGSRFD KWLLGDDNAI TKDEREGYEL FKSSGCTACH NGPALGGNSY
QKMGVVEPYK AASKVEGRSA VTGKDADRFN FKVPALRNVA LTYPYFHDGE AATLAKAIDV
MGQIQLGKRF TPEENAKIVA FMKTLTGKQP VFELPVLPPS SDTTPAPEPF GK
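
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes the sequence is given on the coding strand in frame 1, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as starts). The helper name `translate` is hypothetical; whitespace in the pasted sequence is ignored and translation stops at the first stop codon.

```python
from itertools import product

# Standard codon assignments in NCBI order (first base varies slowest, bases T, C, A, G).
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string in frame 1 up to the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks used for layout
    protein = []
    for i in range(0, len(seq) - 2, 3):  # a trailing partial codon is ignored
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First and last stretches of the gene sequence from this record.
print(translate("ATGCACCTCA GGCAGATTAT T"))
print(translate("TCCGATACGA CACCGGCTCC GGAGCCTTTC GGGAAGTAA"))
```

Applied to the opening bases of the gene sequence, the sketch yields the residues that begin the listed protein sequence, and the final stretch ends at the TAA stop codon.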