Gene Cpha266_1614 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1614 
Symbol 
ID4571137 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp1835686 
End bp1837443 
Gene Length1758 bp 
Protein Length585 aa 
Translation table11 
GC content48% 
IMG OID639766195 
Productrestriction endonuclease 
Protein accessionYP_912059 
Protein GI119357415 
COG category[V] Defense mechanisms 
COG ID[COG1715] Restriction endonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000775525 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAACAC GCATCGGCCA ATTGTCCCTA AACTACAGTG CACGCGGCAT TCCCACTTAC 
ACACTCGAAC TCTGGCACGA CGGACTGAAA AAGCATCGCC TTATTCGTGG CGAAAGTGAA
TCAATCGTCA ATCTGAAGGC AACACTGCAA GTTGAAGAAT GGGAGGAACG CTGGGCAGTT
ATTGACGCTA AAGAGCGTGA CCGCTCACAG AAACTCGCCG GAAAACGGCA GATTGAAGAA
AACAAATCGC TAGCCGTGGA ACGGACTGCT GAAGCCCAGC AAGAACTCGA ACGGCTAAAT
TCGCTTCTGA AGGCAACCTT GGCGGTTGAT GACACTATTG ATTGGGAGAA GCTAAAGGAT
AAAACGCCCT ACCCGGAAAA AAAACCGGTG ATGCCACCCA CACCACGGGA GCCCGTATTG
CCGCAAATGC CAAGCGAACC ATTACGAGGT GACCAAAAAT ATATTCCCTC ATTAGGAATT
CTCGACAAGC TGATAGTCTC TCGAAAGGAA CGTGCGGTTT CGGAAAAGCT GGCACTGTTT
GCCTCCGATC ACAAGTCGTG GCAGGACGAG GTAGCCATAA TTACACGCAC ACACACAGCA
GCGCTTTTGG TACATGGAAA GTCCGTTGCC GCCATGCGTG AAGAACACGA AAAGCAAGTT
TCAGCATGGG ACAAACGACG CAACGAGTAT TTAAACAAAC AATCTGCTAC ACATGCTGAA
GTTGACGCAA AACGCACTAC CTATGAGTCC AGCGATCCTG ATGCAATTAC CGAATACTGC
GATTTAGTTC TTTCATCCTC GCGCTATCCA GTCTATTTCC CGCAGGAATA TGATCTCGAC
TATGACGCAG CAACTAAAAC AATCATCGTT GATTACCGGC TTCCCGCGCC AGACGATCTT
CCACGTTTGA AGGCAGTTAA GTACGTTGCA AGCCGTGATG AGTTTGAAGA GCAGTATATT
TCCGAGGCTC AATCATCCAA GCTCTACGAC GATATTTTAT ATCAAGTCGT CCTACGCACA
GTTCACGAGT TGTTCGAAGC GGACATCATC TCTGCAATTG AGACAATTGT TTTCAATGGT
ATTGTCACTT CAACGGATCG TACAACCGGT AAGCCAACGA CAGCATGCGT TCTCTCACTG
CGTGCCAATC GTGCTGAGTT CTTGGAGATT AACCTTTCAC AAGTCGATCC GAAGGCGTGT
TTTAAGTCGC TCAAAGGTGT CGGAAGCTCA AAGCTCCATG GCTTGTCGCC GGTTCCACCC
ATCATGCAGC TTCGGAGGGA CGATGGACGA TTCGTATCCG CTTACGAAGT CGCCAATACG
CTTGATAGCA GCGTAAATTT AGCTGCTATG GACTGGGAGG ACTTCGAACA TTTGATTCGT
GAGATTTTTG AAAAGGAATT TTCATCATCT GGTGGCGAAG TTAAGGTAAC TCAAGCAAGT
CGCGATGGAG GTGTTGATGC CATTGCTTTT GATCCTGATC CCATTAGGGG CGGAAAGATC
GTTATTCAAG CAAAGCGATA TACCAACACG GTCGGCGTTG GTGCGGTACG TGATCTCTAC
GGCACCGTAG TGAATGAAGG TGCAACAAAG GGTATTTTGG TTACTACGTC CGACTATGGC
CCCGACTCTT ATGCCTTTGC CAATGGAAAA CCCCTTGTTC TTCTCAGCGG TGCTAACTTG
TTACATATTC TGGAGAAACA TGGTCATCAA GCCCGCATTG ACATACAGGA AGCAAGAAAG
CTTACTGCAA AGCTATGA
 
Protein sequence
MKTRIGQLSL NYSARGIPTY TLELWHDGLK KHRLIRGESE SIVNLKATLQ VEEWEERWAV 
IDAKERDRSQ KLAGKRQIEE NKSLAVERTA EAQQELERLN SLLKATLAVD DTIDWEKLKD
KTPYPEKKPV MPPTPREPVL PQMPSEPLRG DQKYIPSLGI LDKLIVSRKE RAVSEKLALF
ASDHKSWQDE VAIITRTHTA ALLVHGKSVA AMREEHEKQV SAWDKRRNEY LNKQSATHAE
VDAKRTTYES SDPDAITEYC DLVLSSSRYP VYFPQEYDLD YDAATKTIIV DYRLPAPDDL
PRLKAVKYVA SRDEFEEQYI SEAQSSKLYD DILYQVVLRT VHELFEADII SAIETIVFNG
IVTSTDRTTG KPTTACVLSL RANRAEFLEI NLSQVDPKAC FKSLKGVGSS KLHGLSPVPP
IMQLRRDDGR FVSAYEVANT LDSSVNLAAM DWEDFEHLIR EIFEKEFSSS GGEVKVTQAS
RDGGVDAIAF DPDPIRGGKI VIQAKRYTNT VGVGAVRDLY GTVVNEGATK GILVTTSDYG
PDSYAFANGK PLVLLSGANL LHILEKHGHQ ARIDIQEARK LTAKL