Gene Cpha266_1492 details


Gene Information

Locus tag: Cpha266_1492
Symbol:
ID: 4570281
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 1692656
End bp: 1694095
Gene Length: 1440 bp
Protein Length: 479 aa
Translation table: 11
GC content: 42%
IMG OID: 639766075
Product: restriction modification system DNA specificity subunit
Protein accession: YP_911940
Protein GI: 119357296
COG category: [V] Defense mechanisms
COG ID: [COG0732] Restriction endonuclease S subunits
TIGRFAM ID:
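
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
# Values copied from the record above.
START_BP = 1692656
END_BP = 1694095
GENE_LENGTH_BP = 1440
PROTEIN_LENGTH_AA = 479


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks agree with the reported fields under the assumptions above.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```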


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAAGTA ATCATCATGT GATTGCAATA TTAGGAGATG TCGCTGAATA TATAAATGGT 
CGTGCGTTTA AACCGTCAGA GTGGGGAAAA GAAGGTCTCC CTATCATTAG GATAAAAAAT
TTGAATGATG AAAACTCAAA ATTCAATTAT AGTAATGAGG TTTTTGAAAA AAGGTACCTT
GTGAAAAAAG GGGATTTACT TTTTGCTTGG TCTGCCTCTC TCGGTGCATA CATATGGAAA
AAAGATGAAG CTTGGTTAAA TCAACATATT TTTCTTGTCA AACCGAGTCC GTTTATAGCA
AAACTATACC TTTATTATTT TCTCGACAAA ATAACACAAG AGCTTTATTC TGCTGCACAT
GGTTCCGGAA TGGTCCATGT TACAAAGAAG AAATTTGAGG AAACTAAGAT TGGTTTACCG
CCACTATCTG AGCAACGATC CATCGTTTCC AAAATCGAGC AGCTTTTCAG CGAACTTGAT
AACGGGATTG CCTGTCTGAA AAAAGCACAG GAGCAACTTA AAGTCTATCG TCAGGCTGTT
CTGAAGCAAG CGTTTGAGGG TGAACTCACA AAATCCTGGC GCGAACAGCA AGCCAACCTC
CCGTCAGCAC AGGATCTTCT CGATACGATC AAGACAGAAC GAGAGCAAGC TGCAAAAAAT
CAGGGTAAAA AGCTCAAGCC GGTAACTCCT CTTGCAAAAG TGGAACTTGA TGAGTTGACT
GAACTGCCGG ATGGGTGGTG CTGGATAAAA TTAGGTGAGT TGACCATCGG TGTTGAGTAT
GGGACTTCAA CAAAATCACT TGAAAAAGGT GAGGTTCCCG TAATAAGAAT GGGCAATATT
CAGCAAGGTC GAATTGATTG GAATGATTTG GCTTTTACCG ATGATAAGGC GGATATTTCA
AAATATCGAT TGTTAAAAGG TGATGTCCTT TTTAATAGGA CAAATAGCCC GGAACTCGTT
GGTAAAGCCG CGATCTATAA TGGAGAAATG CCTGCTATTT TTGCAGGATA CCTCATCAGA
GTCAATCAAA TCAAAGAATT ATTGCACTGC AAGTATCTCA ACTTTTTCCT GAATTCTCAT
CCTGCAAAAG TTTATGGCAA TTCAGTAAAG ACTGATGGAG TAAATCAGTC AAACATCAAT
GGGGAAAAAC TCAAAAGTTA TCCCTTGCCA TATTGTTCAC CAAAAGAGCA AGAGCAAATC
GTGCAGGAAA TTGAGGCGCG CCTTTCGGTT TGCGACAACA TGGAGGCAAC AATCCGCGAA
TCGCTTGAAA AAGCTGAGGC CTTACGGCAA AGCATTCTGA AAAAAGCATT CGAGGGCAAG
TTGCTCAGCG AGGAGGAGTT AACGGCAACC CGCAACGATC CGGACTGGGA GCCTGCCGAG
AAGCTGCTTG AGCGGATCAG GGCTGAAAAA AACCAATCGA AGAAACAAGC ATTAACGTAG
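
The gene sequence is displayed in blocks of 10 bases, so the spaces and line breaks need to be removed before any computation. As a minimal sketch, the helpers below (hypothetical names) strip the layout whitespace and compute the GC fraction. How the database rounds its reported GC content is not stated in this record, so no expected value for the full gene is given here.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; raises on an empty sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two display blocks of the gene sequence above.
first_blocks = clean_sequence("ATGACAAGTA ATCATCATGT")
print(len(first_blocks), gc_fraction(first_blocks))
```

Pasting the full gene sequence block into `clean_sequence` should yield a string whose length equals the reported gene length, which is a quick check that nothing was lost when copying.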
 
Protein sequence
MTSNHHVIAI LGDVAEYING RAFKPSEWGK EGLPIIRIKN LNDENSKFNY SNEVFEKRYL 
VKKGDLLFAW SASLGAYIWK KDEAWLNQHI FLVKPSPFIA KLYLYYFLDK ITQELYSAAH
GSGMVHVTKK KFEETKIGLP PLSEQRSIVS KIEQLFSELD NGIACLKKAQ EQLKVYRQAV
LKQAFEGELT KSWREQQANL PSAQDLLDTI KTEREQAAKN QGKKLKPVTP LAKVELDELT
ELPDGWCWIK LGELTIGVEY GTSTKSLEKG EVPVIRMGNI QQGRIDWNDL AFTDDKADIS
KYRLLKGDVL FNRTNSPELV GKAAIYNGEM PAIFAGYLIR VNQIKELLHC KYLNFFLNSH
PAKVYGNSVK TDGVNQSNIN GEKLKSYPLP YCSPKEQEQI VQEIEARLSV CDNMEATIRE
SLEKAEALRQ SILKKAFEGK LLSEEELTAT RNDPDWEPAE KLLERIRAEK NQSKKQALT
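
The protein sequence corresponds to a translation of the gene sequence with translation table 11. The sketch below is a small stand-alone illustration, not the database's own method: it builds the codon table from the standard NCBI ordering of bases (T, C, A, G), whose amino acid assignments are shared by tables 1 and 11, and translates the first 30 bases of the gene. The function names are hypothetical, and alternative start codons are not handled.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove display whitespace (spaces, line breaks) and uppercase."""
    return "".join(raw.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence.
prefix = clean_sequence("ATGACAAGTA ATCATCATGT GATTGCAATA")
print(translate(prefix))  # MTSNHHVIAI, the first block of the protein sequence
```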