Gene Cpha266_1705 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1705 
Symbol 
ID4571065 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp1926620 
End bp1930885 
Gene Length4266 bp 
Protein Length1421 aa 
Translation table11 
GC content55% 
IMG OID639766288 
Producthypothetical protein 
Protein accessionYP_912147 
Protein GI119357503 
COG category[S] Function unknown
[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
[COG2852] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCATTC ACCCTTACAC CGAAGACCAG CTTGTCGAGC AGCCTGCTAT CGGGTTGTTT 
GCGGAGCTTG GCTGGACGAC GGTGTCGGCG TTGGATGAGA CATTCGGCGT TGCCGAACCC
TCCTTCGGCC ACTTTGGCCA CCCTCTCCCG GAGGGCGAGG GTAATGGAGA TTCCCGGATA
TCATTAGGGC GGGAGACGAA GGGGGAGGTG GTTCTGGCGT TACGGTTGGG TGCGGTGTTG
GAGCGGTTGA ACCCTGCGCT GCCGCCCGAG GCAATCACCT CGGCAATGGA TGAATTGACT
CGCGACCGTT CAGCCATGCT TCTGGAGGCA GCGAATCGCG AAATCTATCT GCTGCTCAAG
GAGGGGATTC GCGTCAGCGT TGTTGACACC GAACCCTCAC CAGGTTTCCC TCTCCCAGAG
GGCGAGGGTG CAGAACCACC TCCATCTACC TTTGGGAGAA GGGCAGGAAA TAACGATTCG
CTACAGAAAT ATGGAGGGCA GAAGATGGAG CGGGTGAGAG TTGTTGATTG GGAGCATCCG
GAGAACAACG ATTTTCTGCT GGTGAGTCAA TTCAGCGTTA CGGGACAGCT CTACACTTGC
CGCCCTGATC TGGTTGGGTT CGTGAACGGG CTGCCAATGG TGGTGATCGA GCTGAAAAAG
CCAGGTGTGC CTGCCCGGAC GGCTTTCGAC GAAAATCTCA GGCACTACAA GGAGCAGATT
CCTTCGCTTT TCTGGTACAA TGCGCTGCTT ATCGCTTCGA ACGGTACGGA AAGCCGTGTC
GGTTCGCTCA CTGCTGATTG GGAACGGTTC TTCGAGTGGA AGCGCATCGC GCGGGAAAAC
GAGCCCCGCA GGGTCTCGCT TGAGGTGATG CTTCGCGGCA CGTGCGACCG CGGTCGTTTT
TTTGACCTCG TTGAGAACTT CACGCTTTTT TCGGAGCATA AGGCGGGATT TGCAAATCCC
GCCACCCTCA CCCCAACCCC TCTCCCAGGG GTAGAGGGGC GGAGTAAAAG CGGGCTGGTG
AAGATTATCG GACAGAACCA CCAGTTTCTC GGAGTCAATG CGGCAATTGC GTCGATGCTC
AGGATTCGTA AAGAATATGC AGTTGTCGCA ACCTTCACCC CAACCCCTCT CCCAGAGGTA
GAGGGGCAAT ATCGTGGTGG TTTTTATTAT TCGGGACTGG TTGAACGCGC GAGGGAATTA
CGGCAACAGC AGACACCGGC AGAGGATATC GTATGGGAGC TTTTGCGTGA CCATCGGTTC
GAGGGGCTGA AGTTTCGTCG GCAACACCAG ATCGGAAATT ATATTGCAGA TTTCTTTTGT
TCAGAGCATA AACTTGTTGT CGAAGTTGAT GGTGACGTTC ATCGTTCGCC GGAGGTGGTT
GCGAAGGATT CACAGCGGGA TGCATATTTG CGTTCTCTTG GTTATACCGT TATTCGCTTT
GAGAATCAGT TGGTATTGAA TAATCCGGCA GAGTTTCTGA ATCAAATCAA GACAGCATTG
CAGCTTCCCT CTACCGCTGG GAGAGGGGCC GGGGGTGAGG GGAACACACC AGGACCGAAA
GTGATTGGTG CAGCATTGAC TTACCCCTCT ACCACTGGGA GAGGGGCCGG GGGTGAGGGG
AATACACCAG TACCGAATGA AATTGGTGAA TCATTGACTT ACCCCTCTAC CACTGGGAGA
GGGGCCGGGG GTGAGGGGAA CACACCAGGA CCGAAAGTGA TTGGTGAATC ATTGACTTAC
CCCTCTACCA CTGGGAGAGG GGCTGGGGGT GAGGGGAACA TACCAGTACC GAATGAAATT
GGTGAATCAT TGACTTACCC CTCTACCACT GGGAGAGGGG CCGGAGGTGA GGGGAACACA
CCAGGACCGA AAGTGATTGG TGAATTATTG ACTTACCCCT CTACCACTGG GAGAGGGGCT
GGGGGTGAGG GGAACACACC AGGACCGAAT GGTATTGGTG TGTTCTGGCA GACGCAGGGA
AGCGGCAAGA GTTTTTCGAT GGTCTTTTTT GCACAGAAGG TGCTTCGAAA GGTTGCGGGC
AACTGGACCT TCGTGGTGGT GACCGACCGT ATTGAGCTTG ACGAGCAGAT AGCAAAAACC
TTCAAGGCTG CCGGTGCAGT AAGCAAGGCC GAAGGCGATG CATGCCACGC TTCGAGCGGC
GATCATCTCC GGCAACTGCT TCGGGGCAAT CATCGCTACG TCTTTACGCT GATCCATAAG
TTTCGTTCCG AAACATGTTC CGAAACACCG GTTGCGGGTA ACAATGGAAT CATGCCAGTG
CTGAGTGACA GGGCGGATAT TATTGTTTTG ACAGACGAGG CGCACCGCAG CCAGTACGAC
ACGCTGGCGC TGAATATGCG CTCGGCTCTT CCCAATGCCA TGTTTCTCGC CTTTACCGGT
ACGCCGCTCA TTGCCGGAGA GGAGCGCACC AAAGAGGTTT TCGGCGATTA CGTATCCATT
TACGATTTCC AGCAGTCCGT CGAGGACGGA GCTACGGTTC CGCTTTTCTA CGAGAACCGT
ACGCCCGAGC TGCAACTGGT CAACCCCGAC CTGAACGACG ATATCTACCG GCTTATCGAG
GATGCCGAAC TCGATCCCGA TCAGGAATCG AAGCTTGAGC GGGAGCTGAA ACGACAGTAT
CATCTGCTGA CGCGCGACGA CCGGCTTGAT ACCGTTGCAA AGGACATCGT GCGGCATTTT
CTCGGACGGG GGTTCATCGG CAAGGCGATG GTGGTATCTA TCGACAAGGC CACCGCAATC
AGGATGTACG ACAAAGTGCA ACGGTTCTGG ATGGAAGAGA CGGGGCTTGT CCGGCAGCAG
CTTGTTCGTT ATGATCTTTC GTCTGAAAGA AGAGGGGAAC TGCAGGAGCG GCTACAGGTG
CTTGAGAGCA CCGACATGGC CGTGATCGTT TCTCCCGGCC AGAATGAAAT CGGGCAGATG
CAGCAGATTG GCCTCGATAT TGTGAAGCAC CGCAAGCGCA TGGTTGAGTC GCAGCCGGCT
CTGGATGAGA AGTTCAAGGA CAGCGTCGAT CCGCTGAGAA TCGTGTTCGT TTGCGCCATG
TGGTTGACCG GTTTTGATGC GCCGAGCTGT TCGACGGTCT ATCTCGACAA ACCCATGCGC
AACCATACGC TCATGCAGAC CATAGCGCGG GCAAACCGTG TTTTTCCCGG CAAGCACAGC
GGCATGATTG TGGATTACGC GAATGTGTTC GCCTCGCTTG AAAAGGCGCT GGCTATCTAC
GGGGCAGGGA AGGGCGGTCA TAATCCGGTG AGGGGGAAAG AGGCGCTTGT CGAAGAATTA
CGCAAGGCGG TCGAGCTGGC AACGGCCTTC TGCATGGAGC ATGGGGTGCA TCTTGCAGGA
ATCGAAGGGG CGGCTTCAGG CATGGAACGG CTGCAGCGAA TCGAGGATGC GGTAAACGCC
CTGATTGGGG ATACCCTGCG ACGGGAGTTT TTCGGCCATG AACGTCTTGT CCGTACGCTC
TACCAGGCGG TAAAGCCCGA TCCAGCTGCG ATTGCTTTTT CGGAGCGGGT AGCGTGTATT
GCGGCAATTG CTGAAGCGAT ACGGGGGAAG CTGAATCCAG TTGCGCCGGA TATTTCCACA
ATCATGCAGG GAATCGGCAG ATTGCTCGAC GAATCCATTA CCGGTCACGC CATCCGCGAA
TCCGGGTCGC CGGTGCTCGA TCTTTCGAAA ATCAATTTCA AGGCGCTCTC TGATCGGTTC
AGGCAATCGA AGCACAGGAA CACCGATCTG GAAATGCTCA AAGCCGCGAT TAACGCCAAG
CTGGAAAACA TGATTCGCCT CAATCGCACA AGGGCGGATT TTGCCGGGAA GTTCGAGGAG
TTGATTGCAA GCTACAATGC GGGCAGCCGA AGCATCGAAG ATCTCTTTGA CGAACTCCTC
AAGCTCTCGC TCAGCCTGAG CGATGAGGAG CAGCGCCATG TTCGGGAAAA TATGAACGAG
GAGGAGCTGG TTATTTTCGA CATTCTCACC CGCCCGGCAC CGGAACTCAA TACCGATGAA
CGCAGCGAAG TGAAAAAGGT TGCCCGCGAA CTGCTCGTCA AGATCAAAGG CCTCCTGGTG
CTGAACTGGC GGCAAAAAAC AGAGGCCCGT TCCAGGCTGA AGCTCACCAT CGAGGATGCC
CTCGACCAGG GCCTTCCCCG AGCCTATACG CCGGAAATCT ACCGGCAGAA ATGCTCCGCC
GTATTCGAGC ATGTCTATGA ATCCTATCCC GAACGCGATG CGGGGGTATA CGCCGAGGCG
GGATGA
 
Protein sequence
MTIHPYTEDQ LVEQPAIGLF AELGWTTVSA LDETFGVAEP SFGHFGHPLP EGEGNGDSRI 
SLGRETKGEV VLALRLGAVL ERLNPALPPE AITSAMDELT RDRSAMLLEA ANREIYLLLK
EGIRVSVVDT EPSPGFPLPE GEGAEPPPST FGRRAGNNDS LQKYGGQKME RVRVVDWEHP
ENNDFLLVSQ FSVTGQLYTC RPDLVGFVNG LPMVVIELKK PGVPARTAFD ENLRHYKEQI
PSLFWYNALL IASNGTESRV GSLTADWERF FEWKRIAREN EPRRVSLEVM LRGTCDRGRF
FDLVENFTLF SEHKAGFANP ATLTPTPLPG VEGRSKSGLV KIIGQNHQFL GVNAAIASML
RIRKEYAVVA TFTPTPLPEV EGQYRGGFYY SGLVERAREL RQQQTPAEDI VWELLRDHRF
EGLKFRRQHQ IGNYIADFFC SEHKLVVEVD GDVHRSPEVV AKDSQRDAYL RSLGYTVIRF
ENQLVLNNPA EFLNQIKTAL QLPSTAGRGA GGEGNTPGPK VIGAALTYPS TTGRGAGGEG
NTPVPNEIGE SLTYPSTTGR GAGGEGNTPG PKVIGESLTY PSTTGRGAGG EGNIPVPNEI
GESLTYPSTT GRGAGGEGNT PGPKVIGELL TYPSTTGRGA GGEGNTPGPN GIGVFWQTQG
SGKSFSMVFF AQKVLRKVAG NWTFVVVTDR IELDEQIAKT FKAAGAVSKA EGDACHASSG
DHLRQLLRGN HRYVFTLIHK FRSETCSETP VAGNNGIMPV LSDRADIIVL TDEAHRSQYD
TLALNMRSAL PNAMFLAFTG TPLIAGEERT KEVFGDYVSI YDFQQSVEDG ATVPLFYENR
TPELQLVNPD LNDDIYRLIE DAELDPDQES KLERELKRQY HLLTRDDRLD TVAKDIVRHF
LGRGFIGKAM VVSIDKATAI RMYDKVQRFW MEETGLVRQQ LVRYDLSSER RGELQERLQV
LESTDMAVIV SPGQNEIGQM QQIGLDIVKH RKRMVESQPA LDEKFKDSVD PLRIVFVCAM
WLTGFDAPSC STVYLDKPMR NHTLMQTIAR ANRVFPGKHS GMIVDYANVF ASLEKALAIY
GAGKGGHNPV RGKEALVEEL RKAVELATAF CMEHGVHLAG IEGAASGMER LQRIEDAVNA
LIGDTLRREF FGHERLVRTL YQAVKPDPAA IAFSERVACI AAIAEAIRGK LNPVAPDIST
IMQGIGRLLD ESITGHAIRE SGSPVLDLSK INFKALSDRF RQSKHRNTDL EMLKAAINAK
LENMIRLNRT RADFAGKFEE LIASYNAGSR SIEDLFDELL KLSLSLSDEE QRHVRENMNE
EELVIFDILT RPAPELNTDE RSEVKKVARE LLVKIKGLLV LNWRQKTEAR SRLKLTIEDA
LDQGLPRAYT PEIYRQKCSA VFEHVYESYP ERDAGVYAEA G