Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_1705 |
Symbol | |
ID | 4571065 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 1926620 |
End bp | 1930885 |
Gene Length | 4266 bp |
Protein Length | 1421 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 639766288 |
Product | hypothetical protein |
Protein accession | YP_912147 |
Protein GI | 119357503 |
COG category | [S] Function unknown [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases [COG2852] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCATTC ACCCTTACAC CGAAGACCAG CTTGTCGAGC AGCCTGCTAT CGGGTTGTTT GCGGAGCTTG GCTGGACGAC GGTGTCGGCG TTGGATGAGA CATTCGGCGT TGCCGAACCC TCCTTCGGCC ACTTTGGCCA CCCTCTCCCG GAGGGCGAGG GTAATGGAGA TTCCCGGATA TCATTAGGGC GGGAGACGAA GGGGGAGGTG GTTCTGGCGT TACGGTTGGG TGCGGTGTTG GAGCGGTTGA ACCCTGCGCT GCCGCCCGAG GCAATCACCT CGGCAATGGA TGAATTGACT CGCGACCGTT CAGCCATGCT TCTGGAGGCA GCGAATCGCG AAATCTATCT GCTGCTCAAG GAGGGGATTC GCGTCAGCGT TGTTGACACC GAACCCTCAC CAGGTTTCCC TCTCCCAGAG GGCGAGGGTG CAGAACCACC TCCATCTACC TTTGGGAGAA GGGCAGGAAA TAACGATTCG CTACAGAAAT ATGGAGGGCA GAAGATGGAG CGGGTGAGAG TTGTTGATTG GGAGCATCCG GAGAACAACG ATTTTCTGCT GGTGAGTCAA TTCAGCGTTA CGGGACAGCT CTACACTTGC CGCCCTGATC TGGTTGGGTT CGTGAACGGG CTGCCAATGG TGGTGATCGA GCTGAAAAAG CCAGGTGTGC CTGCCCGGAC GGCTTTCGAC GAAAATCTCA GGCACTACAA GGAGCAGATT CCTTCGCTTT TCTGGTACAA TGCGCTGCTT ATCGCTTCGA ACGGTACGGA AAGCCGTGTC GGTTCGCTCA CTGCTGATTG GGAACGGTTC TTCGAGTGGA AGCGCATCGC GCGGGAAAAC GAGCCCCGCA GGGTCTCGCT TGAGGTGATG CTTCGCGGCA CGTGCGACCG CGGTCGTTTT TTTGACCTCG TTGAGAACTT CACGCTTTTT TCGGAGCATA AGGCGGGATT TGCAAATCCC GCCACCCTCA CCCCAACCCC TCTCCCAGGG GTAGAGGGGC GGAGTAAAAG CGGGCTGGTG AAGATTATCG GACAGAACCA CCAGTTTCTC GGAGTCAATG CGGCAATTGC GTCGATGCTC AGGATTCGTA AAGAATATGC AGTTGTCGCA ACCTTCACCC CAACCCCTCT CCCAGAGGTA GAGGGGCAAT ATCGTGGTGG TTTTTATTAT TCGGGACTGG TTGAACGCGC GAGGGAATTA CGGCAACAGC AGACACCGGC AGAGGATATC GTATGGGAGC TTTTGCGTGA CCATCGGTTC GAGGGGCTGA AGTTTCGTCG GCAACACCAG ATCGGAAATT ATATTGCAGA TTTCTTTTGT TCAGAGCATA AACTTGTTGT CGAAGTTGAT GGTGACGTTC ATCGTTCGCC GGAGGTGGTT GCGAAGGATT CACAGCGGGA TGCATATTTG CGTTCTCTTG GTTATACCGT TATTCGCTTT GAGAATCAGT TGGTATTGAA TAATCCGGCA GAGTTTCTGA ATCAAATCAA GACAGCATTG CAGCTTCCCT CTACCGCTGG GAGAGGGGCC GGGGGTGAGG GGAACACACC AGGACCGAAA GTGATTGGTG CAGCATTGAC TTACCCCTCT ACCACTGGGA GAGGGGCCGG GGGTGAGGGG AATACACCAG TACCGAATGA AATTGGTGAA TCATTGACTT ACCCCTCTAC CACTGGGAGA GGGGCCGGGG GTGAGGGGAA CACACCAGGA CCGAAAGTGA TTGGTGAATC ATTGACTTAC CCCTCTACCA CTGGGAGAGG GGCTGGGGGT GAGGGGAACA TACCAGTACC GAATGAAATT GGTGAATCAT TGACTTACCC CTCTACCACT GGGAGAGGGG CCGGAGGTGA GGGGAACACA CCAGGACCGA AAGTGATTGG TGAATTATTG ACTTACCCCT CTACCACTGG GAGAGGGGCT GGGGGTGAGG GGAACACACC AGGACCGAAT GGTATTGGTG TGTTCTGGCA GACGCAGGGA AGCGGCAAGA GTTTTTCGAT GGTCTTTTTT GCACAGAAGG TGCTTCGAAA GGTTGCGGGC AACTGGACCT TCGTGGTGGT GACCGACCGT ATTGAGCTTG ACGAGCAGAT AGCAAAAACC TTCAAGGCTG CCGGTGCAGT AAGCAAGGCC GAAGGCGATG CATGCCACGC TTCGAGCGGC GATCATCTCC GGCAACTGCT TCGGGGCAAT CATCGCTACG TCTTTACGCT GATCCATAAG TTTCGTTCCG AAACATGTTC CGAAACACCG GTTGCGGGTA ACAATGGAAT CATGCCAGTG CTGAGTGACA GGGCGGATAT TATTGTTTTG ACAGACGAGG CGCACCGCAG CCAGTACGAC ACGCTGGCGC TGAATATGCG CTCGGCTCTT CCCAATGCCA TGTTTCTCGC CTTTACCGGT ACGCCGCTCA TTGCCGGAGA GGAGCGCACC AAAGAGGTTT TCGGCGATTA CGTATCCATT TACGATTTCC AGCAGTCCGT CGAGGACGGA GCTACGGTTC CGCTTTTCTA CGAGAACCGT ACGCCCGAGC TGCAACTGGT CAACCCCGAC CTGAACGACG ATATCTACCG GCTTATCGAG GATGCCGAAC TCGATCCCGA TCAGGAATCG AAGCTTGAGC GGGAGCTGAA ACGACAGTAT CATCTGCTGA CGCGCGACGA CCGGCTTGAT ACCGTTGCAA AGGACATCGT GCGGCATTTT CTCGGACGGG GGTTCATCGG CAAGGCGATG GTGGTATCTA TCGACAAGGC CACCGCAATC AGGATGTACG ACAAAGTGCA ACGGTTCTGG ATGGAAGAGA CGGGGCTTGT CCGGCAGCAG CTTGTTCGTT ATGATCTTTC GTCTGAAAGA AGAGGGGAAC TGCAGGAGCG GCTACAGGTG CTTGAGAGCA CCGACATGGC CGTGATCGTT TCTCCCGGCC AGAATGAAAT CGGGCAGATG CAGCAGATTG GCCTCGATAT TGTGAAGCAC CGCAAGCGCA TGGTTGAGTC GCAGCCGGCT CTGGATGAGA AGTTCAAGGA CAGCGTCGAT CCGCTGAGAA TCGTGTTCGT TTGCGCCATG TGGTTGACCG GTTTTGATGC GCCGAGCTGT TCGACGGTCT ATCTCGACAA ACCCATGCGC AACCATACGC TCATGCAGAC CATAGCGCGG GCAAACCGTG TTTTTCCCGG CAAGCACAGC GGCATGATTG TGGATTACGC GAATGTGTTC GCCTCGCTTG AAAAGGCGCT GGCTATCTAC GGGGCAGGGA AGGGCGGTCA TAATCCGGTG AGGGGGAAAG AGGCGCTTGT CGAAGAATTA CGCAAGGCGG TCGAGCTGGC AACGGCCTTC TGCATGGAGC ATGGGGTGCA TCTTGCAGGA ATCGAAGGGG CGGCTTCAGG CATGGAACGG CTGCAGCGAA TCGAGGATGC GGTAAACGCC CTGATTGGGG ATACCCTGCG ACGGGAGTTT TTCGGCCATG AACGTCTTGT CCGTACGCTC TACCAGGCGG TAAAGCCCGA TCCAGCTGCG ATTGCTTTTT CGGAGCGGGT AGCGTGTATT GCGGCAATTG CTGAAGCGAT ACGGGGGAAG CTGAATCCAG TTGCGCCGGA TATTTCCACA ATCATGCAGG GAATCGGCAG ATTGCTCGAC GAATCCATTA CCGGTCACGC CATCCGCGAA TCCGGGTCGC CGGTGCTCGA TCTTTCGAAA ATCAATTTCA AGGCGCTCTC TGATCGGTTC AGGCAATCGA AGCACAGGAA CACCGATCTG GAAATGCTCA AAGCCGCGAT TAACGCCAAG CTGGAAAACA TGATTCGCCT CAATCGCACA AGGGCGGATT TTGCCGGGAA GTTCGAGGAG TTGATTGCAA GCTACAATGC GGGCAGCCGA AGCATCGAAG ATCTCTTTGA CGAACTCCTC AAGCTCTCGC TCAGCCTGAG CGATGAGGAG CAGCGCCATG TTCGGGAAAA TATGAACGAG GAGGAGCTGG TTATTTTCGA CATTCTCACC CGCCCGGCAC CGGAACTCAA TACCGATGAA CGCAGCGAAG TGAAAAAGGT TGCCCGCGAA CTGCTCGTCA AGATCAAAGG CCTCCTGGTG CTGAACTGGC GGCAAAAAAC AGAGGCCCGT TCCAGGCTGA AGCTCACCAT CGAGGATGCC CTCGACCAGG GCCTTCCCCG AGCCTATACG CCGGAAATCT ACCGGCAGAA ATGCTCCGCC GTATTCGAGC ATGTCTATGA ATCCTATCCC GAACGCGATG CGGGGGTATA CGCCGAGGCG GGATGA
|
Protein sequence | MTIHPYTEDQ LVEQPAIGLF AELGWTTVSA LDETFGVAEP SFGHFGHPLP EGEGNGDSRI SLGRETKGEV VLALRLGAVL ERLNPALPPE AITSAMDELT RDRSAMLLEA ANREIYLLLK EGIRVSVVDT EPSPGFPLPE GEGAEPPPST FGRRAGNNDS LQKYGGQKME RVRVVDWEHP ENNDFLLVSQ FSVTGQLYTC RPDLVGFVNG LPMVVIELKK PGVPARTAFD ENLRHYKEQI PSLFWYNALL IASNGTESRV GSLTADWERF FEWKRIAREN EPRRVSLEVM LRGTCDRGRF FDLVENFTLF SEHKAGFANP ATLTPTPLPG VEGRSKSGLV KIIGQNHQFL GVNAAIASML RIRKEYAVVA TFTPTPLPEV EGQYRGGFYY SGLVERAREL RQQQTPAEDI VWELLRDHRF EGLKFRRQHQ IGNYIADFFC SEHKLVVEVD GDVHRSPEVV AKDSQRDAYL RSLGYTVIRF ENQLVLNNPA EFLNQIKTAL QLPSTAGRGA GGEGNTPGPK VIGAALTYPS TTGRGAGGEG NTPVPNEIGE SLTYPSTTGR GAGGEGNTPG PKVIGESLTY PSTTGRGAGG EGNIPVPNEI GESLTYPSTT GRGAGGEGNT PGPKVIGELL TYPSTTGRGA GGEGNTPGPN GIGVFWQTQG SGKSFSMVFF AQKVLRKVAG NWTFVVVTDR IELDEQIAKT FKAAGAVSKA EGDACHASSG DHLRQLLRGN HRYVFTLIHK FRSETCSETP VAGNNGIMPV LSDRADIIVL TDEAHRSQYD TLALNMRSAL PNAMFLAFTG TPLIAGEERT KEVFGDYVSI YDFQQSVEDG ATVPLFYENR TPELQLVNPD LNDDIYRLIE DAELDPDQES KLERELKRQY HLLTRDDRLD TVAKDIVRHF LGRGFIGKAM VVSIDKATAI RMYDKVQRFW MEETGLVRQQ LVRYDLSSER RGELQERLQV LESTDMAVIV SPGQNEIGQM QQIGLDIVKH RKRMVESQPA LDEKFKDSVD PLRIVFVCAM WLTGFDAPSC STVYLDKPMR NHTLMQTIAR ANRVFPGKHS GMIVDYANVF ASLEKALAIY GAGKGGHNPV RGKEALVEEL RKAVELATAF CMEHGVHLAG IEGAASGMER LQRIEDAVNA LIGDTLRREF FGHERLVRTL YQAVKPDPAA IAFSERVACI AAIAEAIRGK LNPVAPDIST IMQGIGRLLD ESITGHAIRE SGSPVLDLSK INFKALSDRF RQSKHRNTDL EMLKAAINAK LENMIRLNRT RADFAGKFEE LIASYNAGSR SIEDLFDELL KLSLSLSDEE QRHVRENMNE EELVIFDILT RPAPELNTDE RSEVKKVARE LLVKIKGLLV LNWRQKTEAR SRLKLTIEDA LDQGLPRAYT PEIYRQKCSA VFEHVYESYP ERDAGVYAEA G
|
| |