Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Clim_1468 |
Symbol | |
ID | 6354781 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium limicola DSM 245 |
Kingdom | Bacteria |
Replicon accession | NC_010803 |
Strand | - |
Start bp | 1573849 |
End bp | 1576926 |
Gene Length | 3078 bp |
Protein Length | 1025 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642669076 |
Product | type III restriction protein res subunit |
Protein accession | YP_001943504 |
Protein GI | 189346975 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.780281 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAATC CTTTCTTCGA TCATCCCATA CTCAACTCTC CCTACAGGTA TCCCGAACGG CACTGGGAAC TCGACGAACA CGGCCAGCCT ACCCAGAAAA TCATCGACAC TCGACGACCG GCGCAGTTCA TCACGCCGAT TCCGAAACCC CGCAAACGCA GGAGTGACGA AGCCCAGCAG CAGCTTGTTT TCGACGAAGG AAAAGGCCTT TCAACCGAAA CCCAGCAGTA CGACCAGACA TCGCTTATCA ACGCCGTCCG CCGTGAGGTT GAGAAGTGGC GGGAATTGCC GAACCCGAAC AACTGGCAGG TAACGCCTGA AACCGCCCGG CTGCTGCAGC ACTGGCGGCA CCACGACTTC AGCGGTATCC GTCCCTTTTT CTGTCAGGTG GAAGCGGTAG AAACCGCCAT CTGGCTGACC GAGGTTGCAC CCCATACGGG AAAAACGGGC AAGAGGTTTC TCGACCATCT CGAAGACGCC AATGGCAATG CCAACCCGGA AATCATGCGC CTTGCACTGA AGCTTGCAAC CGGAGCCGGC AAAACCACCG TTATGGCCAT GCTCATTGCG TGGCAGGCGG TCAATGCCGC CCGCAGGCCG CAAAGCCGGA AGTTCACGCG AGGCTTTCTC GTTGTCACGC CAGGCATCAC CATTCGCGAC CGGCTCAGGG TTTTGCTGCC CAACGACCCC GACAGCTATT ACAAAAGCCG CGAACTTGTA CCCGGCGACA TGATCGGTGA TATCGAACGG GCCAAGATCG TCATCACCAA CTACCACGCT TTCAAGCTTC GTGAACGGCA CGAACTCTCC AAAGGCGGGC GATTGCTCCT GCAGGGCAGG GGACAAAAGC TGCAGACCCT CGAAACCGAA GGGCAGATGC TGCAGCGGGT TATGCCGTAC CTGATGGGTA TGAAGAACAT CATGGTCATC AACGATGAGG CACACCACTG TTACCGCGAA AAACCGGATG GTGATGAATT CCAGGAACTC AGGGGCGACG AGAAAAAGGA AGCCGAAGAG AACAATGAAG CGGCGCGGGT CTGGATTACC GGCATCGAAA CCGTGAAAAG AAAGCTTGGC GTGAACTGGG TGATCGACCT GTCGGCTACG CCCTTTTTCC TGAGCGGTTC CGGCTATGCC GAGGGTACGC TGTTTCCCTG GACCATGAGC GACTTTTCGC TGATGGACGC CATCGAAAGC GGCATCGTCA AATTGCCGAG GGTGCCGGTA GCCGACAACG TACCCGGCGG CGACATGCCG AAGTTCCGTG AACTCTGGAA GCATATCGGC AAAAAGATGC CGAAGAAAAG CCGCAGCAAG ACAAACGCCT ACGACCCGCT CAGTATTCCG GTAGAGCTGC AGACTGCTCT CGAAGCGCTT TACGGCCATT ACCGGCAGAC CTTCGACCTC TGGAAAGAGA ACAATATTTC CGTGCCACCC TGCTTCATCG TGGTTTGCAA CAACACCTCG ACATCGAAGC TGGTGCACGA CTACATTTCC GGTTTTTACC GCGAGCAGGA AGACGGCACG AAGCAACTGG TCAACGGGAG GCTCGAACTT TTCAGGAACT TCGATGCGGA CGGTTCACCC CTTCCGCAAC CGAGCACACT GCTTATCGAC AGCAAGCAGC TCGAATCCGG CGATGCGCTC GACAGGAACT TCCGCGACAT GGCCGGTGAC GAAATCGAAC GGTTCCGGAG AGAGATCATC GAGCGAACCG GCGACCGCCG ACAGGCTGAA AACCTCACCG ATCAGGACCT TCTGCGCGAG GTCATGAACA CCGTGGGAAA GCATGGCCGG CTTGGGCAGT CGATCCGTTG CGTGGTGTCG GTCTCCATGC TTACCGAAGG GTGGGACGCC AACACGGTCA CCCATGTGCT CGGCGTCCGC GCATTCGGCA CCCAGCTCCT CTGCGAGCAG GTGATCGGGC GCGCTCTGCG CAGGCAGTCC TACGATCTCA ACGAAGAGTG TCTCTTCAAC ACCGAATATG CCGATGTGCT CGGCATACCC TTCGACTTCA CCGCCAAGCC GGTTGTCGCC CCGCCGCAGC CGCCGCGCGA AACCGTGCAG GTCAGGGCTG TCCGTCCGGA ACGCGATCAT CTTGAAATCA CCTTTCCCAA CGTGGCGGGT TATCGTGTCG AACTGCCTGA AGAACAGCTT ACCGCCGAGT TTACCGATGA ATCGGTGCTC GAACTTACGC CCGACCTTGT CGGCCCCTCG ATCACGCGCA ACTCGGGCAT CATCGGCGAA GCCATCGACC TCAGCCTCGA ACACCTTGGC GACATGCGCC AGTCAACCCT GCTGTTCCAC CTGACCCAGC GGCTGCTCTA CACCAAATGG CGCGACCCGG AGGAGTCGCC CAAGCTGCAC CTCTTCGGCC AGCTCAAGCG CATCACCCGC CAGTGGCTCG ACACCTGCCT TGTCTGCAAA GGGGGAACCT ACCCTGCGCA GCTCATCTAT CAGGAACTTG CCGACATTGC CTGCAACCGC ATCACAGCAG CCATCACGAG GGCGGAAATC GGCAGGCGGC CGGTCAAGGC CGTGCTCGAC CCTTACAATC CGACAGGTTC ATCAAGGTAT GTGAACTTTA CCACATCGAA ACGCGACCGA TGGGAAACCG ATGCACGGCA CTGCCATGTC AACTGGGTCA TTCTCGACAG CGACTGGGAA GCCGAGTTCT GCAGGGTTGC CGAATCGCAT CCCAAGGTCC GTTCATACGT CAAGAACCAT AACCTCGGGC TCGAAGTCCC TTACCGGTAC GGCTCCGAAA TGCGCCGATA CCTGCCGGAC TTCATTGTGC TCATTGACGA CGGCAACGGC AGTGACGACC TCCTGCACCT CGTGGTTGAA ATCAAGGGCT ACCGGCGCGA AGACGCCAAG GAGAAGAAAT CCACCATGGA TACCTACTGG ATTACCGGCG TCAACAACAT CGGCACTTAC GGGCGCTGGG CATTCGCGGA GCTTACCCAG CCCTACACCT TCGAACTGGA TATGGGCAAG CAGATCGAGG AGGCGTTCAG CAGAATGCTC GAACAGGCAT CGGCTGTTCA ATCAGAGGGA GCGACGAGCC ATGCTTGA
|
Protein sequence | MSNPFFDHPI LNSPYRYPER HWELDEHGQP TQKIIDTRRP AQFITPIPKP RKRRSDEAQQ QLVFDEGKGL STETQQYDQT SLINAVRREV EKWRELPNPN NWQVTPETAR LLQHWRHHDF SGIRPFFCQV EAVETAIWLT EVAPHTGKTG KRFLDHLEDA NGNANPEIMR LALKLATGAG KTTVMAMLIA WQAVNAARRP QSRKFTRGFL VVTPGITIRD RLRVLLPNDP DSYYKSRELV PGDMIGDIER AKIVITNYHA FKLRERHELS KGGRLLLQGR GQKLQTLETE GQMLQRVMPY LMGMKNIMVI NDEAHHCYRE KPDGDEFQEL RGDEKKEAEE NNEAARVWIT GIETVKRKLG VNWVIDLSAT PFFLSGSGYA EGTLFPWTMS DFSLMDAIES GIVKLPRVPV ADNVPGGDMP KFRELWKHIG KKMPKKSRSK TNAYDPLSIP VELQTALEAL YGHYRQTFDL WKENNISVPP CFIVVCNNTS TSKLVHDYIS GFYREQEDGT KQLVNGRLEL FRNFDADGSP LPQPSTLLID SKQLESGDAL DRNFRDMAGD EIERFRREII ERTGDRRQAE NLTDQDLLRE VMNTVGKHGR LGQSIRCVVS VSMLTEGWDA NTVTHVLGVR AFGTQLLCEQ VIGRALRRQS YDLNEECLFN TEYADVLGIP FDFTAKPVVA PPQPPRETVQ VRAVRPERDH LEITFPNVAG YRVELPEEQL TAEFTDESVL ELTPDLVGPS ITRNSGIIGE AIDLSLEHLG DMRQSTLLFH LTQRLLYTKW RDPEESPKLH LFGQLKRITR QWLDTCLVCK GGTYPAQLIY QELADIACNR ITAAITRAEI GRRPVKAVLD PYNPTGSSRY VNFTTSKRDR WETDARHCHV NWVILDSDWE AEFCRVAESH PKVRSYVKNH NLGLEVPYRY GSEMRRYLPD FIVLIDDGNG SDDLLHLVVE IKGYRREDAK EKKSTMDTYW ITGVNNIGTY GRWAFAELTQ PYTFELDMGK QIEEAFSRML EQASAVQSEG ATSHA
|
| |