Gene Clim_1468 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_1468 
Symbol 
ID6354781 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp1573849 
End bp1576926 
Gene Length3078 bp 
Protein Length1025 aa 
Translation table11 
GC content58% 
IMG OID642669076 
Producttype III restriction protein res subunit 
Protein accessionYP_001943504 
Protein GI189346975 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.780281 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAATC CTTTCTTCGA TCATCCCATA CTCAACTCTC CCTACAGGTA TCCCGAACGG 
CACTGGGAAC TCGACGAACA CGGCCAGCCT ACCCAGAAAA TCATCGACAC TCGACGACCG
GCGCAGTTCA TCACGCCGAT TCCGAAACCC CGCAAACGCA GGAGTGACGA AGCCCAGCAG
CAGCTTGTTT TCGACGAAGG AAAAGGCCTT TCAACCGAAA CCCAGCAGTA CGACCAGACA
TCGCTTATCA ACGCCGTCCG CCGTGAGGTT GAGAAGTGGC GGGAATTGCC GAACCCGAAC
AACTGGCAGG TAACGCCTGA AACCGCCCGG CTGCTGCAGC ACTGGCGGCA CCACGACTTC
AGCGGTATCC GTCCCTTTTT CTGTCAGGTG GAAGCGGTAG AAACCGCCAT CTGGCTGACC
GAGGTTGCAC CCCATACGGG AAAAACGGGC AAGAGGTTTC TCGACCATCT CGAAGACGCC
AATGGCAATG CCAACCCGGA AATCATGCGC CTTGCACTGA AGCTTGCAAC CGGAGCCGGC
AAAACCACCG TTATGGCCAT GCTCATTGCG TGGCAGGCGG TCAATGCCGC CCGCAGGCCG
CAAAGCCGGA AGTTCACGCG AGGCTTTCTC GTTGTCACGC CAGGCATCAC CATTCGCGAC
CGGCTCAGGG TTTTGCTGCC CAACGACCCC GACAGCTATT ACAAAAGCCG CGAACTTGTA
CCCGGCGACA TGATCGGTGA TATCGAACGG GCCAAGATCG TCATCACCAA CTACCACGCT
TTCAAGCTTC GTGAACGGCA CGAACTCTCC AAAGGCGGGC GATTGCTCCT GCAGGGCAGG
GGACAAAAGC TGCAGACCCT CGAAACCGAA GGGCAGATGC TGCAGCGGGT TATGCCGTAC
CTGATGGGTA TGAAGAACAT CATGGTCATC AACGATGAGG CACACCACTG TTACCGCGAA
AAACCGGATG GTGATGAATT CCAGGAACTC AGGGGCGACG AGAAAAAGGA AGCCGAAGAG
AACAATGAAG CGGCGCGGGT CTGGATTACC GGCATCGAAA CCGTGAAAAG AAAGCTTGGC
GTGAACTGGG TGATCGACCT GTCGGCTACG CCCTTTTTCC TGAGCGGTTC CGGCTATGCC
GAGGGTACGC TGTTTCCCTG GACCATGAGC GACTTTTCGC TGATGGACGC CATCGAAAGC
GGCATCGTCA AATTGCCGAG GGTGCCGGTA GCCGACAACG TACCCGGCGG CGACATGCCG
AAGTTCCGTG AACTCTGGAA GCATATCGGC AAAAAGATGC CGAAGAAAAG CCGCAGCAAG
ACAAACGCCT ACGACCCGCT CAGTATTCCG GTAGAGCTGC AGACTGCTCT CGAAGCGCTT
TACGGCCATT ACCGGCAGAC CTTCGACCTC TGGAAAGAGA ACAATATTTC CGTGCCACCC
TGCTTCATCG TGGTTTGCAA CAACACCTCG ACATCGAAGC TGGTGCACGA CTACATTTCC
GGTTTTTACC GCGAGCAGGA AGACGGCACG AAGCAACTGG TCAACGGGAG GCTCGAACTT
TTCAGGAACT TCGATGCGGA CGGTTCACCC CTTCCGCAAC CGAGCACACT GCTTATCGAC
AGCAAGCAGC TCGAATCCGG CGATGCGCTC GACAGGAACT TCCGCGACAT GGCCGGTGAC
GAAATCGAAC GGTTCCGGAG AGAGATCATC GAGCGAACCG GCGACCGCCG ACAGGCTGAA
AACCTCACCG ATCAGGACCT TCTGCGCGAG GTCATGAACA CCGTGGGAAA GCATGGCCGG
CTTGGGCAGT CGATCCGTTG CGTGGTGTCG GTCTCCATGC TTACCGAAGG GTGGGACGCC
AACACGGTCA CCCATGTGCT CGGCGTCCGC GCATTCGGCA CCCAGCTCCT CTGCGAGCAG
GTGATCGGGC GCGCTCTGCG CAGGCAGTCC TACGATCTCA ACGAAGAGTG TCTCTTCAAC
ACCGAATATG CCGATGTGCT CGGCATACCC TTCGACTTCA CCGCCAAGCC GGTTGTCGCC
CCGCCGCAGC CGCCGCGCGA AACCGTGCAG GTCAGGGCTG TCCGTCCGGA ACGCGATCAT
CTTGAAATCA CCTTTCCCAA CGTGGCGGGT TATCGTGTCG AACTGCCTGA AGAACAGCTT
ACCGCCGAGT TTACCGATGA ATCGGTGCTC GAACTTACGC CCGACCTTGT CGGCCCCTCG
ATCACGCGCA ACTCGGGCAT CATCGGCGAA GCCATCGACC TCAGCCTCGA ACACCTTGGC
GACATGCGCC AGTCAACCCT GCTGTTCCAC CTGACCCAGC GGCTGCTCTA CACCAAATGG
CGCGACCCGG AGGAGTCGCC CAAGCTGCAC CTCTTCGGCC AGCTCAAGCG CATCACCCGC
CAGTGGCTCG ACACCTGCCT TGTCTGCAAA GGGGGAACCT ACCCTGCGCA GCTCATCTAT
CAGGAACTTG CCGACATTGC CTGCAACCGC ATCACAGCAG CCATCACGAG GGCGGAAATC
GGCAGGCGGC CGGTCAAGGC CGTGCTCGAC CCTTACAATC CGACAGGTTC ATCAAGGTAT
GTGAACTTTA CCACATCGAA ACGCGACCGA TGGGAAACCG ATGCACGGCA CTGCCATGTC
AACTGGGTCA TTCTCGACAG CGACTGGGAA GCCGAGTTCT GCAGGGTTGC CGAATCGCAT
CCCAAGGTCC GTTCATACGT CAAGAACCAT AACCTCGGGC TCGAAGTCCC TTACCGGTAC
GGCTCCGAAA TGCGCCGATA CCTGCCGGAC TTCATTGTGC TCATTGACGA CGGCAACGGC
AGTGACGACC TCCTGCACCT CGTGGTTGAA ATCAAGGGCT ACCGGCGCGA AGACGCCAAG
GAGAAGAAAT CCACCATGGA TACCTACTGG ATTACCGGCG TCAACAACAT CGGCACTTAC
GGGCGCTGGG CATTCGCGGA GCTTACCCAG CCCTACACCT TCGAACTGGA TATGGGCAAG
CAGATCGAGG AGGCGTTCAG CAGAATGCTC GAACAGGCAT CGGCTGTTCA ATCAGAGGGA
GCGACGAGCC ATGCTTGA
 
Protein sequence
MSNPFFDHPI LNSPYRYPER HWELDEHGQP TQKIIDTRRP AQFITPIPKP RKRRSDEAQQ 
QLVFDEGKGL STETQQYDQT SLINAVRREV EKWRELPNPN NWQVTPETAR LLQHWRHHDF
SGIRPFFCQV EAVETAIWLT EVAPHTGKTG KRFLDHLEDA NGNANPEIMR LALKLATGAG
KTTVMAMLIA WQAVNAARRP QSRKFTRGFL VVTPGITIRD RLRVLLPNDP DSYYKSRELV
PGDMIGDIER AKIVITNYHA FKLRERHELS KGGRLLLQGR GQKLQTLETE GQMLQRVMPY
LMGMKNIMVI NDEAHHCYRE KPDGDEFQEL RGDEKKEAEE NNEAARVWIT GIETVKRKLG
VNWVIDLSAT PFFLSGSGYA EGTLFPWTMS DFSLMDAIES GIVKLPRVPV ADNVPGGDMP
KFRELWKHIG KKMPKKSRSK TNAYDPLSIP VELQTALEAL YGHYRQTFDL WKENNISVPP
CFIVVCNNTS TSKLVHDYIS GFYREQEDGT KQLVNGRLEL FRNFDADGSP LPQPSTLLID
SKQLESGDAL DRNFRDMAGD EIERFRREII ERTGDRRQAE NLTDQDLLRE VMNTVGKHGR
LGQSIRCVVS VSMLTEGWDA NTVTHVLGVR AFGTQLLCEQ VIGRALRRQS YDLNEECLFN
TEYADVLGIP FDFTAKPVVA PPQPPRETVQ VRAVRPERDH LEITFPNVAG YRVELPEEQL
TAEFTDESVL ELTPDLVGPS ITRNSGIIGE AIDLSLEHLG DMRQSTLLFH LTQRLLYTKW
RDPEESPKLH LFGQLKRITR QWLDTCLVCK GGTYPAQLIY QELADIACNR ITAAITRAEI
GRRPVKAVLD PYNPTGSSRY VNFTTSKRDR WETDARHCHV NWVILDSDWE AEFCRVAESH
PKVRSYVKNH NLGLEVPYRY GSEMRRYLPD FIVLIDDGNG SDDLLHLVVE IKGYRREDAK
EKKSTMDTYW ITGVNNIGTY GRWAFAELTQ PYTFELDMGK QIEEAFSRML EQASAVQSEG
ATSHA