Gene Clim_2467 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_2467 
Symbol 
ID6354737 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp2705381 
End bp2706697 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content50% 
IMG OID642670056 
Productrestriction modification system DNA specificity domain 
Protein accessionYP_001944466 
Protein GI189347937 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGTTG AAAACCTTCA ATGCGCAGAT GCGTCACTTG AAACAATTCA TTGTCAACAA 
AGTGATGGAG ATTCTGGAGA TTGGATGAAA GTTGGTTTAA CCGAGTCAAC GCTTGCTGAG
GTATGTAGTC TCGTTACCGA CGGTACGCAT GATACTCCGA AGCGGGTCGA AACCGGCTAT
CCTCTTGTCA AGGCGAAGGA AATTTCAGGG GGTCGGATTG ATTTTGATAA CTGTGATCAG
ATTTCTGAGC AAGAGCACCT TAAAGTCATC GCTCGATCCA AGCCAGAATT TGGTGATACA
CTTTTTGCCC ACATCGGTGC ATCATTGGGT GAGGCGGCGT TCGTGAATAC CACTCGTGAG
TTCAGTATTA AAAACGTTGC GCTGTTCAAA CCGAATCCAT CCGTGATTGA TGCTCGTTAC
CTCTATTACC TTGTAGTCAG TCCCGCATTC CAATCACTTG CTAAAGGAAC AAGGACCGGT
TCGGCGCAAC CGTTTCTCGG ACTCAGTCAG TTGCGCGGAC ACCAAATTCA ATATCATCGT
GACTTGGCCC ATCAAAGGCG AATTTCGGGT ATTCTTTCAG CGTATGATGA CCTGATTGAG
AACCGTCAGC GACGCATCCG GATTTTGGAG GAGATGGCCC GCTCTCTCTA CCGTGAGTGG
TTCGTCCACT TCCGCTTCCC CGGACACGAA AACCATCCGC TTGTTCCCTC TTCTCTTGGC
GTCATTCCGC AGGGGTGGGA GGTGAAAAAG CTTGGTGATA TAGCGGAAAG CATGCGACGC
AACGTGTCGA AAGGCAAACT CGAAGAAAGA ACGCCGTACG TCGGTCTTGA ACATATTCCT
CGGCAATCGC TTGCACTCGA TGCATGGGAA ATGGCAACCG CACTCGGCTC GAACAAACTG
GAGTTCAAGA AAGGTGAAGT TCTGTTCGGC AAGATTCGGC CATACTTCCA TAAGGTCAGT
GTTGCGCCCT TCGTCGGACT TTGCTCCGCC GACACCATCG TCATCCGCGC CCTTCGTCCA
GAGCATTACG GCATTGTCGT CGCATGTGTC TCAAGTGATG AGTTTGTTGC CGTTGCGAGC
GCGACCGCAA ACGGCGCGAA GATGCCCCGG GCAAATTGGA ATGTGCTTGA GAAATACCAA
GTAGTTATTC CAAAAGGCAA TCTGGCAGAG AAATTCTCTG CGCTGTTCGC TGATATTATT
GCTCAGCAAC AAACGCTTAT TTTCAAAATC CAAAATCTTC GCCAGACGCG CGACCTGCTG
CTGCCGCGTC TACTGTCGGG GGAGGTGAAA CTCAAGGAAA CTGACGAACC ATTATGA
 
Protein sequence
MNVENLQCAD ASLETIHCQQ SDGDSGDWMK VGLTESTLAE VCSLVTDGTH DTPKRVETGY 
PLVKAKEISG GRIDFDNCDQ ISEQEHLKVI ARSKPEFGDT LFAHIGASLG EAAFVNTTRE
FSIKNVALFK PNPSVIDARY LYYLVVSPAF QSLAKGTRTG SAQPFLGLSQ LRGHQIQYHR
DLAHQRRISG ILSAYDDLIE NRQRRIRILE EMARSLYREW FVHFRFPGHE NHPLVPSSLG
VIPQGWEVKK LGDIAESMRR NVSKGKLEER TPYVGLEHIP RQSLALDAWE MATALGSNKL
EFKKGEVLFG KIRPYFHKVS VAPFVGLCSA DTIVIRALRP EHYGIVVACV SSDEFVAVAS
ATANGAKMPR ANWNVLEKYQ VVIPKGNLAE KFSALFADII AQQQTLIFKI QNLRQTRDLL
LPRLLSGEVK LKETDEPL