Gene Clim_0240 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_0240 
Symbol 
ID6354696 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp267975 
End bp269543 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content53% 
IMG OID642667868 
Producttransposase IS4 family protein 
Protein accessionYP_001942314 
Protein GI189345785 
COG category[L] Replication, recombination and repair 
COG ID[COG5421] Transposase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.995289 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCATATC GGGTTCACCA GCTCAACAAG AAGACCGGAG TCACCTATGT TTACGAGGCG 
GTCTCTACCT GGGACAAAAC GCTCAAGCAG GCCAGAAACA AACAGATCTG CGTCGGCAAG
ATCGATCCGG TGACCGGTGA GTTTGTGCCG TCAAAACGGC TTGATCCTGC CCAAAGTGCG
CTTCGCGATC CAGCTGTGAC CGCTTCGGCT CAGGTGATGG GTCCAACCTT CGTGCTCGAT
GCGATTGCCC TGCGTACAGG GGTGAGCGCG CTCATGAAAT CAGTATTTCC GCAGTCGCAT
CAGGAGCTCA TGGCCATGGC ATTCTATTTG ACCAGCCAAG GGGGCGCATT GAGCCTTTGC
GCCTCATGGG CCAAGGGCCA TATGCCTGAC CTTGCGGCAT CACTTGGCAG CCAGCGCATG
AGCGATTTGC TTGCCTCAAT CGGAACCGAC CGGAAGCAGG CCTTCTTTGC CAAGTGGATG
AAGATGCGTC TGGAGAACGA TTACCTGTGC TATGATATCA CCTCGGTCTC CTCATATTCG
GAGCTGAACG AGTATATCAA GTACGGCTAC AATCGTGATG AAGAGAAGTT GCCACAACTG
AACCTGGCCA TGTTGTTTGG ACAGAAGTCC GGATTACCGG GCTATTACCA TCGGATTCCT
GGCAATATCA ATGATGTGTC AACCTTGCAT AACCTTCTGG AGACCTTCAG AATGCTGGAG
ATCGGGCAAT TGCATTATGT GATGGATAAA GGATTTTACA GCAAGAAGAA TGTCGATGAT
CTGGTCGGAT ACCGCGACCA TTTCACCATC TCGGTACCGA TAAACAATCG GTGGCTACAG
CGGGCTATCG ATGACATCCA TCAGACGATT CACGGCCCTG AAGGGTATCG CAGGCTCGAT
GACGAAATCC TGTATGTGCA CTCACGCTTC TACCCGTGGG GAGAAGCACG GAGACGGTGC
TACCTGCATC TGTACTACAA CGCCACCAAA CGGGCACGGG AGATCGACAC GTTCAATGAG
TCGTTGTTCC GGTATCGGGA GGAGCTTGAA TCCGGCAAAC CGATCGCTGC CCACCAGAAG
GCGTATGAGG ATTTCTTTAC CGTGAAAACG ACACCGAAAC GAGGAACGAT AGTCTCCTTC
AACACCGAGG CGATCAACCG CCATATCAGC CGGTATGCCG GGTTCCAGGC ACTGCTCTCC
AGTGACATCA AGGATCCGGT CGAAGCCCTG CGTGTCTATC GTAATAAGGA TTCTGTGGAA
AAGTGTTTCG ATGACCTGAA AAACACACTC GATATGAAGC GGCTGAGAAT GCACTCCTCA
GCGACGGTTG ACGGACGACT GTTTATCCAG TTCATCGCCC TGATACTCAT CAGTGCGCTT
CGCAAGCAGA TGCGGGATTC CGGATTGATC GAGCAGTATA CGGTGCGCGA ACTGCTCAGG
GAGATGGAGA CGCTCACCAA GATAACCTAT TCCGGAAAGT ACGGGCATAT CCTTACCGAA
CTGACCAAGC CTCAGCGTCA GATTCTCACT GCTCTCAATA TTCCCGTCCT TGACCCGGCA
TCGTTATAA
 
Protein sequence
MAYRVHQLNK KTGVTYVYEA VSTWDKTLKQ ARNKQICVGK IDPVTGEFVP SKRLDPAQSA 
LRDPAVTASA QVMGPTFVLD AIALRTGVSA LMKSVFPQSH QELMAMAFYL TSQGGALSLC
ASWAKGHMPD LAASLGSQRM SDLLASIGTD RKQAFFAKWM KMRLENDYLC YDITSVSSYS
ELNEYIKYGY NRDEEKLPQL NLAMLFGQKS GLPGYYHRIP GNINDVSTLH NLLETFRMLE
IGQLHYVMDK GFYSKKNVDD LVGYRDHFTI SVPINNRWLQ RAIDDIHQTI HGPEGYRRLD
DEILYVHSRF YPWGEARRRC YLHLYYNATK RAREIDTFNE SLFRYREELE SGKPIAAHQK
AYEDFFTVKT TPKRGTIVSF NTEAINRHIS RYAGFQALLS SDIKDPVEAL RVYRNKDSVE
KCFDDLKNTL DMKRLRMHSS ATVDGRLFIQ FIALILISAL RKQMRDSGLI EQYTVRELLR
EMETLTKITY SGKYGHILTE LTKPQRQILT ALNIPVLDPA SL