Gene Clim_0222 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_0222 
Symbol 
ID6354678 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp245244 
End bp246290 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content48% 
IMG OID642667853 
Producttransposase IS4 family protein 
Protein accessionYP_001942299 
Protein GI189345770 
COG category[L] Replication, recombination and repair 
COG ID[COG3039] Transposase and inactivated derivatives, IS5 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAACA TCAATCCACC CGGCCTTTTT GATGAACAAT TCCAGCTCGA ACGTCTTACC 
CAGCTCAAAG ATCCGCTGGT GAAGCTGGAA CAATACATCG ACTGGAAAAT ATTTGCTCCT
ATTCTTGATG TCGTTTTCAA CAAACCAGAG AACCATAGCA ATGCCGGAAG ACCACCTTTT
GATCGGGTCA TGATGTTTAA GGTTCTTATC CTCCAGAGCC TGTACAGTCT CTCGGATGAT
GCCATGGAAT TTCAGATTAA CGATCGTCTC AGTTTCAAAC GCTTTCTGGG GCTCAAATCA
AGTGACCGGG TACCGGACAG CAAGACCATC TGGAAGTTCC GTGAAACCCT GATACAGGAA
GGCATTATCG AAGCCTTGTT TTACCGGTTC AATCAAGCCC TGGACGACCA GAGCATCTTT
GCCAAAACCG GGCAGATTGT TGATGCGAGC TTTGTTGAAG TGCCTCGGCA ACGCAACAGC
CGTGACGAAA ACGACCAGAT CAAAAAGGGC CAGACCCCGG AATCGTGGAA AGCAAAGCCG
AACAAACTGT GCCAGAAAGA CCGTGATGCC CGCTGGACAA AAAAGAACAA GATGAACTTT
TACGGCTATA AAAATCACAT CAAGGTTGAT CAGGGAACCA AGCTCATCAG TACGTATATG
GTGACGGATG CGGCTGTCCA TGACTCTCAG GAACTGGAAA CGCTGATCGA CAAGGAGGAT
GCGGGTCAGA AACTTTACGG TGACGCCGCA TACATCGGGC AGGAAGAAAG CATCGAAGCA
TGCGGCATGC AGAGCGAAAT CCATGAAAAA GCCACCAGAA ACCACAAGCT CACTGCAGAC
CAGAAAGCAA AGAATCGCCA GAAGTCGAAG GTTCGTTCCA GAGTCGAGCA TGTCTTCGGC
TTCATGACCA ACACCCTGAA AGCCATGACT ATTAAAACCA TCGGCTATGT AAGGGCTACC
GCAAAGATCG GATTGGCCAA CCTGACCTAT AACCTCATGC GCTGTGTGCA GCTGAAAAAG
AAAGTCTATG CGGTTTTCCT GGGATAA
 
Protein sequence
MKNINPPGLF DEQFQLERLT QLKDPLVKLE QYIDWKIFAP ILDVVFNKPE NHSNAGRPPF 
DRVMMFKVLI LQSLYSLSDD AMEFQINDRL SFKRFLGLKS SDRVPDSKTI WKFRETLIQE
GIIEALFYRF NQALDDQSIF AKTGQIVDAS FVEVPRQRNS RDENDQIKKG QTPESWKAKP
NKLCQKDRDA RWTKKNKMNF YGYKNHIKVD QGTKLISTYM VTDAAVHDSQ ELETLIDKED
AGQKLYGDAA YIGQEESIEA CGMQSEIHEK ATRNHKLTAD QKAKNRQKSK VRSRVEHVFG
FMTNTLKAMT IKTIGYVRAT AKIGLANLTY NLMRCVQLKK KVYAVFLG