Gene Caul_5358 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5358 
Symbol 
ID5897139 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010333 
Strand
Start bp70795 
End bp71916 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content66% 
IMG OID641550650 
Producttransposase IS116/IS110/IS902 family protein 
Protein accessionYP_001672136 
Protein GI167621628 
COG category[L] Replication, recombination and repair 
COG ID[COG3547] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0901346 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGATCAGG AAATCATCTT CGTAGGATTG GATGTGCACA AGGCTACGAT CGCCGTGGCG 
CTGGCGCCTG ACGGCCGATC GAACGAGGTC AGATTCCAGG GCTCCGTGCC GCATCGCCCG
GGGGCTATGG CGGACCTGGG GCGTCGGTTG CAAAGCAAGC ATCCGGGCGC CCGACTAACA
TTCTGCTACG AGGCAGGGGC ATGCGGGTAC GGCTTGGCGC GCGAGCTCAC CGACGCCGGC
TACGAATGCT TGGTGATCGC GCCTTCGCTT ATCCCGTCGC GGCCCGGAGA CCACATCAAG
ACCGACAAGC GCGACGCCAC CATGCTGGCG AAAATGCATC GCGCTGGGGA ACTGACGACG
GTCTGGGTAC CGGACGCCAG CCACGAGGCG GTCCGTGACC TCGTCCGCGC CCGGGAGGCG
GCCATGTATT GGCGGAACCG GGCGCGCCAA GCGCTCTCCA GCTTCCTGCT GCGGCAGGGC
TACCGCTATG GCGGCCGTAG CGCCTGGACG GCGTCCTACT GGCGCTGGCT GGCGACGCTT
GTACTTGAGC ATCCCGCCCA ACAGATCGCG ATGCAGGAGT ACATCCTGGC GATCCATGAA
GCCGACGCGC GACACGATCG GCTCGTGAAG CAGATCGAAG GGATCATCCC GACTTGGTCG
ATGGCGCCCT TGGTGACCGC GCTACAGGCG ATGCGCGGCA TCGCGATGAT CACCGCCGCC
ACCATCGTTG CCGAGATCGG AGACCTCGGG CGCTTCGCAA CGCCGCGTAA GCTGATGGCC
TACCTCGGCC TGGTCCCGGC CGAGCACTCC AGCGGCGCCA AGATCCGCCG GCGCGGCATC
ACCAAAGCCG GAAATCCCCG GGCGCGGCGT TTGCTTGTCG AAAGCGCATG GACCTACGCC
AAGTCACCTC GAGTCAGTCA GGTGATCCAG CGCCGGCAGG AAGGCGTCGA GCTGGAGATC
GTGAAGATCG CCTGGAAGGC GCAGCTGCGG CTACACGACC GCTATCGCCG ACTGGCGGCG
ACGGGCAAGC CGAAGAACGT GGTGATCACG GCCATCGCTC GGGAGCTGGT TGGCTTCATC
TGGGCCATAA GCCAGCGTAT CAGCCCGCCC GCTACTGCCT GA
 
Protein sequence
MDQEIIFVGL DVHKATIAVA LAPDGRSNEV RFQGSVPHRP GAMADLGRRL QSKHPGARLT 
FCYEAGACGY GLARELTDAG YECLVIAPSL IPSRPGDHIK TDKRDATMLA KMHRAGELTT
VWVPDASHEA VRDLVRAREA AMYWRNRARQ ALSSFLLRQG YRYGGRSAWT ASYWRWLATL
VLEHPAQQIA MQEYILAIHE ADARHDRLVK QIEGIIPTWS MAPLVTALQA MRGIAMITAA
TIVAEIGDLG RFATPRKLMA YLGLVPAEHS SGAKIRRRGI TKAGNPRARR LLVESAWTYA
KSPRVSQVIQ RRQEGVELEI VKIAWKAQLR LHDRYRRLAA TGKPKNVVIT AIARELVGFI
WAISQRISPP ATA