Gene Information |
Locus tag | Caul_5358 |
Symbol | |
ID | 5897139 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010333 |
Strand | + |
Start bp | 70795 |
End bp | 71916 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641550650 |
Product | transposase IS116/IS110/IS902 family protein |
Protein accession | YP_001672136 |
Protein GI | 167621628 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3547] Transposase and inactivated derivatives |
TIGRFAM ID | |
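
The coordinate and length fields in this table can be cross-checked against each other. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 70795
END_BP = 71916
GENE_LENGTH_BP = 1122
PROTEIN_LENGTH_AA = 373


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length: int) -> int:
    """Residue count, assuming the CDS ends in one uncounted stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Both checks agree with the table under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 71916 - 70795 + 1 gives 1122 bp, and 1122 / 3 - 1 gives 373 aa, matching the Gene Length and Protein Length fields.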
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0901346 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGGATCAGG AAATCATCTT CGTAGGATTG GATGTGCACA AGGCTACGAT CGCCGTGGCG CTGGCGCCTG ACGGCCGATC GAACGAGGTC AGATTCCAGG GCTCCGTGCC GCATCGCCCG GGGGCTATGG CGGACCTGGG GCGTCGGTTG CAAAGCAAGC ATCCGGGCGC CCGACTAACA TTCTGCTACG AGGCAGGGGC ATGCGGGTAC GGCTTGGCGC GCGAGCTCAC CGACGCCGGC TACGAATGCT TGGTGATCGC GCCTTCGCTT ATCCCGTCGC GGCCCGGAGA CCACATCAAG ACCGACAAGC GCGACGCCAC CATGCTGGCG AAAATGCATC GCGCTGGGGA ACTGACGACG GTCTGGGTAC CGGACGCCAG CCACGAGGCG GTCCGTGACC TCGTCCGCGC CCGGGAGGCG GCCATGTATT GGCGGAACCG GGCGCGCCAA GCGCTCTCCA GCTTCCTGCT GCGGCAGGGC TACCGCTATG GCGGCCGTAG CGCCTGGACG GCGTCCTACT GGCGCTGGCT GGCGACGCTT GTACTTGAGC ATCCCGCCCA ACAGATCGCG ATGCAGGAGT ACATCCTGGC GATCCATGAA GCCGACGCGC GACACGATCG GCTCGTGAAG CAGATCGAAG GGATCATCCC GACTTGGTCG ATGGCGCCCT TGGTGACCGC GCTACAGGCG ATGCGCGGCA TCGCGATGAT CACCGCCGCC ACCATCGTTG CCGAGATCGG AGACCTCGGG CGCTTCGCAA CGCCGCGTAA GCTGATGGCC TACCTCGGCC TGGTCCCGGC CGAGCACTCC AGCGGCGCCA AGATCCGCCG GCGCGGCATC ACCAAAGCCG GAAATCCCCG GGCGCGGCGT TTGCTTGTCG AAAGCGCATG GACCTACGCC AAGTCACCTC GAGTCAGTCA GGTGATCCAG CGCCGGCAGG AAGGCGTCGA GCTGGAGATC GTGAAGATCG CCTGGAAGGC GCAGCTGCGG CTACACGACC GCTATCGCCG ACTGGCGGCG ACGGGCAAGC CGAAGAACGT GGTGATCACG GCCATCGCTC GGGAGCTGGT TGGCTTCATC TGGGCCATAA GCCAGCGTAT CAGCCCGCCC GCTACTGCCT GA
Protein sequence | MDQEIIFVGL DVHKATIAVA LAPDGRSNEV RFQGSVPHRP GAMADLGRRL QSKHPGARLT FCYEAGACGY GLARELTDAG YECLVIAPSL IPSRPGDHIK TDKRDATMLA KMHRAGELTT VWVPDASHEA VRDLVRAREA AMYWRNRARQ ALSSFLLRQG YRYGGRSAWT ASYWRWLATL VLEHPAQQIA MQEYILAIHE ADARHDRLVK QIEGIIPTWS MAPLVTALQA MRGIAMITAA TIVAEIGDLG RFATPRKLMA YLGLVPAEHS SGAKIRRRGI TKAGNPRARR LLVESAWTYA KSPRVSQVIQ RRQEGVELEI VKIAWKAQLR LHDRYRRLAA TGKPKNVVIT AIARELVGFI WAISQRISPP ATA
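
The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11 (bacterial, archaeal and plant plastid code), where GTG is one of the permitted initiation codons and is read as methionine at the first position. The following is a minimal sketch of that rule, not the pipeline that produced this record: `translate_cds` is a hypothetical helper, and the codon table is built from the standard NCBI amino-acid string, which table 11 shares with table 1.

```python
from itertools import product

# NCBI codon ordering: bases T, C, A, G with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons listed for NCBI translation table 11.
TABLE_11_STARTS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a table 11 start codon as M and stopping at a stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    usable = len(dna) - len(dna) % 3    # ignore a trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in TABLE_11_STARTS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record.
FIRST_30_BASES = "GTGGATCAGG AAATCATCTT CGTAGGATTG"
LAST_12_BASES = "GCTACTGCCT GA"

print(translate_cds(FIRST_30_BASES))  # MDQEIIFVGL
```

The output matches the first ten residues of the protein sequence listed above. The final codon of the gene, TGA, is a stop codon, which is why the 1122 bp gene yields 373 residues.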