Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_5388 |
Symbol | |
ID | 5897208 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010333 |
Strand | + |
Start bp | 98285 |
End bp | 99973 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641550678 |
Product | transposase IS66 |
Protein accession | YP_001672164 |
Protein GI | 167621656 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3436] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGCTG ACCTCGCAGC CCTGCCGGAC GATATCGAAG CGCTGAAGGC GGCGCTTCTG GTCGCCCGGG CCGAGGTCGC GCAAGCGCAA GACGTGGCCG CGAGAGCTCA GGCCGAAGCC TCCAGAGCCC AGGCCGAAGC CGCTGAAGCC AAGGCGCGCG TGTCTGACGA CCAAGCGCTG ATCGCCCACC TGAAGCTCCA GATCCAGAAG CTCAATCGTG AGCGCTTCGG CCCTAGCTCG GAACGCACGG CCCGTCTGCT TGATCAGCTG GAACTGCAGT TGGAGGAGCT GGAGGCTTCG GCGACGGAAG ACGAGCTGGC CGCCGAGATG GCGGCGGCTC GGACCACGAC GGTGGCCGCC TTCAGCCGCA AGCGGCCTTC GCGCCAGCCC TTCCCGGAAC ACCTGCCGCG TGAGCGGGTG ATCGTGCCAG GTCCGACCGC CTGCGCCTGC TGTGGCGGGC TGCGCCTCTC GAAGCTGGGC GAAGACGTTA CCGAAACGCT GGAGGTCGTG CCCCGGTCCT GGAAGGTCAT CGCGCACGTC CGCGAGAAGT TTAGCTGCCG CGACTGTGAG GCCATCGGCC AGGCGCCGGC TCCGTTCCAT GTGATCGCCA GGGGCTGGGC GGGTCCCAGC CTGCTGGCCA TGATCCTGTT CGAGAAGTTT GGTCAGCATC AGCCGCTCAA TCGCCAGGCC GACCGCTATG CTCGCGAGGG CGTGCCGCTC AGTCTGTCGA CCTTGGCCGA TCAGGTCGGG GCCTGCACGG CGGTGCTGGC GCCGCTGTTC CAGCGGCTGG AGGCTCACGT GCTTGCCGCC GAACGATTGC ACGGCGACGA CACCACGGTT CCGGTATTGG CCAAGGGCAA GACCGACACC GCCAGGCTCT GGGTCTATGT GCGCGACGAC AAGCCGTTCG CGGGATCGGC GCCGCCGGGC GCGGTCTTCT ACTACTCGCG TGATCGGGGT GGCGAGCATC CGCAAGCGCA CTTGTCAGGT TATGCCGGCC TGTTCCAGGC CGACGCCTAT GGCGGTTACG GCAAGCTCTA TGAGCCAGGG CGAAACCCAG GTCCCATTCT TGAAGCAGCC TGCTGGGCAC ACGCGCGTCG GCCGTTCTTC GTGCTGGCCG ACCTGGAGCA GAATGCGCGC CGCAAGGCTC GCGGCGCGGC GCCGGCGGTG ATCTCGCCGA TCGCCCTGGA GATGGTCCAG CGGATCGACG CGCTGTTCGA GATCGAGCGG GGGATCAGCG GCCAGGACGC AGATAGGCGC CTAGCGGTGC GACAGGCGCT CAGCGCCCCG CTGGTCGCCG AGATGGAGAT CTGGATGCGC GAGCAGCGCG CCAAGCTCTC ACGCGGTCAT GACTTGGCCC GGGCCTTCGA CTACATGCTC AAGCGCTGGG CCGCGTTCAC GCGCTTCCTC GACGACGGCC GCGTCTGTCT GAGCAACAAT GCCGCCGAGC GGGCGCTGCG CGGCGTGGCC ATGGGGCGTA AGTCCTGGCT GTTCTGTGGT TCTGATCGCG GCGGTCAACG CGCGGCGGTG ATGTACAGCC TGATCGTCAC CGCCAAGCTG AACGACATCG ACCCTCAAGC CTGGCTGGCC GACGTCCTGG CCCGCATCGC CGAGCATCCC AGCCAGCAGC TCGATGAACT ACTGCCCTGG AACTGGCAGC CCCTCGCTAC CGCTGACCGC GCCGCTTAG
|
Protein sequence | MDADLAALPD DIEALKAALL VARAEVAQAQ DVAARAQAEA SRAQAEAAEA KARVSDDQAL IAHLKLQIQK LNRERFGPSS ERTARLLDQL ELQLEELEAS ATEDELAAEM AAARTTTVAA FSRKRPSRQP FPEHLPRERV IVPGPTACAC CGGLRLSKLG EDVTETLEVV PRSWKVIAHV REKFSCRDCE AIGQAPAPFH VIARGWAGPS LLAMILFEKF GQHQPLNRQA DRYAREGVPL SLSTLADQVG ACTAVLAPLF QRLEAHVLAA ERLHGDDTTV PVLAKGKTDT ARLWVYVRDD KPFAGSAPPG AVFYYSRDRG GEHPQAHLSG YAGLFQADAY GGYGKLYEPG RNPGPILEAA CWAHARRPFF VLADLEQNAR RKARGAAPAV ISPIALEMVQ RIDALFEIER GISGQDADRR LAVRQALSAP LVAEMEIWMR EQRAKLSRGH DLARAFDYML KRWAAFTRFL DDGRVCLSNN AAERALRGVA MGRKSWLFCG SDRGGQRAAV MYSLIVTAKL NDIDPQAWLA DVLARIAEHP SQQLDELLPW NWQPLATADR AA
|
| |