Gene Information |
Locus tag | Caul_5462 |
Symbol | |
ID | 5897142 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010333 |
Strand | - |
Start bp | 176188 |
End bp | 177846 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641550749 |
Product | transposase |
Protein accession | YP_001672235 |
Protein GI | 167621727 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
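The coordinate and length fields above can be cross-checked against each other. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon, which is consistent with the values in this record but is not stated by it. The function name `cds_lengths` is hypothetical.

```python
from typing import Tuple

# Values copied from the record above.
START_BP = 176188
END_BP = 177846


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the CDS ends
    with exactly one stop codon that is not counted in the protein.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(START_BP, END_BP)
print(gene_len, protein_len)  # 1659 552
```

Under these assumptions the result agrees with the Gene Length (1659 bp) and Protein Length (552 aa) fields.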
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAGAA CCGGGGCCAG CCACGAACCA CAGAGTGAGG AGCGGCGCTG GGACCTGGCG GTCGCGCGCC AGCGAGAGCT GCGCCCGCTG TTGGACAATC CCAACCGCAC ACGCGCCGAG GTCGAGGCCG CCGCACAGCG CCTGAACGTC CACACCGCGA CAGTCTACCG GCTACTCGCC CGTTATGCGG TCGGCGAGAC CGCGCAGGCG GTCGTGACCG GGGTAGGGGG ATGGAAAGGC GGGCGGCCAC GGCTACACGC CCGCGTCATT GAGCTCATCG ACATGGCCGT CGAACGGCTG TTCCTCGACC GCCAGGCGAT TAGCAAAGCC CAGCTGGGGC GCGAAATCGC CCGGCGCTGC GCGTCGGAGT CCATGGCGAT GCCCTCGCGT TCGGTGATCA GCCGCCGGCT GCAACGGATC GGCGAGCGGG AGCAGGTCCG GCGCCGCAAG GGCCCCGGCG CGGCCGAGGC CGTGACGATG CGACCGGGGA GTTTGACCGT GACCCAGCCG AACGCGGTTT GGCAAATCGA TCATTCCCCG GCCGACGTCA TCCTCGTGGA CGCGGACTCG CGCGCGCCGA TCGGCCGGCC TTGGGTGACG CTGGTCATCG ACGTGGCGTC CAGGGTGGTC ACGGGGCTCT ACGTCTCGTT GGATCCTCCG TCCGTCGTTT CGGTCGGCAT GGCGCTGCAA CACGCCATCC TGCCGAAGGA TGAGGCCCTG GCAGAGCGCG GGATCTCGGC CGAGTGGCCG GCCTTTGGCC TGCCTGAGCT GGTTCACACC GATAACGGTT CGGATTTTCG GAGCCGATCG TTCTCCCACG CGTGCAGCAA TCTCGGCATC GAGACCGATC GCCGGCCCGT CGGCGCGCCC CGCTACGGCG GCCACATCGA GCGCCTGATC GGATCGGTGA TGAGCGAGAT GCACCTGCTG CCGGGCGCCA CCTTCTCCAA CGTGGCCGCC CGCGGCGACT ACGCCAGCGA TGCGGCGGCG ATCATGACGA TCGACGAGTT CGAGGTGTGG CTCCGGCGGT TCGTCGCCGG CGACTACAAT CGCCGGATCC ACGCGAGCCT CTCAGCCCCG CCGCTGGAGA CCTGGCGCCG TCTATGCTCC GAACAGGCTG TCACGCCCCG CCAACCCCTC GACCCCGAAG GTCTGGCGGT CGCCTTTCTC CCGCGGATCT CGCGCACCAT CACCCGTCAG GGCGTCAGCT TCCATCACGT GCGCTACTAC GAGCCCTTTC TGGCGCCGTT GTTCGACAGT GGCCTGCGCC GCATCGAACT CGCCTACGAT CCGCGGGACC TCTCGCGACT GCTCGTCGAG ACGCCGCAGG GCGTGCGAGC CCTTCGCTAT CGCAACCTCA CCCGCCCGCC CATGAGCCTG TGGGAGTTGC GGGCGGCGCG CCGCCGGCTC CGCGCCGAGG GGCGGGCCCA GGTCGACGAG GCGGCCCTGT TTGCGGCCCG CGAAGCGAAC GGAGCCCTGG TGGCCACGGC GGCGCGGGAA AGTCGCCGGA TTCGACGCGA GTCCGTGAGG CGTGAGCGTC ACCGGGCCGA AGTGAAGCCC GCTCCAACCG CCGAGGACGA GATCCCGCCA GAGCCGATCG ATGTGGGCGC GCCGACAAAC CCGACCTCGG GGTTTGTCGT GGAGATCGAG CCATGGTGA
|
Protein sequence | MARTGASHEP QSEERRWDLA VARQRELRPL LDNPNRTRAE VEAAAQRLNV HTATVYRLLA RYAVGETAQA VVTGVGGWKG GRPRLHARVI ELIDMAVERL FLDRQAISKA QLGREIARRC ASESMAMPSR SVISRRLQRI GEREQVRRRK GPGAAEAVTM RPGSLTVTQP NAVWQIDHSP ADVILVDADS RAPIGRPWVT LVIDVASRVV TGLYVSLDPP SVVSVGMALQ HAILPKDEAL AERGISAEWP AFGLPELVHT DNGSDFRSRS FSHACSNLGI ETDRRPVGAP RYGGHIERLI GSVMSEMHLL PGATFSNVAA RGDYASDAAA IMTIDEFEVW LRRFVAGDYN RRIHASLSAP PLETWRRLCS EQAVTPRQPL DPEGLAVAFL PRISRTITRQ GVSFHHVRYY EPFLAPLFDS GLRRIELAYD PRDLSRLLVE TPQGVRALRY RNLTRPPMSL WELRAARRRL RAEGRAQVDE AALFAAREAN GALVATAARE SRRIRRESVR RERHRAEVKP APTAEDEIPP EPIDVGAPTN PTSGFVVEIE PW
|
| |
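The gene and protein sequences above are printed in blocks of 10 characters separated by spaces. A minimal sketch of how the two relate is given below: it strips the spacing, translates codons, and computes a GC fraction. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The codon-to-amino-acid mapping used is the standard one, which translation table 11 shares (table 11 differs only in the set of permitted start codons). Only the first three and the last blocks of the gene sequence are used here, so the full-length result is not shown.

```python
from itertools import product

# Standard codon-to-amino-acid mapping, codons ordered over T, C, A, G.
# Assumption: this is sufficient for table 11, whose amino acid
# assignments match the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks and final block of the gene sequence above.
head = "ATGGCCAGAA CCGGGGCCAG CCACGAACCA"
tail = "CCATGGTGA"

print(translate(head))  # MARTGASHEP
print(translate(tail))  # PW
```

The two results match the first block and the last two residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.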