Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_3494 |
Symbol | |
ID | 8334847 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 3892342 |
End bp | 3893670 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644956638 |
Product | transposase IS116/IS110/IS902 family protein |
Protein accession | YP_003114241 |
Protein GI | 256392677 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3547] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0376392 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.207887 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGATC ATGCCGAGGT GGTGGAGCTC GCCGTCGAGG TCGAGCAGGT GGGGCGGGTC GCGGCGATCG ATGTGGCCAA GGCCTCCGGG ATGGTGTGCA CCCGGCTGCC GTCGGAGACG AAGGCGCAGC GCCGGGTGCA GAGGGTTTGG GCGGTGGCGG CGACCACCGA CGAGATCACA GCTCTGGGTG ATCATCTGGT GTGTCAAGGT GTGGAGTTGG TGGTGATGGA GGCCACCGGG GTGTTTTGGA GACCGTGGTT CTATCTGCTG GAGGATCGGG GCCTGCGGGT GTGGCTGGTG AACGCCCGGG ATGTGAAGAA CGTGCCCAGC CGTCCGAAAA CCGACAAGCT GGATGCGATC TGGCTGGCGA AGCTGGCCGA GCGGGGGATG TTGCGGGCCT CGTTCGTGCC GCCGGAGCCG GTGCGCAGGC TGCGGGATCT GACCAGGCTG CGCCGCACCC TGGTGGAGGA GCGCACCCGG TATCGGCAGC GGGTCGCCGA TGTGTTGCAG GACGCGTGTT TGAAGGTCGC CGACCCTAAA CAGGGCCTGA CCGACCTGTT CGGCATGTCC GGGCGGGCGA TCCTGGCCGC GCTGGTCGCC GGGCAGCGCG ACCCAAAGGC CTTGGCCGCC TTGGCGATGG GGCGGGCTGT GGTGAAGACC GCATACTTGG AGAAGGCCCT GGCCGGCCGG TTCACCGCCC ACCACGGCTT CCTCGTGGGC AAGCTGCTCG ATCTGCACGA CAGGCTGGAG AACGACATCG CCGAGCTGAA CGCCCGGATC GAAGCCATGA TCGCCGAACT GGACCGGACT CCGCCACCGG ATGACAACCA CCCAGACCGG CTACCCCTGT TGGACCGACT CGACGAGATC CCCGGAGTCT CCCGGGAGAT CGCCGCCGAC ATCCTCGGCG AGACCGGGTT CGACATGACC GTCTTCCCCA CCGGCGGGCA CCTGGCCTCC TGGGCCAAAC TCACCCCACG CACCATCCAA TCCGGCGCCA GAAACAGCCA CGGCGGCACC GGTAAAGGCA ACCGCTGGAT CAAAGGACCG CTCGGACAGG CCGCGCAGGC CGCCGGCCGC ACCAAAACCT TCCTCGGGGC ACGCTACAAA CGCATCGTCA AACACGCCCC GGCGAAGAAG GCCCAGGTCG CCGTAGCCCG CAACATCCTG GAGATCGCCT GGGTGCTCAT CAACGACCCC GACGCCCGCT TCACCGACCT CGGCCCCGAC TGGCACACCC GGCGAACCGA TGAAACCCGC AAAACCCGAC AGCACATCCG CGAACTCGAA CACCTCGGCT ACACCGTCAC CCTGACCAAA GCAGCCTGA
|
Protein sequence | MDDHAEVVEL AVEVEQVGRV AAIDVAKASG MVCTRLPSET KAQRRVQRVW AVAATTDEIT ALGDHLVCQG VELVVMEATG VFWRPWFYLL EDRGLRVWLV NARDVKNVPS RPKTDKLDAI WLAKLAERGM LRASFVPPEP VRRLRDLTRL RRTLVEERTR YRQRVADVLQ DACLKVADPK QGLTDLFGMS GRAILAALVA GQRDPKALAA LAMGRAVVKT AYLEKALAGR FTAHHGFLVG KLLDLHDRLE NDIAELNARI EAMIAELDRT PPPDDNHPDR LPLLDRLDEI PGVSREIAAD ILGETGFDMT VFPTGGHLAS WAKLTPRTIQ SGARNSHGGT GKGNRWIKGP LGQAAQAAGR TKTFLGARYK RIVKHAPAKK AQVAVARNIL EIAWVLINDP DARFTDLGPD WHTRRTDETR KTRQHIRELE HLGYTVTLTK AA
|
| |