Gene Information |
Locus tag | Arth_4362 |
Symbol | |
ID | 4443473 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008538 |
Strand | + |
Start bp | 101547 |
End bp | 102791 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639687683 |
Product | transposase IS116/IS110/IS902 family protein |
Protein accession | YP_829380 |
Protein GI | 116662326 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3547] Transposase and inactivated derivatives |
TIGRFAM ID | |
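
The coordinate, length, and GC fields in this table can be cross-checked with a few lines of arithmetic. The sketch below is illustrative and not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names (`gene_length`, `protein_length`, `gc_percent`) are hypothetical helpers introduced for this example.

```python
# Values copied from the Gene Information table above.
START_BP = 101547
END_BP = 102791


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Number of residues, assuming the last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string; spaces between blocks are ignored."""
    bases = seq.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints "1245 414"
```

Under those assumptions the results agree with the Gene Length (1245 bp) and Protein Length (414 aa) fields. Passing the gene sequence from the Sequence section to `gc_percent` gives a value that can be compared with the reported GC content.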
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0000113167 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGGTCA TTCACCCGCG CTGCGCGGGT GTCGATATTT CAAAGAAGGA CGCGAAGGTC TGTGTCCGCA TCCAGGGCCG CGGCGGCCGG TCCACCACAT CCACCATCAC GACCTGGGGT TCGATGACGG GGCAGATCCT GGGGCTAAAG GAACACCTCC TGGACGAGCA CGTGGATCTG GTGGTCATGG AGGCCACCGG TGATTACTGG CGCCCCTTCT ACTACCTGCT TGAAGACGAC GGACTGAACA TCATCCTGGT CAATGCCCAT GACGCCAGGA ACGTCCCGGG CCGCAAAACC GACGTTTCCG ACGCTGCCTG GCTTGCCGAC CTGGGCGCCC ACGGGCTGGT CCGGGCCTCC TTCGTGCCCC CGCCTCCGAT CCGTGAACTG CGGGACCTGA CCCGGGCCCG GACCATCATC ACTCAGGAAC GGACCCGGGA AATCCAGCGC CTGGAGAAAC TCCTCGAGGA CGCCTGCATC AAACTATCCT CGGTAGCTTC GAACATCACC GGGGTCTCCG GACGACTGAT CCTCCAAGCA CTCATCGACG GGCAGACCGA CCCCGCGGCC CTGGCAGAGA TGGCCCAACG CCGGCTGCGG TCCAAAATCC CTGAACTCAC CCAAGCCCTC AACGGCCGGT TCACCGAACA CCACCGCTAC ATGACCGGAC TCTATCTGCA CCGGATTGAT GCCCACACCG CCGATATCAA CGACCTCAGC GCCAGGATCG AGGCGGCGAT GGAGCCCTTT CGTTTCGCCC GGGAGCTCCT CGTGAGCATC CCGGGGTTCA GCACCACCAT CGCGGAAATC TTCATCGCCG AAACCGGCGC CGACATGAGC GCATTCGCCA CCGCAGGGCA GCTGGCATCC TGGGCCGGTA CTTCCCCGGG ATCCAACGAA TCCGCCGGAC GGGTCAAATC CACCAAAACA CGGCCCGGCA ACCGGTACCT CAAAGGTGCC CTGGGCATCG CTGCTCTGTC CTGCGCGAAA TCGAAGAACA CCTACCTCGG TGCCAGATAC CGGCGCATTG CCTCCCGGCG CGGACCCGCC AAAGCCCTCG TCGCCGTCGA ACACTCCATC CTCACCGCAG CGTGGCACAT GCTCACCACC GGCGAACTCT ACAACGACCC CGGCGCCGAC TACTTCACCC GGCAGACACC CGTTAAAACC ATGGCCCGGG CCGTCAGACA ACTAGAATCC CTCGGCTATC AAGTCATTCT CGAACCCCTG CAACAGACCG GATAA
|
Protein sequence | MEVIHPRCAG VDISKKDAKV CVRIQGRGGR STTSTITTWG SMTGQILGLK EHLLDEHVDL VVMEATGDYW RPFYYLLEDD GLNIILVNAH DARNVPGRKT DVSDAAWLAD LGAHGLVRAS FVPPPPIREL RDLTRARTII TQERTREIQR LEKLLEDACI KLSSVASNIT GVSGRLILQA LIDGQTDPAA LAEMAQRRLR SKIPELTQAL NGRFTEHHRY MTGLYLHRID AHTADINDLS ARIEAAMEPF RFARELLVSI PGFSTTIAEI FIAETGADMS AFATAGQLAS WAGTSPGSNE SAGRVKSTKT RPGNRYLKGA LGIAALSCAK SKNTYLGARY RRIASRRGPA KALVAVEHSI LTAAWHMLTT GELYNDPGAD YFTRQTPVKT MARAVRQLES LGYQVILEPL QQTG