Gene Arth_4362 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_4362 
Symbol 
ID4443473 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008538 
Strand
Start bp101547 
End bp102791 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content63% 
IMG OID639687683 
Producttransposase IS116/IS110/IS902 family protein 
Protein accessionYP_829380 
Protein GI116662326 
COG category[L] Replication, recombination and repair 
COG ID[COG3547] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000113167 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGGTCA TTCACCCGCG CTGCGCGGGT GTCGATATTT CAAAGAAGGA CGCGAAGGTC 
TGTGTCCGCA TCCAGGGCCG CGGCGGCCGG TCCACCACAT CCACCATCAC GACCTGGGGT
TCGATGACGG GGCAGATCCT GGGGCTAAAG GAACACCTCC TGGACGAGCA CGTGGATCTG
GTGGTCATGG AGGCCACCGG TGATTACTGG CGCCCCTTCT ACTACCTGCT TGAAGACGAC
GGACTGAACA TCATCCTGGT CAATGCCCAT GACGCCAGGA ACGTCCCGGG CCGCAAAACC
GACGTTTCCG ACGCTGCCTG GCTTGCCGAC CTGGGCGCCC ACGGGCTGGT CCGGGCCTCC
TTCGTGCCCC CGCCTCCGAT CCGTGAACTG CGGGACCTGA CCCGGGCCCG GACCATCATC
ACTCAGGAAC GGACCCGGGA AATCCAGCGC CTGGAGAAAC TCCTCGAGGA CGCCTGCATC
AAACTATCCT CGGTAGCTTC GAACATCACC GGGGTCTCCG GACGACTGAT CCTCCAAGCA
CTCATCGACG GGCAGACCGA CCCCGCGGCC CTGGCAGAGA TGGCCCAACG CCGGCTGCGG
TCCAAAATCC CTGAACTCAC CCAAGCCCTC AACGGCCGGT TCACCGAACA CCACCGCTAC
ATGACCGGAC TCTATCTGCA CCGGATTGAT GCCCACACCG CCGATATCAA CGACCTCAGC
GCCAGGATCG AGGCGGCGAT GGAGCCCTTT CGTTTCGCCC GGGAGCTCCT CGTGAGCATC
CCGGGGTTCA GCACCACCAT CGCGGAAATC TTCATCGCCG AAACCGGCGC CGACATGAGC
GCATTCGCCA CCGCAGGGCA GCTGGCATCC TGGGCCGGTA CTTCCCCGGG ATCCAACGAA
TCCGCCGGAC GGGTCAAATC CACCAAAACA CGGCCCGGCA ACCGGTACCT CAAAGGTGCC
CTGGGCATCG CTGCTCTGTC CTGCGCGAAA TCGAAGAACA CCTACCTCGG TGCCAGATAC
CGGCGCATTG CCTCCCGGCG CGGACCCGCC AAAGCCCTCG TCGCCGTCGA ACACTCCATC
CTCACCGCAG CGTGGCACAT GCTCACCACC GGCGAACTCT ACAACGACCC CGGCGCCGAC
TACTTCACCC GGCAGACACC CGTTAAAACC ATGGCCCGGG CCGTCAGACA ACTAGAATCC
CTCGGCTATC AAGTCATTCT CGAACCCCTG CAACAGACCG GATAA
 
Protein sequence
MEVIHPRCAG VDISKKDAKV CVRIQGRGGR STTSTITTWG SMTGQILGLK EHLLDEHVDL 
VVMEATGDYW RPFYYLLEDD GLNIILVNAH DARNVPGRKT DVSDAAWLAD LGAHGLVRAS
FVPPPPIREL RDLTRARTII TQERTREIQR LEKLLEDACI KLSSVASNIT GVSGRLILQA
LIDGQTDPAA LAEMAQRRLR SKIPELTQAL NGRFTEHHRY MTGLYLHRID AHTADINDLS
ARIEAAMEPF RFARELLVSI PGFSTTIAEI FIAETGADMS AFATAGQLAS WAGTSPGSNE
SAGRVKSTKT RPGNRYLKGA LGIAALSCAK SKNTYLGARY RRIASRRGPA KALVAVEHSI
LTAAWHMLTT GELYNDPGAD YFTRQTPVKT MARAVRQLES LGYQVILEPL QQTG