Gene Arth_1766 details


Gene Information

Locus tag: Arth_1766
Symbol:
ID: 4445700
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 1977360
End bp: 1978424
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 63%
IMG OID: 639689585
Product: transposase IS116/IS110/IS902 family protein
Protein accession: YP_831257
Protein GI: 116670324
COG category: [L] Replication, recombination and repair
COG ID: [COG3547] Transposase and inactivated derivatives
TIGRFAM ID:
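
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1977360, 1978424)
print(length)                  # 1065
print(protein_length(length))  # 354
```

Both values match the Gene Length and Protein Length fields in the record.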


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0961431
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCAACA ACGCGCAGAC GACGCAAGCA GTGATCGTCA TCGGAGGCAT CGATGCGCAC 
GCCGACACTC ATCATGTCGT CGCGCTCGAT ACGACCGGGA AGATACTCGG CGACCACCCT
TTTCCCGCTT CATCACGCGG ATATCGTGAC GCGCTGGACT GGTTGGCAAA GTTCGGGTTG
ATTGACAAGA TCGGAGTCGA ATCCACCGGT TCGTATGCGG CCGGCATCAC ACGGTTCCTC
CTCGAATCAG GTGTCGATGT CGTGGAAGTC AACCAGCCAC ACCCGCACCT GAGGGCGCGC
CGCGGCAAAG ACGATTCGAT CGACGCTGAA GCAGCAGCGC GCAAAGCGCT CTCGGGGCAG
GCCACCGCGA TCCCGAAGGT CACCACGGGT GTTGTCGAGT CTTTCCGTGT GCTGCGCTTG
GCCCGGGAAT CCGCCGTTCG TTCCCGCACG AGAACGATCG TGCAACTGCG CAGTCTTCTA
GTCACAGCAC CTGCGCGGCT GCGGGAGCAG CTCACGGAAC GGTCCGCAGC CGTGCTCGTG
GCACGATGCG CGGGCTTGCG GCCTGATCTG GATCGTCTTG ATGACCCCCT TCAAGCCACC
AAGCGTGCAC TGCGCGCCAT GGCCCGGAGG ATCCAGATGC TCGATGAGGA GATCAACGAG
ACCGACGCCT CACTCAAACA GCTCGTCGAG CGCACCGCGC CGACTCTGAC GTCCAAGCTC
GCGATCGGGC CAGGGCACGC CGCGCAGCTG TTGATCACCG CCGGGCAGAA CATTGAGCGG
CTCCACTCCG AGGCCGCATT CGCCAGACTC TGCGGCGTCG CACCGATCCC GGTCTCCTCC
GGCAAGACGC ATCGCATGCG CCTGCACCGA GGCGGTGATC GTCAAGCCAA CGCCGCGCTC
CACATGATCG CGGTCTGCCG GATGCGCTAC CACCAGCCCA CCATCGACTA CGTCAAGCGA
CGCCTCTCTG AAGGACTGTC GAAGAAGGAC GTGCTCCGAT GCCTCAAACG ATTCATTGCC
CGGGAGGTCT ACCACGACCT GAAAACCGAC CTTGGACTCA CTTGA
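
The GC content reported for this gene can in principle be recomputed from the sequence as listed. The following is a minimal sketch, assuming the sequence is pasted as plain text in blocks of ten bases separated by spaces and line breaks, as it is here. The helper name `gc_content` is hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring all whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First line of the gene sequence, copied with its block spacing intact.
first_line = "ATGAGCAACA ACGCGCAGAC GACGCAAGCA GTGATCGTCA TCGGAGGCAT CGATGCGCAC"
print(f"{gc_content(first_line):.1%}")
```

Passing the full 1065 bp sequence instead of the single line gives the value for the whole gene, which can then be compared with the GC content field.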
 
Protein sequence
MSNNAQTTQA VIVIGGIDAH ADTHHVVALD TTGKILGDHP FPASSRGYRD ALDWLAKFGL 
IDKIGVESTG SYAAGITRFL LESGVDVVEV NQPHPHLRAR RGKDDSIDAE AAARKALSGQ
ATAIPKVTTG VVESFRVLRL ARESAVRSRT RTIVQLRSLL VTAPARLREQ LTERSAAVLV
ARCAGLRPDL DRLDDPLQAT KRALRAMARR IQMLDEEINE TDASLKQLVE RTAPTLTSKL
AIGPGHAAQL LITAGQNIER LHSEAAFARL CGVAPIPVSS GKTHRMRLHR GGDRQANAAL
HMIAVCRMRY HQPTIDYVKR RLSEGLSKKD VLRCLKRFIA REVYHDLKTD LGLT