Gene Arth_4289 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_4289 
Symbol 
ID4443540 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008538 
Strand
Start bp22253 
End bp25219 
Gene Length2967 bp 
Protein Length988 aa 
Translation table11 
GC content64% 
IMG OID639687610 
Producttransposase Tn3 family protein 
Protein accessionYP_829307 
Protein GI116662253 
COG category[L] Replication, recombination and repair 
COG ID[COG4644] Transposase and inactivated derivatives, TnpA family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGCTCTGC GTTCTTTGCT CACGGCTGCC GAGCGCGGCC AAATCCTGGC AATGCCCGCT 
GAAACAGAGG ACCTTGCAGC CCACTACACG CTCAGTGATG CGGATATGTC GTTGATCCGG
CAACGCCGCG GGGACGCGAA CAGGCTGGGC TTTGCGGTCC AGCTGTGTCT GCTGCGCCAC
CCGGGCATCG GGCTGGCCGA CGACACCGAC GTGCCGCCGG AACTCATTGC CTGGCTGGCC
TCCAGCCTGG GCGTTTCCAT TGATGCCTGG GACGAGTATG GAACACGTGA GGAAACGCGG
CAGGAGCATG GACGGGAGAT CCGCGGGTAT CTGGGCATGT CAGCGTTCGG CATCGCGGAC
TACCGCTGGC TCGTGGAGCA TGTGGGTGTT GTGGCTGCGC ACACGGACAA GGGCCTGGTC
CTGGTCGAAA GCGCGAGGGA TTTCCTGCAG GCAAGGAAGG TCGCGTTGCC CGGGATCGGG
GTCATTGAAA AGGCCTGTGC GCAGGCGCTG ACCAGGGCCA ATAGGCGGAT TTACCTCACT
TTGGGTGAGC AGCTGACCGT GGGTCACCGG CAACGCCTCG ACGGGCTGCT GCGCCGCCGC
CGCGATAGTT CTCTGACGGA AATCGGTTGG CTGCGGCAGG CGCCGCTGCG ACCCAACGCG
CGGGCGATGA ATGAGCACAT CGACCGGCTC ACCACCTGGC GTGCGCTGGA GCTGCCCTGG
GCTGCGGGAC GGCTGGTTCA TCGGAACAGG TTGCTGAAGC TTGCCCGGGA GGGTGCCTCG
ATGACCGCGG CGGATCTGGC CAGATTTGAA CCGGCACGCC GGTACGCGAC CCTCTTCGCC
ATGGCCACCG AAAGCATGGC CACCGTCACC GACGAAATCA TCGATCTGCA CGACCGGATC
ATCGGCCGGC TCATCCGGAC CGCACAGAAC AAGCAAAACC AGGCCACCCT GGCATCCCGC
TCCACCGTCG CCGCCATGAT GCGCATCCAT TCCAGGCTCG GTGATGCCCT CTTTGAGGCC
AAGGAAAACG GCGAAGATCC CTTCGCCGCC ATCGAAACGG CCATCGGCTG GGAATCCCTG
GCCGAAAGCA TCGCCCACGC CAAGGAACTG ACCCGCCCGG CCCTCGAGGA CCCCCTCGCC
CTCGTCAGCG CCCACTTCAC CACCCTGCGC CGCTACACCC CGGCATTTCT TGCCGTCCTT
GACCTCAACG CGGCCCCGGC GGCACAGGAC CTGCTGGCAG CAATCAACCT CGTCCGCACC
CTGAACACCG CCGGAGCCCG AAAAATCCCC GACGATGCGC CCACCTCGTT CGTCCGGCCC
CGGTGGAAGC CGCTGGTCTT CACGGAAAAC GGCATAGACC GGGGCTTCTA CGAGTTCTGT
GCCCTCGCGG AACTAAAGAA TGCGCTTCGT TCCGGAGACT TGTGGGTCAC CGGATCCCGT
CAATTCCGTG ACTTCGATGA CTACCTTCTT GCCGGTCCTG ACTACACGGT TATGAAAACC
ACCGGGAAGC TGCCTCTGGT CACGACCGAC GGCGGCGAAA GCTATCTCCA GAACCGGCTG
GCCCTGCTCA ACGAACGGCT GCACCATGTC AACGACCTCG CCTCCCGCGA TGAACTGCCC
GGGGTGATGG TCACGGACAA GGGCGTGAAA ATCACCCCGC TGGAGACAAT CGTGCCAAAA
CACGCGCAGC CACTGATCGA TCAGGCAAGC GCAATGTTTC CGCGGATCCG GATCACCGAT
CTGTTGATGG AAGTTGATGG CTGGACCGGG TTCACCCGCC ATTTCACCAG CCTGAAATCC
GGCCAGCCCT CCAAAGACAA GCAACTTCTT CTCACCGCCA TCCTCGCGGA CGGAATCAAC
CTGGGCCTGA CGAAGATGGC CGAGTCATGC ACCGGCGTCA GCTACGCCCA GCTGGACCGC
CACCAGGCCT CCTACATCCG GGACGAAACC TACAGCGCCG CTCTGGCGGA ACTGGTGAAC
ACCCAGCACG GACACCCCTT CGCCGCACAG TGGGGCGACG GGACCACCTC CTCATCGGAC
GGGCAGCGGT TCCGTGCCAG CAGCAAAGCC GAATCCACCG GGCATGTGAA CCCCAAGTAC
GGTGCCGAGC CCGGCCGGCT GATCTACACA CACATCTCGG ACCAGTACTC GCCCTTCCAC
AGCAAGCTCG TCAACGTCGG CGACCGCGAC GCGACCTACG TCCTGGACGG GCTGCTCTAC
CACGAGTCCG ACCTGGCGAT CCAGGAGCAC TACACGGATA CGGCCGGATT CACCGATCAC
CTCTTCGCTC TTATGCACCT GCTCGGGTAC CGGTTCGCCC CACGGATCCG CAACATCGGC
GACACCCGTC TCTACACACC TACCACCGAT CCGGGACTTG CCACGTTGGC GCCGCTGATC
GGCGGGACCA TCAACACGAA AATGATTGCC CTGCATTGGG ATGAAATCCT CCGCCTCGCC
GCGTCCATCA AGACCGGCAC CGTGACCGCG TCCCTGATGA TGCGAAAACT CGGCGCCTAC
CCGCGCCAGA ACGGGCTCGC ACTCGCGCTG CGGGAGCTGG GCAGACTGGA GCGGACCCTC
TTCCTGCTGG ACTGGCTCCA GAACCCCGGC CTGCGCCGCA AAGTCACGGC CGGCCTGAAC
AAGGGCGAGG CCCGGAACAC CCTCGCCCGG GCCGTCTTCT TCAACCGCCT CGGCGAAATC
CGCGACCGCT CCTTCGAACA GCAACGCTAC CGCGCCAGCG GACTGAACCT TCTCACCGCG
GCCATTATTC TCTGGAACAC CGTCTACCTC GACCGCACCA TCACCACCCT CAATAAGGAC
GGGAACGCCA CGGACCCTGA CCTGCTGCGG TTCCTCTCAC CCCTGGGCTG GGAACACATC
AACCTCACCG GCGACTACAC CTGGCCCCGC GCCAACCAGA TCAAACCCGG CAAATACAGG
CCACTACGCC GCCCGGCAAA ACCTTAA
 
Protein sequence
MALRSLLTAA ERGQILAMPA ETEDLAAHYT LSDADMSLIR QRRGDANRLG FAVQLCLLRH 
PGIGLADDTD VPPELIAWLA SSLGVSIDAW DEYGTREETR QEHGREIRGY LGMSAFGIAD
YRWLVEHVGV VAAHTDKGLV LVESARDFLQ ARKVALPGIG VIEKACAQAL TRANRRIYLT
LGEQLTVGHR QRLDGLLRRR RDSSLTEIGW LRQAPLRPNA RAMNEHIDRL TTWRALELPW
AAGRLVHRNR LLKLAREGAS MTAADLARFE PARRYATLFA MATESMATVT DEIIDLHDRI
IGRLIRTAQN KQNQATLASR STVAAMMRIH SRLGDALFEA KENGEDPFAA IETAIGWESL
AESIAHAKEL TRPALEDPLA LVSAHFTTLR RYTPAFLAVL DLNAAPAAQD LLAAINLVRT
LNTAGARKIP DDAPTSFVRP RWKPLVFTEN GIDRGFYEFC ALAELKNALR SGDLWVTGSR
QFRDFDDYLL AGPDYTVMKT TGKLPLVTTD GGESYLQNRL ALLNERLHHV NDLASRDELP
GVMVTDKGVK ITPLETIVPK HAQPLIDQAS AMFPRIRITD LLMEVDGWTG FTRHFTSLKS
GQPSKDKQLL LTAILADGIN LGLTKMAESC TGVSYAQLDR HQASYIRDET YSAALAELVN
TQHGHPFAAQ WGDGTTSSSD GQRFRASSKA ESTGHVNPKY GAEPGRLIYT HISDQYSPFH
SKLVNVGDRD ATYVLDGLLY HESDLAIQEH YTDTAGFTDH LFALMHLLGY RFAPRIRNIG
DTRLYTPTTD PGLATLAPLI GGTINTKMIA LHWDEILRLA ASIKTGTVTA SLMMRKLGAY
PRQNGLALAL RELGRLERTL FLLDWLQNPG LRRKVTAGLN KGEARNTLAR AVFFNRLGEI
RDRSFEQQRY RASGLNLLTA AIILWNTVYL DRTITTLNKD GNATDPDLLR FLSPLGWEHI
NLTGDYTWPR ANQIKPGKYR PLRRPAKP