Gene Arth_0738 details


Gene Information

Locus tag: Arth_0738
Symbol:
ID: 4446772
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 786372
End bp: 788168
Gene Length: 1797 bp
Protein Length: 598 aa
Translation table: 11
GC content: 64%
IMG OID: 639688540
Product: trehalose synthase
Protein accession: YP_830236
Protein GI: 116669303
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID: [TIGR02456] trehalose synthase
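
The start, end, gene length, and protein length fields in this record agree with each other if the coordinates are read as 1-based and inclusive and the terminal stop codon is not counted as a residue. Both conventions are assumptions here, since the record does not state them. A minimal Python sketch of that check, with hypothetical function names that belong to no database API:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(786372, 788168)
print(length, protein_length_aa(length))  # 1797 598
```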


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.197847
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGTTTTA GTCCGCAAAA CCCGAGCCAG CATTTCACCC CCAAGGGCAC GTTTGAGCTC 
AACGCCCCCG GCCTGCAACA CGATCCCGTC TGGTACCGCA AAGCAGTGTT CTACGAAGTC
CTGGTTCGCG GCTTTGCGGA CGGAAACGGC GACGGATCCG GCGACTTCCA CGGGCTCATC
GACAAACTGG ACTACCTGCA GTGGCTGGGC GTCGACTGCC TCTGGCTGCC GCCGTTCTTC
CAGTCGCCGC TCCGTGACGG CGGCTACGAC ATCTCGGACT ACAACTCCGT CCTGGACGAG
TTCGGCACCA TCAGCGACTT CAAGCGGCTC GTGGCCGAGG CCCATGCCCG CGGCGTCCGC
GTAATCATCG ACCTGCCGCT GAACCACACC TCGGACCAGC ACCCGTGGTT CCAGGAGTCC
CGCAAGGACC CGGACGGACC GTTCGGCGAC TTCTACGTCT GGAGCGATAC CGACGAGAAG
TACCAGGATG CCCGCATCAT CTTCGTGGAC ACCGAGGAAT CCAACTGGAC CTTCGACCCC
ATCCGGCGGC AGTTCTTCTG GCACCGGTTC TTCAGCCACC AGCCCGACCT GAACTTTGAG
AACCCGAAGG TTATCGACGC CCTTTTCGAC GTCGTCCGGT TCTGGCTGGA CCAGGGGATC
GACGGTTTTC GCGCGGACGC CATCCCGTAC CTGTACGAGG AAGAGGGCAC CAACTGCGAG
AACCTCCCCG CCACGCACGA TTTCCTGCGC AAGCTGCGGA AAATGGTGGA TGAGAACTAC
CCGGGCAGGG TCATCATTGC CGAGGCCAAC CAGCCTCCGG TCGAGGTGGT GGAGTACTTC
GGCACCGAGG AAGAACCCGA GTGCCACATG GCCTTCCACT TCCCCATCAT GCCGCGCCTC
TACTACGCGC TGAGGGACCA GAAAGCGGCG CCCATCATCG AGACGATGAA GGACACCCCG
GACATTCCCG AAGGCGCCCA GTGGGGTACC TTCCTGCGGA ACCACGACGA ACTCACCCTG
GAAATGGTCA CCGCCGACGA ACGCGCGGCC ATGCTCGGCT GGTACGCACC GGACCCCCGC
ATGCGCGCCA ACATCGGCAT CCGCCGAAGG CTGGCGCCGC TGCTGGATAA TTCCAGGTCC
GAGATTGAGC TCATCAATGC CCTGCTGCTT TCTCTGCCGG GGAGCCCGTT CCTGTACTAC
GGGGATGAAA TCGGCATGGG CGACAACATC TGGCTTGAGG ACCGCGACGC CGTCCGCACT
CCCATGCAGT GGAACCCTGA CCGGAACGCG GGCTTCTCGC ACGCGGATCC GGGCAAGCTC
TACCTGCCGG TCATCCAGTC GCTGGTGTAC AACTACGGCA TGGCCAATGT GGAGGCCGAG
GCCGCGCACT CCGGATCGCT GCTCCGTTGG ACCCGGCAGA TCCTCAGCGT CCGTAAGAAC
CACCCCGTCT TCGGGCTTGG CACGTTCAAG CACGTCGAGG CAGATCACGA CGTCGTGCTT
GCTTACCTGC GGGAACTGGC CCCGGACAAT GCGGCGGGTG AAGACGCCGA ATCGATCCTG
TGCGCGTTCA ACCTCTCGCA GCATCCCGTG GCCACCACGC TGCGGATTCC GCAGTACGCC
GGCCGCGGCC TCCGGGACGT CTTCGGCGGC CAGCCGTTCC CCGCCATCGG CGACGACGGG
CGTCTTACGC TGACGCTCGG CAGCCATGAT TTCTTCTGGC TGCGGATCCG TTCCGCTGCT
TCCAACCCGG CCTCGCCGTT CACCCAGGCC ATGCCGGTCC TGTCCATCGA AGGCTGA
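
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before computing anything from it. The sketch below is one way to do that and to compute a GC fraction. The helper names are hypothetical, and the example uses only the first line of the sequence, so its GC value is not expected to equal the whole-gene figure reported in the record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "GTGAGTTTTA GTCCGCAAAA CCCGAGCCAG CATTTCACCC CCAAGGGCAC GTTTGAGCTC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full 1797 bp sequence through the same two functions would give the value to compare against the GC content field.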
 
Protein sequence
MSFSPQNPSQ HFTPKGTFEL NAPGLQHDPV WYRKAVFYEV LVRGFADGNG DGSGDFHGLI 
DKLDYLQWLG VDCLWLPPFF QSPLRDGGYD ISDYNSVLDE FGTISDFKRL VAEAHARGVR
VIIDLPLNHT SDQHPWFQES RKDPDGPFGD FYVWSDTDEK YQDARIIFVD TEESNWTFDP
IRRQFFWHRF FSHQPDLNFE NPKVIDALFD VVRFWLDQGI DGFRADAIPY LYEEEGTNCE
NLPATHDFLR KLRKMVDENY PGRVIIAEAN QPPVEVVEYF GTEEEPECHM AFHFPIMPRL
YYALRDQKAA PIIETMKDTP DIPEGAQWGT FLRNHDELTL EMVTADERAA MLGWYAPDPR
MRANIGIRRR LAPLLDNSRS EIELINALLL SLPGSPFLYY GDEIGMGDNI WLEDRDAVRT
PMQWNPDRNA GFSHADPGKL YLPVIQSLVY NYGMANVEAE AAHSGSLLRW TRQILSVRKN
HPVFGLGTFK HVEADHDVVL AYLRELAPDN AAGEDAESIL CAFNLSQHPV ATTLRIPQYA
GRGLRDVFGG QPFPAIGDDG RLTLTLGSHD FFWLRIRSAA SNPASPFTQA MPVLSIEG
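
The protein sequence begins with M although the gene begins with GTG, which normally codes for valine. This is consistent with translation table 11, which shares the standard codon assignments but accepts several alternative initiation codons that are read as methionine. The Python sketch below illustrates the idea. The set of start codons is an assumption based on the NCBI definition of table 11, and the function name is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard codon assignments in NCBI order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumed initiation codons for table 11; each is read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(blocked_dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = "".join(blocked_dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    residues = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, copied from the record.
print(translate("GTGAGTTTTA GTCCGCAAAA CCCGAGCCAG"))  # MSFSPQNPSQ
```

The result matches the first ten residues of the protein sequence listed above.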