Gene Arth_2907 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2907 
Symbol 
ID4444429 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3274031 
End bp3275785 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content68% 
IMG OID639690730 
Productmalto-oligosyltrehalose trehalohydrolase 
Protein accessionYP_832386 
Protein GI116671453 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0296] 1,4-alpha-glucan branching enzyme 
TIGRFAM ID[TIGR02402] malto-oligosyltrehalose trehalohydrolase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.172907 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCACC CCGCAAGCGG AAGCGGACGC TTCGACATCT GGGCCCCGGA GGTCACTGCC 
ATCACGTTGT TGGCCGACGG CTCTGAATAC CCCATGAGCC AGCGCGGCGA CGGCTGGTGG
ACGGCCTCGG ACGCCCCGGC CGGGGGAGAG GTGGACTACG GCTACCTGCC GGGGACGGAC
ACCACCCCCT TGCCGGATCC CCGGTCCCGC CGCCAGCCGG CCGGCGTGCA CTCTCTGTCC
CGCACCTTCG ACCCCGCCGC CCACGCGTGG GCCGACGGGA ACTGGGCGGG CCGGGAGCTG
CAGGGTGCCG TCATCTACGA ACTGCACATT GGCACGTTCA CGCCGGAGGG CACCCTGGAA
GCCGCAGCCG GAAAGCTGGG CTACCTCAAG GACCTCGGCG TCGACTTCGT GGAACTCCTG
CCCGTCAACG GCTTCAACGG CACCCACAAC TGGGGCTACG ACGGGGTCCT CTGGTACACC
GTCCATGAGG GCTACGGCGG CCCTGCCGCT TACCAGCGTT TCGTGGACGC CGCCCACGGC
GCAGGCCTGG GCGTCATCCA GGACGTCGTG TACAACCACC TCGGTCCCAG CGGAAACTAT
CTGCCGAAGT TCGGGCCCTA CTTGAAGTCC GGCGAAGGGA ACACCTGGGG CGACTCGGTG
AACCTGGACG GCAACGGATC AGACGAGGTC CGCCGCTACA TCCTGGACAA CGCAGCCATG
TGGCTCAGGG ACTACCACGT CGACGGGCTG CGGATCGACG CCGTGCACGC CTTCAAGGAC
GAGCGGGCGG TCCACCTCCT GGAGGAGTTC GGTGCCCTGG GCGACACTGT GGCCGCGGAA
ACCGGCCGCC CGATCACCAT GATCGCGGAG TCGGACCTCA ACAACCCCCG CCTGCTGTAC
CCCCGCGACG TCAACGGGTA CGGACTGGAG GGCCAGTGGA GCGACGACTT CCACCACGCC
GTCCACGTGA ACATCAGCGG CGAGACGGAG GGGTACTACA GCGACTTCGA TTCGCTGGGC
GCCCTGGCCA AGGTGCTGCG CGACGGGTTC TTCCACGACG GCAGCTACTC CAGCTTCCGC
GGCCGGCACC ACGGGCGGCC CATTAACACC GGGCTGGTGC ACCCCGCAGC CCTGGTGGTG
TGCAGCCAGA ACCACGACCA GATCGGCAAC CGCGCCACCG GCGACAGGCT TTCCCAGTCG
CTGTCCTACG GCCGGTTGGC CGTGGCGGCC GTCCTCACGC TGACGTCCCC GTTCACGCCC
ATGCTCTTTA TGGGGGAGGA ATACGGCGCC ACCACGCCGT GGCAGTTCTT CACCTCCCAC
CCCGAGCCGG AGCTGGGCAA GGCGACGGCG GAAGGCCGTA TCAAGGAGTT CGAACGCATG
GGGTGGGATC CCGCCGTCGT ACCTGATCCC CAGGATCCGG AGACCTTCAA CCGTTCGAAA
CTGAACTGGG CCGAGGCCAC CGAGGGTGAC CATGCCCGCC TCCTGGACCT CTACCGGACC
CTGACGGCGC TCCGCCGTTC CACCCCGGAA CTTGCGGGGC TGGGCTTCAC GGACACCGCG
GTGGACTACA GCGAAGAGGA GGGGTGGCTG CGGTTCCGGC GTGGAGACGT GCTGGTGGCG
CTGAACTTCT CCGAACAGAC GGTAAAGCTC GAAGATGCGG CCGGATCAGT GTTGCTTTCC
ACCGACGAGG CATCAGTGCC CGACGGCGGC TCGCTCTTGC TGGCGCCGTG GAGTGCCGTC
ATCGTGAGGG CCTGA
 
Protein sequence
MTHPASGSGR FDIWAPEVTA ITLLADGSEY PMSQRGDGWW TASDAPAGGE VDYGYLPGTD 
TTPLPDPRSR RQPAGVHSLS RTFDPAAHAW ADGNWAGREL QGAVIYELHI GTFTPEGTLE
AAAGKLGYLK DLGVDFVELL PVNGFNGTHN WGYDGVLWYT VHEGYGGPAA YQRFVDAAHG
AGLGVIQDVV YNHLGPSGNY LPKFGPYLKS GEGNTWGDSV NLDGNGSDEV RRYILDNAAM
WLRDYHVDGL RIDAVHAFKD ERAVHLLEEF GALGDTVAAE TGRPITMIAE SDLNNPRLLY
PRDVNGYGLE GQWSDDFHHA VHVNISGETE GYYSDFDSLG ALAKVLRDGF FHDGSYSSFR
GRHHGRPINT GLVHPAALVV CSQNHDQIGN RATGDRLSQS LSYGRLAVAA VLTLTSPFTP
MLFMGEEYGA TTPWQFFTSH PEPELGKATA EGRIKEFERM GWDPAVVPDP QDPETFNRSK
LNWAEATEGD HARLLDLYRT LTALRRSTPE LAGLGFTDTA VDYSEEEGWL RFRRGDVLVA
LNFSEQTVKL EDAAGSVLLS TDEASVPDGG SLLLAPWSAV IVRA