Gene Arth_1839 details

Gene Information

Locus tag: Arth_1839
Symbol:
ID: 4445633
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 2065782
End bp: 2068034
Gene Length: 2253 bp
Protein Length: 750 aa
Translation table: 11
GC content: 64%
IMG OID: 639689657
Product: glycoside hydrolase family 3 protein
Protein accession: YP_831329
Protein GI: 116670396
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
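
The start and end coordinates, gene length, and protein length listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2065782
end_bp = 2068034

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length)     # 2253
print(protein_length)  # 750
```

Both results agree with the Gene Length (2253 bp) and Protein Length (750 aa) fields.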


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.755932
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCACGA ACACGTCCAA CCTCACCCTG GAGCAAAAAG CATCGCTGCT ATCAGGAGAG 
AACTTTTGGC AGACGAAAGA ACTGCCGGAG GCAGGCATCC CCGCCATCGT CCTGACTGAT
GGGCCGCACG GTATCCGACG CCAACTGGCC GGGGAAGACC ACCTCGGCCT GCACCACAGC
GAACCCTCGA CATGCTTCCC GCCGGCAGTG GCAGTAGGTT CCAGCTGGAA TCCAGCGGTC
GCCCAGAGGC TCGGTGCCGG TGTCGGCAAG GAAGGCCACC GACTGGGCAT ATCCGTCGCA
CTGGGGCCGG GCGTGAACAT CAAGAGGTCG CCGTTGTGCG GCAGGAACTT CGAGTACTAC
TCCGAGGACC CGCTGCTCTC GGGCATCCTA GGGGCTGCGC ACGTCACCGG GCTCCAGGCC
GAAGGCGTCG GGGCCAGCGT CAAACACTTC GCAGCCAACA ACCAGGAAAC CGACCGGATG
CGGGTCAGCG TCGAGATTGA CGAACGCACC CTCCGGGAAA TCTACCTCCC GGCCTTCGAG
CGGATCGTCA AGGAAGCGCG GCCCGCCACT GTCATGTGCT CCTACAACAA AATCAACGGG
GTCTACGCCT CCGAAAACCG ATGGCTGCTC ACCGAACTGC TACGGGACGA ATGGGGCTTC
GACGGAGCCG TGGTATCCGA CTGGGGCGCG GTCTCGAACC GCGTCGCCGC ACTCAAAGCC
GGCCTGGACC TCGAAATGCC GGGCAACGGA GGAACCAGCA ACCGTGAAAT TGTCGAGGCC
GTTAAGAACG GAACGCTGGA CATCGACGAT GTCGACCGCG CAGCGGCCCG TGTTCTTTCC
CTGACACACA ATTCCGTGGC CTCACCGGGC CACTACGACG TCGGAGACAG CCATGCCTTG
GCACATGAGC TTGCCCGGGA GTGCATCGTG CTGCTCAAGA ACGACGGACA GGCGCTTCCA
CTGGCCGGCA ATTCCCGGGT TGCTGTCATC GGTCACTTCG CCGCCGCGCC CCGCTACCAG
GGCGGCGGCA GCTCCCATAT CAACGCGACG CGGGAAGAAT CCGCCCTGGA ATCCATCCGC
GAACATGCGG CACGGCTCGG GGCCGAGGTC ACATACTCGC CGGGCTTCAC CGTGGACGAC
CACGCTCCGG AGCAGGCCAC ACTGATGGGC GCAGCCGTTG AAGCTGCGAC CGCTTCGGAC
GTGGCGATCA TCTTCGCCGG ACTCTCCGAG CAAGACGAAT CCGAAGGGTT CGACCGAAAC
CACCTGGACC TTCCCGCGCA CCAAGTCCAT GCGATTCGTA CCGTCGCACA GGCAGCCCCG
AAGACCGTCG TCGTCCTCTC CCACGGGGGC GTCGTTTCCC TCGAAGGCTG GCATGACGAC
GCCGATGCGA TCGTTGAAGG GTGGCTGCTG GGCCAGGCCG GCGGGGCGGC GATCGCCGAA
GTCCTGTTCG GCGCCGTCAA CCCTTCCGGG CACCTTGCCG AAACAATTCC TTTGAGGCTT
CAGGACAATC CCAGCTGGCT CAACTTCCCC GGAGAGCAGC AGCATGTCCG CTACGGCGAA
GGCGTGTTCG TCGGATACCG CTACTACACC AGCGCGGACG TCCCCGTCCG CTACCATTTC
GGACACGGGC TCAGCTACAC CACGTTCCGT ACCGACAACC TCGATATCGA AGTCACAGGC
CCCTCATCCG CCCGTGCACG CGTGACCGTC ACCAACACCG GAAACCTGGC CGGGAAACAT
GTCATCCAGC TTTACGTGGC CACGACCGCT GGCCCCGTGA AACGCCCGGC CCGCGAATTG
AAGGCCTTCA CTAAAATCGA CCTGGACCCC GGGCAGAGCA AAACAGTGGA ACTGGGGCTG
GACCACCGGT CCTTTGCCTA TTACGACGAA CCCCTCGGCC GTTGGGTGGC CGCAGCGGGT
GACTACGCCA TCCAGATCGG CGTCGACGCC GGAACCATCG ACACCCAAAC GTCGATCACC
CTGGTGGGGG ACACTATTAC ACGTGAACTG AGCATGGATT CCCCGGTCGG TGAATGGTTC
GGGCACCCCG TCGTCGGCCC GGCACTCCTG CAGGGGATCA CGGCATCCAT GTCAGAGCAA
CAGGCACACG CTGCCGCGGC CAACCAGGAC AGCATGAGGA TGGTGGAATC TTTGCCGATG
AAGCAATTCG TCGGCTTCCT CGGAGACGCC CTCCCGCCCG AAGCACTCGA CCAACTCTTG
GCACTCAGCC GCCAGCCGGC CGAAGCCGTC TAA
 
Protein sequence
MSTNTSNLTL EQKASLLSGE NFWQTKELPE AGIPAIVLTD GPHGIRRQLA GEDHLGLHHS 
EPSTCFPPAV AVGSSWNPAV AQRLGAGVGK EGHRLGISVA LGPGVNIKRS PLCGRNFEYY
SEDPLLSGIL GAAHVTGLQA EGVGASVKHF AANNQETDRM RVSVEIDERT LREIYLPAFE
RIVKEARPAT VMCSYNKING VYASENRWLL TELLRDEWGF DGAVVSDWGA VSNRVAALKA
GLDLEMPGNG GTSNREIVEA VKNGTLDIDD VDRAAARVLS LTHNSVASPG HYDVGDSHAL
AHELARECIV LLKNDGQALP LAGNSRVAVI GHFAAAPRYQ GGGSSHINAT REESALESIR
EHAARLGAEV TYSPGFTVDD HAPEQATLMG AAVEAATASD VAIIFAGLSE QDESEGFDRN
HLDLPAHQVH AIRTVAQAAP KTVVVLSHGG VVSLEGWHDD ADAIVEGWLL GQAGGAAIAE
VLFGAVNPSG HLAETIPLRL QDNPSWLNFP GEQQHVRYGE GVFVGYRYYT SADVPVRYHF
GHGLSYTTFR TDNLDIEVTG PSSARARVTV TNTGNLAGKH VIQLYVATTA GPVKRPAREL
KAFTKIDLDP GQSKTVELGL DHRSFAYYDE PLGRWVAAAG DYAIQIGVDA GTIDTQTSIT
LVGDTITREL SMDSPVGEWF GHPVVGPALL QGITASMSEQ QAHAAAANQD SMRMVESLPM
KQFVGFLGDA LPPEALDQLL ALSRQPAEAV
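
The sequences above are printed in blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch of how one might strip the spacing, compute a GC fraction, and translate the coding sequence. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical, and the codon table is built by hand rather than taken from any library. It assumes the sequence contains only A, C, G, and T and that the reading frame starts at the first base.

```python
from itertools import product

# Amino acids for codons in the order TTT, TTC, TTA, TTG, TCT, ...
# (bases cycled in T, C, A, G order). Translation table 11 assigns the
# same amino acids as the standard code; it differs only in which codons
# may act as start codons, which does not matter here because the gene
# begins with ATG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}

def clean_sequence(text):
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()

def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    dna = clean_sequence(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)

def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First line of the gene sequence, copied from the record above.
first_line = "ATGAGCACGA ACACGTCCAA CCTCACCCTG GAGCAAAAAG CATCGCTGCT ATCAGGAGAG"
print(translate(first_line))  # MSTNTSNLTLEQKASLLSGE
print(round(gc_fraction(first_line), 2))
```

The translated fragment matches the first twenty residues of the protein sequence. Passing the full gene sequence to the same helpers is one way to check the record's GC content and protein length fields.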