Gene Arth_0744 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0744 
Symbol 
ID4446749 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp802157 
End bp803455 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content64% 
IMG OID639688549 
Productextracellular solute-binding protein 
Protein accessionYP_830242 
Protein GI116669309 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2182] Maltose-binding periplasmic proteins/domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0587531 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGTGC AGAACACCCT CACGAGCGGG ACCAGCCGCC GCGTCTTCAC GGCGGGCGCG 
CTCTCCGTCG TGGCAGCCTT GGCGCTCAGC GCCTGCGGCG GCGCATCGGC CACCGCGCCG
GCAACGGCTA CCGCCAAGCC GGACCTGAAG TCTGCCGGTG CAGGCACCAT CACCATGTGG
GTCGACGCCG AACGGTCACC GGCCCTGAAG GACATCACCG CCAAGTTCCA GGCTGACACC
GGCATCGAGG TCAAGCTCGT CGTCAAGGAC TTCGCGGCCG TCCGCGATGA CTTCATCACC
CAGGTACCCA CGGGCAAGGG CCCGGACCTG ATCGTCGGAC CGCATGACTG GGTCGGCAAG
TTCGTCCAGA ACGGCGTCAT TGCTCCGATC GAACTCGGGG ACAAGTCGTC GGCCTTCCAG
GATTCGTCCA TCAAAGCCAT GACCTTCAAC GGTTCCGTCT ATGGTGTCCC GTACGCCATC
GAAAATATCG CGTTGCTGCG CAACACCGAC CTGGTGGCCG AGAGCGCGGC AACGCTGGAC
GAGGTAGTGG CCAACGGCAA GAAGGCAGTG GCGGACGGCA AGGCCAAGTT CCCGTTCCTG
GTGGGCATGG ACCCCAAGCA GGGTGACCCC TACCACCTCT ACCCGCTGCA GGCCTCCATG
GGTTCGCAGG TCTTCGGCCA GGCTGCCGAC GGTGGCTACG ATCCCAAACA GCTCCTCATC
GGTGACGCCG CCGGTGTTGA GTTCGCCAAG AAGCTCGTGG CCTGGGGCGA CGCAGGTGAA
AAGATCATCA ATTCCAACAT CACCGGAGAC ATCGCCAAGG AGAAGTTCCT GGCAGGCGAA
TCCCCTTACT TCCTGACGGG CCCCTGGAAC GTTCCGGATG TCCAGAAGAA GGGCATCAAG
TTTGCCGTGG ACGCCCTGCC GACCGCCGGC GACAAGCCCG CCCAGCCGTT TATCGGCGTC
AACGGTTTCT TCATCAGCGC CAAGAGCGCT AACGCCCTTG CCACCAATGA ATTCGTAACC
AACTACCTCA CCTCCGAAGC GGCGCAGGAT TCCATGTACA AGGCCGGCGG CCGTCCGCCG
GCGCTGAAGG CCTCCTTCGA GAAGGCAGCC AGCGACCCCG TGGTGGAAGC ATTCGGCAAG
ATCGGCGCCA CCGGCGTGCC CATGCCTGCC ATTCCGGAAA TGAGCGCGGT CTGGGCCGAC
TGGGGAGCCA CTGAACTCGC GCTGATCAAG GGCCAGGGCG ATCCCGCTGC CGAGTGGGCC
AAGATGGCTG CCAGCATCAA GGCGAAGATC GCCGGCTGA
 
Protein sequence
MKVQNTLTSG TSRRVFTAGA LSVVAALALS ACGGASATAP ATATAKPDLK SAGAGTITMW 
VDAERSPALK DITAKFQADT GIEVKLVVKD FAAVRDDFIT QVPTGKGPDL IVGPHDWVGK
FVQNGVIAPI ELGDKSSAFQ DSSIKAMTFN GSVYGVPYAI ENIALLRNTD LVAESAATLD
EVVANGKKAV ADGKAKFPFL VGMDPKQGDP YHLYPLQASM GSQVFGQAAD GGYDPKQLLI
GDAAGVEFAK KLVAWGDAGE KIINSNITGD IAKEKFLAGE SPYFLTGPWN VPDVQKKGIK
FAVDALPTAG DKPAQPFIGV NGFFISAKSA NALATNEFVT NYLTSEAAQD SMYKAGGRPP
ALKASFEKAA SDPVVEAFGK IGATGVPMPA IPEMSAVWAD WGATELALIK GQGDPAAEWA
KMAASIKAKI AG