Gene Arth_1102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1102 
Symbol 
ID4446405 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1194142 
End bp1195239 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content62% 
IMG OID639688908 
Productmultiple sugar-binding periplasmic receptor 
Protein accessionYP_830596 
Protein GI116669663 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4213] ABC-type xylose transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.891182 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAATGA TTGGTAAAGC AGGAAAGGCA GCAGCAATCG CTGCTATTGC GGCACTGGCG 
CTGACAGCCT GCGGCCGCTC CGAGACCGGC ACCACGGGCA GCAGCAGCGG TGGCGAGGCG
TTCCCGAAGA ACTCCTCGAT CGGCGTCGCG CTCCCGCAGA AGACCAGTGA AAACTGGGTG
CTGGCCGAGA AGCTGTTCAA CGACGGACTC AACGGAGCCG GTTTCAAGGC TGATGTGCAG
TTCGCCAACG GCGGCGTATC CGAGCAGCAG AACCAGATCA GCGCCATGGT CACCAAGGGT
GCAAAGGTCA TCATCGTGGG TGCCATTGAC GGCGCCCAGC TGGGTACCCA GCTCAAGCAG
GCCAAGGACT CCGGCGCCAC CATCATCGCC TACGACCGCC TGCTCCTGAA CACCGAGAAC
GTGGACTACT ACGTGGCTTA CGACAACTTC AAGGTGGGTG AACTCCAGGG CCAGGCGCTG
CTGGACGGCA TGAAGGCCAA GAAGCCTTCC GGCCCGTACA ACATCGAGCT CTTCGCCGGC
TCCCCGGATG ACGCCAACGC GAAGGTCTTC TTCGACGGCG CCATGAGCGT GCTCAAGCCG
AAGATCGACG ACGGCACCCT CAAGGTTGTC TCGGGCCAGA CCTCGTTCGA GCAGGCCGTC
ACCCAGGGCT GGAAGGCTGA GAACGCCCAG CGTCGCGCCG ACACCCTGCT GACCGGCAGC
TACGGCACCG CTTCCCTGGA CGGCGTCCTG TCCCCGAACG ACACCCTGGC ACGTGCAGTA
CTGACGTCCG TCAAGGCCGC CGGCAAGCCG CTCCCGATCA TCACCGGCCA GGACTCCGAG
GTTGAGTCCG TCAAGTCCAT CATGGCCGGC GAGCAGTACT CCACCATCAA CAAGGACACC
CGCAAGCTCG TAGAGCACGC GATCACCATG GTCAAGGACA TCCAGGCCGG CAAGACGCCT
GAGATCAACG ATGACAAATC CTACAACAAC ACGGTCAAGA CCGTTCCGGC CTATCTGCTG
GATCCGGTCA TCGTGACCAA GGAGAACGTC AAGACGGCCT ACGTGGACGA TCCGGTACTG
GGCCCGATCA CCAAGTAG
 
Protein sequence
MQMIGKAGKA AAIAAIAALA LTACGRSETG TTGSSSGGEA FPKNSSIGVA LPQKTSENWV 
LAEKLFNDGL NGAGFKADVQ FANGGVSEQQ NQISAMVTKG AKVIIVGAID GAQLGTQLKQ
AKDSGATIIA YDRLLLNTEN VDYYVAYDNF KVGELQGQAL LDGMKAKKPS GPYNIELFAG
SPDDANAKVF FDGAMSVLKP KIDDGTLKVV SGQTSFEQAV TQGWKAENAQ RRADTLLTGS
YGTASLDGVL SPNDTLARAV LTSVKAAGKP LPIITGQDSE VESVKSIMAG EQYSTINKDT
RKLVEHAITM VKDIQAGKTP EINDDKSYNN TVKTVPAYLL DPVIVTKENV KTAYVDDPVL
GPITK