Gene Arth_2948 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2948 
Symbol 
ID4444470 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3314283 
End bp3315629 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content64% 
IMG OID639690771 
Productextracellular solute-binding protein 
Protein accessionYP_832427 
Protein GI116671494 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTATGA ATCTTGACCG CAGGCATTTC CTAGGGCTTG CCGGCGCAGG AGCCGGTGCC 
GCTGCGCTGG CAGCGTGCGG CGGCCCGTCC ACCGGCGGAA CAACGCCTGC AAGCGAAGCC
GCCGAGATCG ACTTCAGCGG CGTCAAGCCT GCCGCCTCCA TCGACTTCTG GACGAGCCAC
CCGGGCAAGT CCCAGGACGT CGAAAAATCC ATCATCGCCA AGTTCCACGC CAAGTTCCCG
GACATCAAGG TGAACCTGGT CACCGCCGGT GCCAACTATG AGGAGATTGC ACAGAAGTTC
CAGACCTCGC AGGCCGCCAA GGAGGCACTG CCGGGCCTTG TGGTGCTCTC CGATGTGTGG
TGGTTCCGCT ACTTCACGAA CGGCAACATC ATTCCGCTGG ACGGACTGGT GAAACAGCTG
GATATCAAGG TGGACGACTT CCAGAAGTCC CTCGTGGCCG ACTACCAGTA CGACGACAAG
CAGTGGGCCC TCCCCTACGG CCGTTCGACG CCGCTCTTCT ACTACAACAA GGACCACTTC
AAGGCGGCCG GCCTCCCGGA CCGGGCACCG AAAACCTGGC AGGAATTCGC CGAGTGGGCG
CCCAAGCTGA AGGCAAGCTC CGGCGCGCAG TACGCCTACA TCTACCCGGC GCTGGCCGGC
TATGCGGGCT GGACCCTGCA GAACAACCTC TGGGGATGGG GCGGCAGCTG GTCCAACGAG
TGGACCATCA ACTGCGACTC GGCGGAATCG GTGGAGGCCC TGCAGTGGGC CCAGGATTCC
ATCTACAAGG ACGGCTGGGC GGGTGTTTCC TCGAAGGAGG CCGCTGACGA CTTCGCCGCG
GGCATCACAT CCTCCACCAT CTCGTCCACA GGGTCCCTGC TCGGTGTGCT GAAGTCCGCC
AAGTTCAACG TGGGCGTGGG CTTCCTGCCG GGCGGCCCCA AGGTGGAAAG CGGCGTGTGC
CCCACCGGTG GTGCCGGCCT GGGCATTCCC AGCGGTGTGA GCAAGGAAGT GCAGCTGGCT
GCGGGCACCT TCCTGAAGTT CATGACCGAG CCGGAAAGCA CCGCGGAATT CTCTGCGGCA
ACGGGCTACA TGCCTACGCG TGTTTCGGCC GACATGACGT CGGTACTGGC CAAGACGCCG
CAGATCAAGA CGGCCATGGA CCAGCTCGCG GTCACCCGGG TCCAGGACAA CGCCCGCGTG
TTCCTGCCCG GCGCAGACCA GGAAATGGCC AAGGCCGCAG CGAAGATCCT CACCCAGCAG
GGCGACGTGA AGGCCACCAT GACCGCGTTG AAGTCCACGC TGGAGGGCAT CTACACGAAG
GACGTCAAGC CCAAGCTCAA GAGCTGA
 
Protein sequence
MGMNLDRRHF LGLAGAGAGA AALAACGGPS TGGTTPASEA AEIDFSGVKP AASIDFWTSH 
PGKSQDVEKS IIAKFHAKFP DIKVNLVTAG ANYEEIAQKF QTSQAAKEAL PGLVVLSDVW
WFRYFTNGNI IPLDGLVKQL DIKVDDFQKS LVADYQYDDK QWALPYGRST PLFYYNKDHF
KAAGLPDRAP KTWQEFAEWA PKLKASSGAQ YAYIYPALAG YAGWTLQNNL WGWGGSWSNE
WTINCDSAES VEALQWAQDS IYKDGWAGVS SKEAADDFAA GITSSTISST GSLLGVLKSA
KFNVGVGFLP GGPKVESGVC PTGGAGLGIP SGVSKEVQLA AGTFLKFMTE PESTAEFSAA
TGYMPTRVSA DMTSVLAKTP QIKTAMDQLA VTRVQDNARV FLPGADQEMA KAAAKILTQQ
GDVKATMTAL KSTLEGIYTK DVKPKLKS