Gene Arth_1862 details

Gene Information

Locus tag: Arth_1862
Symbol:
ID: 4445621
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 2096702
End bp: 2097994
Gene Length: 1293 bp
Protein Length: 430 aa
Translation table: 11
GC content: 64%
IMG OID: 639689677
Product: extracellular solute-binding protein
Protein accession: YP_831349
Protein GI: 116670416
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
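
The length fields in this record agree with its coordinates if Start bp and End bp are read as 1-based, inclusive positions on the replicon and the terminal stop codon is left out of the protein length. The record does not state that convention, so the Python sketch below treats it as an assumption; the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(2096702, 2097994)
print(length, protein_length(length))  # prints "1293 430"
```

Both values match the Gene Length and Protein Length fields listed above.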


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.144249
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTGCAC CAAACAAGCT CAGGATTTCG CTGGCCGTAA CAGCAGCAGC AAGCCTCATT 
GCAGTAAGCG GATGTGCCGC CAACACCCCC GCCGGAAGCT CGTCCGGGGG AGGGACCGAC
AAACTCGAAA TCACCTCCTG GTGGACCTCC GGATCCGAGG CCGACGCACT CAATGTGCTG
ATCGACGGGG TGAAGGCGGC CAAGCCGGGC CTGTCCGTGG ACAACGCCGC GGTCTCCGGC
GGCGGCGGCG CCAACGCGCG GCAGGCCCTG GCCGCGAGGC TCCAGGCCGG AAGCCCGCCG
GACGCCTGGC AGGTACACCC TGCCGGGCAG CTGAAGAGTT ACGTGGACGG CGGGCAGGTA
GCCGATCTGA CTGACTTGTG GACCGAGGGT GACTGGGCGT CGCAAATGCC CAAGGACGTG
GCCGAAGCCC AACAGGTCGA CGGCAAGTAC TACACCGTCC CGATCGGCGT CCACCGCGGG
AACGTCCTCT GGACCAACCC CGCCGTGCTC TCCAAGGCGA ACGTCACGAT CGATGCCGAT
GCCGGCATCG ACGGGCTGAT CTCCAGCCTG GAGCAGGTGC AGGCCAGCGG GACCACGCCG
CTCTGCCTGG GCGACAAGGA CATCTTCGCC TCATCCCAGC TGCTGGAGTC ACTCATCATG
TCCAGGGCGG GTGCGGACAA CTGGACGAAG CTGTTCACCA GCGAGTATTC CTTCGACGCC
CCCGAGGTGA AGCAGGCGCT GGAGGACTAC AAGACCATCC TCTCCTTCGC CAATAAGGAT
CACTCCGCTA TCACCTGGGA TGAAGCCGCG AAGAAAATGG CCGACGGCGA ATGCGCGGTC
AATCTCATGG GGGACTGGGC CTACGGTGAG TTGCTCAACG CCGGAAAGAA GCCCGGCACG
GACTTCGCCT GGGTGGCCTT CCCCGGCAAA GAGGACATCT TCGACTACGT AGGTGACGGC
TTCTCCATCC CGGCGAACAA TATTCCGCAT GCCGAAGCCG CCCGGGCCTG GTTGAAGACG
TTGATGGACC CGAAGATCCA GACCGAATTC GCCGCCAAGA AGGGTTCAAT CCCCGCAGTG
ACCTCGGCCG ACATCTCGGG GTTGTCCGAA TACCAGCAGG AAGCCGCCAA GAGCCTCGCC
TCAGGCGCAG TGGTCTCCTC GCTGGCCCAT GCCCAGGCCG CCGGAGCCGA ATTCGCCCAG
ACGTACGCCG ACGCCGTCTC CACCTTCAAC GGCAGCGGCA ACACCGATGC CTTCATCGCC
AGCATGACGC AGGCCCAGAA GACCCAGCTG TAA
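
The gene sequence is displayed in blocks of ten bases separated by spaces, so the whitespace has to be stripped before any analysis. The Python sketch below does that and computes a GC fraction, assuming GC content means the share of G and C among all bases; the record does not say how its GC content figure was computed or rounded. Only the first displayed line is used here, and the helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line only; paste every line to analyse the whole gene.
first_line = "ATGCGTGCAC CAAACAAGCT CAGGATTTCG CTGGCCGTAA CAGCAGCAGC AAGCCTCATT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq) * 100, 1))
```

Running the same two functions on all displayed lines joined together gives a value that can be compared with the GC content field in the gene information.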
 
Protein sequence
MRAPNKLRIS LAVTAAASLI AVSGCAANTP AGSSSGGGTD KLEITSWWTS GSEADALNVL 
IDGVKAAKPG LSVDNAAVSG GGGANARQAL AARLQAGSPP DAWQVHPAGQ LKSYVDGGQV
ADLTDLWTEG DWASQMPKDV AEAQQVDGKY YTVPIGVHRG NVLWTNPAVL SKANVTIDAD
AGIDGLISSL EQVQASGTTP LCLGDKDIFA SSQLLESLIM SRAGADNWTK LFTSEYSFDA
PEVKQALEDY KTILSFANKD HSAITWDEAA KKMADGECAV NLMGDWAYGE LLNAGKKPGT
DFAWVAFPGK EDIFDYVGDG FSIPANNIPH AEAARAWLKT LMDPKIQTEF AAKKGSIPAV
TSADISGLSE YQQEAAKSLA SGAVVSSLAH AQAAGAEFAQ TYADAVSTFN GSGNTDAFIA
SMTQAQKTQL
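
The protein sequence can be cross-checked against the gene sequence by translating the CDS with translation table 11. The Python sketch below hard-codes the amino-acid assignments of NCBI translation table 11 in TCAG codon order (they are the same as in the standard code; the two tables differ only in which start codons are permitted). The helper names are hypothetical, and only the first 60 bases and first 20 residues shown above are compared.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Whitespace is ignored; any base other than A, C, G, T raises KeyError.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


gene_start = "ATGCGTGCAC CAAACAAGCT CAGGATTTCG CTGGCCGTAA CAGCAGCAGC AAGCCTCATT"
protein_start = "MRAPNKLRIS LAVTAAASLI".replace(" ", "")
print(translate(gene_start) == protein_start)
```

Passing the full gene sequence to `translate` and comparing the result with the full protein sequence extends the check to the whole record.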