Gene Arth_0048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0048 
Symbol 
ID4447483 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp53340 
End bp54881 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content62% 
IMG OID639687842 
Productextracellular solute-binding protein 
Protein accessionYP_829549 
Protein GI116668616 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTTCGGT TCCGTTGTCA CGGATGGACC CGGCGGATGC CAAAACCTGC CAAATCGGCC 
AAAATATGCC AAACCGTTTC TTCTGCCTTC GACGTCGACT CTAGGGCTGC GCCCGCCCAT
GCGTCAAGGC CCCAAATAAT TTGCATTGTG ACGCTTGACA CGCCTGAACG GACGCCCCTA
GGGTGTTGCC AAGTCGTTTT GGCAAAACGA TTCTTCACCC CACAAACATT CTTGCCATTC
CGTTCCGAGA GAAAGTCGAC AGCGATGTCC ACTCGAAAGA CCATCTCCAG GCTCGCCGCC
ATCGGCGGCC TTTGCACGGC CGTGGCCCTG ACGGCCACCG CATGCGGCGC TGGGGGTCCG
GCGTCGTCCG GCAGCGCCGC AAGCTCCGTC AACGTCCTTG TCGAAGCCGG CGGGCACGCC
GAGCTCGCCG GCGTTGCCGA GGCCTGCAAA AAGGACACCG GCCTCGACGT CAACTTCGTC
GAACTGCCCT ACGACGGCTT GTTCAACAGG CTCTCCAGCG AATTTTCCTC CGGCACTGTC
TCCTTCGATG TCGCCGCTCT GGACTCCGTC TGGCTCCCCA GCTTCAAGGA TGCGGTCCAG
CCCATCGACG AGCTCTTCAC CGATGAGGCC AAGAAGGACA TCTTCCCCGC ACTGGTCAAG
GAAGCCAACG TTGATGGCCA CTTCATCGGC ATGCCCGCCT GGACCAATGC CGAAATCATC
CTGTACCGCA AGGACCTCTT CGAGGACGCC AAAAACAAGG CCGACTTCAA GGCAAAGTAC
GGATACGAAC TTGCAGCCCC CACCACCTGG AAGCAGTACC AGGACATCTC CGAGTTCTTC
ACCAAGGATG GCATGTACGG CACCGACGTG AAGGGTGCCG TCGAAACCGA ATGGCTGGCC
CATGTTCTCC AGGCCGGGTC CCCGATGGTC CTGGACGACC AGAACAACGT CGTGGTCGAC
AACGCAGCCC ACAAGGAAGC CCTCGATTTT TACACGAGCC TTGTTAAGTC CGCGCCGTCC
GGAGCGGCAC AGGTCGACTG GGCTGCCGCG CAGAACCTCT TCAACCAGGG CAAGACCGCG
ATGACCAGGT TCTGGGCCCA CGCCTACCGC CAGATCCCCG CCGACGCCGC TGTCTACGGC
AAGGTGGGCG CGGCTCCCAT GATCGGCGGA TCCGCCGGCG TCGCCGGCGT CCCGGGACCG
TGGTACCTCT CCGTCCCCAA GGCGACGAAG AACGCAGACG CCGCCAAGAA GTTCATCAAG
TGCGCCTACG ACCACAACGA CCTGGGCATC GAGTCCAAAC TGGGGCTCGC CGCCCGCATC
TCGGCCTTCG AGAAGTACCA GGACAAGCCC GGTTATGAGA GCTTCAAGCC GTTGATCGAG
ACGCTCAACG GCGAGGCCAC GGCGACCCGC CCGGCAACGG CGAAGTGGCA GCAGATTGTG
GACACGGTCC TGGTACCGAC GCTGCAGAAG GCAGTGGCCG GAGGGGACAG CGCGTCCCTC
CTGGCTGAAG CCAAGACCAA GATCCAGGCC CTCGTCAAAT GA
 
Protein sequence
MLRFRCHGWT RRMPKPAKSA KICQTVSSAF DVDSRAAPAH ASRPQIICIV TLDTPERTPL 
GCCQVVLAKR FFTPQTFLPF RSERKSTAMS TRKTISRLAA IGGLCTAVAL TATACGAGGP
ASSGSAASSV NVLVEAGGHA ELAGVAEACK KDTGLDVNFV ELPYDGLFNR LSSEFSSGTV
SFDVAALDSV WLPSFKDAVQ PIDELFTDEA KKDIFPALVK EANVDGHFIG MPAWTNAEII
LYRKDLFEDA KNKADFKAKY GYELAAPTTW KQYQDISEFF TKDGMYGTDV KGAVETEWLA
HVLQAGSPMV LDDQNNVVVD NAAHKEALDF YTSLVKSAPS GAAQVDWAAA QNLFNQGKTA
MTRFWAHAYR QIPADAAVYG KVGAAPMIGG SAGVAGVPGP WYLSVPKATK NADAAKKFIK
CAYDHNDLGI ESKLGLAARI SAFEKYQDKP GYESFKPLIE TLNGEATATR PATAKWQQIV
DTVLVPTLQK AVAGGDSASL LAEAKTKIQA LVK