Gene Arth_3687 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3687 
Symbol 
ID4443688 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4147156 
End bp4148517 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content63% 
IMG OID639691511 
Productextracellular solute-binding protein 
Protein accessionYP_833162 
Protein GI116672229 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGACTTGA TTCGACCTGC TTCTCCGCTC CAGGAGGACA CCGCAACATC ACAGACGTCC 
GCACACATCA CTGAACTGCG CGCAGGTTCC TTGCGGACCC GCTTGGTGAC TGTCGCCGCA
GCGGCTATCG CTGCGGGCCT CCTGATCAGC GGTTGCGGCG GGGGAGCCGC ACCCCAAGGT
GCGGAACAGG CGGCTGCTGC CGATCCGGCG TCGGGCGCTA ATGCCGGCGG CAGCGTCAAC
ATCTGCGGCG TCAAGGATGC GTCGGGGATC TACAAAGGCA CTGCGGCGGC ATTCACCAAG
GCGAACGGCA AGGTCACGGC CAAGTACACC GAGATCGGTG CCACCACAGA CGAGGCCCGC
ACCCAGATCG TCCAGCGGCT CGAAGGTAAG TCCACTGAAT GCGATATCTT CCTCACGGAC
GTCATTTGGA CCTCCGAGTT CGCATCCCAG GGCTGGCTGC TGGACCAGAC CAAGCTCGTA
GAGGCCAACA AGGACCGGCT CATTCCTTCC ACGGTGGAGA CCACCAAGTA CCAGGACAAG
TACTGGGCCT CGCCGTTCTT CACGAACGCC GGATTGGTCT ACTACCAGAA GGACAAGGTG
GCCAAGCCTG AGACGTGGCA GCAGCTCTAC GCAGAAGCAG CCAAGGCGCC CGGCAACGGA
TACGTCTACC AGGGCAAGCA GTACGAGGGC CTGACGGTGA ACTTCCTCGA AATGCTCTAC
AGCGCGGGCG GCGAAGTACT CGACGACAAG GGCGAGGTGG CGATTGATTC GCCGGAGACC
CGCGAGGTCC TCGATTTCAT GAGCAACGGG CTCAAGAACG GCTCAGCTGA CCGCGCAGTC
CTGACCTATA ACGAAGATCC CGCCCGTCTC GCTTATGAGT CCGGCAACTT CGGCTACCAG
CGCAACTGGC CGCATGTATA CCGCCTGCTC AACGCCACAT CACTGGCCGG CAGTTTTGGC
GTGGCGCCGC TGCCCGCATG GGAAGGCGGC AAGGCGTCCG GCGTGCTGGG TGGCTGGAAC
CTGGCCATCT CCGCCCACGC CACGAACCAG TCCGGGGCCG TGGCGTTCAT CGACTTTGCC
ACCACGCCGG AATGGCAGAA GCACGTGGCC ATGGATTACT CCCAGGCCCC GGTCAATGAA
GCCGCCTATT CTGATGCGGC AGTTCTCCAA AAGATGCCCT TCGCCACCGA ACTGCTCGCG
TCCGTCAAGG GTGCCAAGCC CCGCCCGATC TCCCCGGTCT ACCCGCAGAT CTCCCAGGCG
ATCTACAAGA ACGTATATGC CGTCCTTTCC GGTACAGCCT CCGCCGAGGA CGCCGTGAAA
AAGATGGCCG AGGAGATCAC CACCGCCAAG GCGAGCTTTT AG
 
Protein sequence
MDLIRPASPL QEDTATSQTS AHITELRAGS LRTRLVTVAA AAIAAGLLIS GCGGGAAPQG 
AEQAAAADPA SGANAGGSVN ICGVKDASGI YKGTAAAFTK ANGKVTAKYT EIGATTDEAR
TQIVQRLEGK STECDIFLTD VIWTSEFASQ GWLLDQTKLV EANKDRLIPS TVETTKYQDK
YWASPFFTNA GLVYYQKDKV AKPETWQQLY AEAAKAPGNG YVYQGKQYEG LTVNFLEMLY
SAGGEVLDDK GEVAIDSPET REVLDFMSNG LKNGSADRAV LTYNEDPARL AYESGNFGYQ
RNWPHVYRLL NATSLAGSFG VAPLPAWEGG KASGVLGGWN LAISAHATNQ SGAVAFIDFA
TTPEWQKHVA MDYSQAPVNE AAYSDAAVLQ KMPFATELLA SVKGAKPRPI SPVYPQISQA
IYKNVYAVLS GTASAEDAVK KMAEEITTAK ASF