Gene Arth_3895 details

Gene Information

Locus tag: Arth_3895
Symbol:
ID: 4445096
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 4386376
End bp: 4387677
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 63%
IMG OID: 639691720
Product: extracellular solute-binding protein
Protein accession: YP_833370
Protein GI: 116672437
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCGCT TTTCAACAGC CGCGCGCGTA TTGTCCGTGG CCGCCCTGAC CATCATGGGC 
GCAGGCCTCG CTGCCTGCGG CGGGGGAGGG GGAGGCAGCT CCGCCGACTC GGCCACGGCC
GCCAAGGGCC CCATCAAGAT CTGGTACTCC AACAACGAGT TCGAGGTGAA GTGGGGCAAG
GCAATGGTGG AGTCCTGGAA CGCCGCCCAC CCGGACGAGA AGATCGATGC CCAGGAAATC
CCGGCTGGCA AGAGCTCCGA GGAAGTCATC GGCGCAGCCA TTACAGCGGG CAATGCCCCG
TGCCTGGTCT TCAACACCGC TCCGGTGGCG GTGCCGCAGT TCCAGAAGCA GGGCGGACTC
GTCGCCCTCG ACTCCTTCCC CGACGGCGCG CAGTACATCA AGGACCGCAC AGGCGACCTC
GCCGATCAGT ACAAGTCCAC GGACGGGAAG ATGTACCAGC TTCCGTGGAA ATCCAACCCG
GTGGTCCTGT TCTACAACAA GGACATCTTC GCCAAGGCCG GCCTGGACCC CGAAAACCCC
AAGCTCGGCT CGCACGCCGA ATTCCTCGAA ACCGCACGGA CCCTCGTAAA GTCCGGCGCC
ACAGCCAACG CCGTATGGCC CTCCCCGGCC AGCGATTTCT TCCAGTCGTG GTTCGACTTC
TACCCGTTCT ACGCTGCCAA CACCGACGGC ACTCCGCTCC TGAAGGACAG CAAGGCCACC
TTCGACAGCG AAGAGGGCAA GCAGGTCGCC ACCATGTTCG CGACCCTCTA CAAGGAGAAC
CTGGCGTCCA AGGAGGTCTT TCAGGGCGAC GCCTTCGGTG AGGGAAAATC AGCCATGTCC
CTGGCCGGCC CGTGGGCCAT CGCCGCGTAC AAGGACAAGG TCAACTGGGG CGCCGTGCCG
GTGCCCGCCG CTGAATCCAA GGCTGGTACG TCCACCTTCT CGGACGCCAA GAACGTGGCC
ATGTACTCCG CCTGCGAGAA CCAAGGCACA GCCTGGGAGG TTATGAAGTT CGCCACCAGC
CAGGAGCAGG ACGGCAAGTT GCTGGCCGAG ACCGGCCAGA TGCCCATGCG CAAGGACCTG
ACCACTGCGT ACGCCGACTA CTTCACCAAG AACCCGCTCT ACACCCAGTT CGCCGAGCAG
GCCGGACGCA CCGTCGAGGT GCCCAACGTG CCCAACTCGG TTGAGGTCTG GCAGACCTTC
CGGACCGCCT ACGCCAAGTC AGTGATCTTC GGCAACGAAA GCATTGACTC CGCGTTCAAG
GGCGCCGCAG AAAAGATCGA CCAACTGGCA GCACAGAAGT AG
 
Protein sequence
MKRFSTAARV LSVAALTIMG AGLAACGGGG GGSSADSATA AKGPIKIWYS NNEFEVKWGK 
AMVESWNAAH PDEKIDAQEI PAGKSSEEVI GAAITAGNAP CLVFNTAPVA VPQFQKQGGL
VALDSFPDGA QYIKDRTGDL ADQYKSTDGK MYQLPWKSNP VVLFYNKDIF AKAGLDPENP
KLGSHAEFLE TARTLVKSGA TANAVWPSPA SDFFQSWFDF YPFYAANTDG TPLLKDSKAT
FDSEEGKQVA TMFATLYKEN LASKEVFQGD AFGEGKSAMS LAGPWAIAAY KDKVNWGAVP
VPAAESKAGT STFSDAKNVA MYSACENQGT AWEVMKFATS QEQDGKLLAE TGQMPMRKDL
TTAYADYFTK NPLYTQFAEQ AGRTVEVPNV PNSVEVWQTF RTAYAKSVIF GNESIDSAFK
GAAEKIDQLA AQK