Gene Arth_3465 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3465 
Symbol 
ID4443775 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3897763 
End bp3898998 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content63% 
IMG OID639691289 
Productextracellular solute-binding protein 
Protein accessionYP_832940 
Protein GI116672007 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2182] Maltose-binding periplasmic proteins/domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGTCAG CATTCACCAA GAGCCGTCCT CTCGCCGTTG CCGTCACCGC CCTCATCGGG 
CTGCCGGCCA TGCTCTCCGG CTGCGGTACC GCCGCCTCAT CATCATCCAC GGAGTCAGTC
TCGGAGATCT CCGTCATGGA CTACTACAAC AATGAACCTG ACAAGACCTT CATCGGCGAC
GCCCTCACCG CCTGCGGCAC CAAGGCCGGC GTCAACATCA AGCGGGAAAC GGTGCCCGGT
AAATCCCTGA TCTCCAAGGT CCTTCAGCAG TCCTCATCCA AGACCTTGCC GGACGTGCTG
ATGCTGGACA ACCCGGACCT GCAGCAGATT GCGGCCACGG GCGCCCTGGC TCCCCTGGCC
GATTTCAACA TCAGCACCGC CGACTTCGCC CCCGGCGTGC TCAGTGCGGG CACGTACAAG
GACAAGGTCT ACGGGCTGGC CCCCACTGTC AATACCATCG CCCTCTTCTA CAACAAGGAC
ATCCTGACGA AGGCCGGAGT GACCCCGCCG GCCACGTGGG ACGAGCTGGA AGCGGCCGCC
GCCAAACTGA CGTCCGGGGA CCAATACGGG CTGGCGTTCA ATGCCAACCC CACTTATGAA
GGCACCTGGC AGTTCCTGCC GGTCATGTGG TCCAACGGCG GCAACGAAAA GAACATCGAC
ACGGAGGAAA CGGCACAGGC CCTGCAGCTC TGGACCGACC TGGTGAAGGA CGGCTCGGTG
TCCTCCTCCG CCCTGAACTG GACGCAGGCC GATGTCAAGG ACCAGTTCCT GGCCGGCAAG
GCCGCCATGA TGGTCAACGG ACCGTGGCAG ATCCCCTCCC TCGACAAGCA GGCCTCCCTG
CAGTACGGCG TGGTGAAGAT ACCCGTCAGG GAGGCCGGCC AGACCGTTGT TGCCCCGCTC
GGCGGAGAAG TCTGGACTGT TCCGCAGACC GGGAACAAGG CCCGGCAGGC CAAGGCCGCA
GAGGTGGTCT CCTGCCTCAA CAGCGACGAA AACCAGCTGG CCATGGCCAA GGTCCGGAAC
ACTATCCCGT CCAAAACGAC CCTGGCAGCC AAGTTTGCCG AAGAAAACCC AAAACTCGCC
ACGTTCACCG AACTTGTGAA AACCGCCCGC GCCCGCACGG GACAGCTGGG TGAGGAATGG
CCCGCGCAGG CCACCAAGAT CTACACCGCC ATCCAGACGG CCCTCACCGG TAAGGCGACC
CCGTCCGAGG CCCTGAAGCA AGCGCAGGGA CAGTAG
 
Protein sequence
MKSAFTKSRP LAVAVTALIG LPAMLSGCGT AASSSSTESV SEISVMDYYN NEPDKTFIGD 
ALTACGTKAG VNIKRETVPG KSLISKVLQQ SSSKTLPDVL MLDNPDLQQI AATGALAPLA
DFNISTADFA PGVLSAGTYK DKVYGLAPTV NTIALFYNKD ILTKAGVTPP ATWDELEAAA
AKLTSGDQYG LAFNANPTYE GTWQFLPVMW SNGGNEKNID TEETAQALQL WTDLVKDGSV
SSSALNWTQA DVKDQFLAGK AAMMVNGPWQ IPSLDKQASL QYGVVKIPVR EAGQTVVAPL
GGEVWTVPQT GNKARQAKAA EVVSCLNSDE NQLAMAKVRN TIPSKTTLAA KFAEENPKLA
TFTELVKTAR ARTGQLGEEW PAQATKIYTA IQTALTGKAT PSEALKQAQG Q