Gene Achl_3131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_3131 
Symbol 
ID7294611 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011886 
Strand
Start bp3480280 
End bp3481548 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content61% 
IMG OID643591541 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002489181 
Protein GI220913872 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.00803878 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGGGGCGG CTGCTGCAAC AGCCGTACTG GCCTTGGCGC TCACAGGTTG TGGCAGCAGT 
CCGCAGGCCG GAAAGGTCGG CACGGCGGAG GATCCCGTGA CCATCCGTTT CGCCTGGTGG
GGCAACGATT CCCGCGCCAA AACCACGCTT GAAGTCATCA AGGACTTCGA AGCGGCCAAC
CCCACCATCA AGGTGCAGGG CGAGAACACT GAGTTCAGCT CCTACTGGGA CAAGATGGCC
ACGCAGATTG CGGGCGGCAC CACCCCGGAC GTTTTCGCCA TGAGCGGTTC CTACCCCAGC
GAATACGCGT CGCGCGGGGT GCTGCTGGAC CTGGACAAGG TCAAGGACCA GATCGATACC
TCCAAGTTTG CCGACGGAAC CGTGGAGTTG GGCCAGCTGG ACGGCAAGCA GTACACCATC
ACCGCCGGCG TCAATGCCAT GTCCATGGTC CTTGATCCCA CAGTGTTTGA AGCCGCCGGC
GTACCACTGC CGGACGACGA AACCTGGACC TGGGACGACT ACGTCGATAT TGCCGCCAAG
ATCAGCAAGA ACTCCCCCGC CGGCACCTTC GGCACCACGC CGATGTCCAA CGATTCGTTC
GTTGCCGTCT GGGCACGCCA GAGCGGCGAA GAGCTGTACA CGGACGACGG AAAGAAGATG
GGTATCAGCG AGGGCACCCT CGCCAAGTGG TTTGAGTTCA ACAAGAAACT CATGGACACC
GGCGGCGCAC CTTCCGCGTC GCAGACCGTC GAAGACGGCT CAGCGCAGCC GGAACTGACG
CTGATGGGCC AGGGTAAGCA GGCGATGAAG GTGTCGTGGA GCAACCAGAT GACCTCTTAC
TCGGGTGCGC CCCTGACCAT GGTGAAGCTG CCCGGGGAAA GCAAGCAGCC GGGAACCTGG
CTGCGCTCCT CCATGGAGTA CGCCATCTCG TCCAAGTCCG CCCAGTCCAA GGAAGCTGCC
CTTTTCATCA ATTATTTGGT GAACAACATG GATGCTGCCA GCAAGATCAA GAGTGACCGC
GGCATGCCCG CCAACACCGA TCTCAAGGCG GGCATCACCC CCCTGCTGAA GGAAACCCAG
CAGAAGGAGG CGGGATACCT GGACCGCATC GCCGAGCTGG ACGTCAAGCC GCCCCAGCCG
TTCCCGGCAG GTTCTTCTTC CACCCTGGAA GTTTTGAACC GATACAACAC GGATGTACTC
TTCGGGAAGA TCTCGCCGCA GGATGCGGCA AAGGGCGTCA TCAGTGAGGT CAATTCGAAC
CTGGGGTAG
 
Protein sequence
MGAAAATAVL ALALTGCGSS PQAGKVGTAE DPVTIRFAWW GNDSRAKTTL EVIKDFEAAN 
PTIKVQGENT EFSSYWDKMA TQIAGGTTPD VFAMSGSYPS EYASRGVLLD LDKVKDQIDT
SKFADGTVEL GQLDGKQYTI TAGVNAMSMV LDPTVFEAAG VPLPDDETWT WDDYVDIAAK
ISKNSPAGTF GTTPMSNDSF VAVWARQSGE ELYTDDGKKM GISEGTLAKW FEFNKKLMDT
GGAPSASQTV EDGSAQPELT LMGQGKQAMK VSWSNQMTSY SGAPLTMVKL PGESKQPGTW
LRSSMEYAIS SKSAQSKEAA LFINYLVNNM DAASKIKSDR GMPANTDLKA GITPLLKETQ
QKEAGYLDRI AELDVKPPQP FPAGSSSTLE VLNRYNTDVL FGKISPQDAA KGVISEVNSN
LG