Gene Achl_2094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_2094 
Symbol 
ID7293555 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011886 
Strand
Start bp2359037 
End bp2360362 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content63% 
IMG OID643590493 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002488152 
Protein GI220912843 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.00000152964 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGATTCGA AATTGATGAC GCGCTCCCGC ACTGCCGTGG CCCTGACGGT GGCGTCCGCA 
GCGCTACTGA CGGGATGCGC CAGTGGCAGC GCAACGCCGG CGGCAAAGGA CGACGGGCAG
CCCATTGAAG TCTGGGCACG TGCCGGGACC GACGCCGCCA CCACCTACGC GGCGATGTTC
AAGGAGTTCA CGGACAAGAC CGGCGTGCAG GTCAACTTCC AGGGCGTCCC CGACCTTGAC
CAGCAGCTGC AGACCAGGGC CGCCTCGAAG AAGTTGCCGG ACATTGTCAT CAACGACTCC
GCGGCTTTGG GCAACTACAC CTCGCAGGGC TACCTCCAGA AGATCGACAA ATCTTCGGTG
ACCGGCAACG ACGCCATCGC CGACTCCCTG TGGAACGAAA CCACAGGCCT GGACGGCGCC
ACCTACGGGG TGCCGTTCTC CCGCCAGACC ATGGTGACCA TGATCCGCAA GGACTGGCGC
GAGAAACTCG GCCTGCCCAT CCCCACCACA CAGGAGGAAC TCGCGAAGCT CGCCACTGCC
TTCGCCACCC AGGATCCGGA TGGCAACGGC CAAGCCGACA CCTACGGCAT GACCGTTCCC
GGCTCCACCG AACGCGGCTA CCTCGGCTGG TGGGCCTCTT CCTACCTGTG GCAGGACGGC
GGCTCCTACC TCAAGGACGA GGGCAGCGGA AAGTTCTCCG CTTCCGCATC TTCGGCGAAG
GACGGCGTCA CCTGGATCAA GCAGCAGTTC TGCACCCCCG GAAACACCCA GCCAGGCGCA
CTGACCGCGG CCACCAGCGT CGCCTCCCCC TTCTTCCAGA CAGGCAAGAC CGGGATCATC
CTCACCGGCC CCTACAACTT CTCCTCGTTC GACACCGCGC TTGGGAAAGA CGCCTACGAA
GTCATCGAAA GCCCCAAGGG CACCGAAGAC AACACCGTCC TCGCGGAAGG TGAAAACATC
TACGTCACGG CCAGCAACGG CAAACCGGAC CAGACCAAAA AGGTCATCGA CTTCCTGGTA
TCTGCCGACG GCCAGAAGGC AGGCATGACA GCCGGCAAGC AGCCGGTGGT CCGGGTCCCG
GTGAACTCCG GTGTCGACGC CGCCGCCGTC TACAACGATC CGCGTTGGGC CGTTGTCCAA
GACGCCCTCA AGAATTCCTC CAAGGCATTC CCCTCCGCCA TCAACTTCGT GCCCATCAAA
CAGGCTGCCG CCGAAGCCCT GAACAAGATC GTCTCAGACT GCGGAGCGGA CAACATTGCG
TCCGGACTCA AGGATCTTGA CGCGGCCATC GACAACGAGC TCGAAAGCCA GAACGCCAAG
TCATGA
 
Protein sequence
MDSKLMTRSR TAVALTVASA ALLTGCASGS ATPAAKDDGQ PIEVWARAGT DAATTYAAMF 
KEFTDKTGVQ VNFQGVPDLD QQLQTRAASK KLPDIVINDS AALGNYTSQG YLQKIDKSSV
TGNDAIADSL WNETTGLDGA TYGVPFSRQT MVTMIRKDWR EKLGLPIPTT QEELAKLATA
FATQDPDGNG QADTYGMTVP GSTERGYLGW WASSYLWQDG GSYLKDEGSG KFSASASSAK
DGVTWIKQQF CTPGNTQPGA LTAATSVASP FFQTGKTGII LTGPYNFSSF DTALGKDAYE
VIESPKGTED NTVLAEGENI YVTASNGKPD QTKKVIDFLV SADGQKAGMT AGKQPVVRVP
VNSGVDAAAV YNDPRWAVVQ DALKNSSKAF PSAINFVPIK QAAAEALNKI VSDCGADNIA
SGLKDLDAAI DNELESQNAK S