Gene Achl_3534 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_3534 
Symbol 
ID7295015 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011886 
Strand
Start bp3915675 
End bp3917012 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content62% 
IMG OID643591940 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002489579 
Protein GI220914270 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.582574 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGGCC ACATCATGAA GAAAGCACCA AGCCCGGCTG GCCGGAGGTT TCTCTCCGTT 
GCGGCACTGG CATCCGTTAC CGCATTGGCG TTGAGCGCCT GTGGTGGGGG CGACACCAAC
AGCAACGCAC CCATCGCTGA AGAAACCGGC CCTGTTGAGA TCACCCTCGC AACCCCTGCC
TTTACAGGTG GTGGCGCAGG AAACCCTTAC CTGACATTGA TCGACGCCTT CAAGGCCAAG
AACCCAAACA TCACGGTCAA GCTGGTGGAG TCGCCCAATG ACCAGCACGG TCAGACCATG
CGCACCCAGC TCCAGGCAGG CAACGCGCCG GACATTTTCT ACGTCACGGC AGGGCGGGGA
AACAACCAGT CGTTCGCCTC GCTGGCCGAG GCAGGCTACC TGCAGGACCT GACGGACCAA
AAGTGGGCAG CCGACGCGAT CCCCGCTTCC GCCAAGAACC TGTACTACGA CGACGACAAG
GTCTTTGCCG TTCCGGCCGA CCTCGCACCG ATCACCATGC TTCAAAACAC GGGTGTCTTG
AAGGAGCTTG GGCTCCAGGA ACCCGCCACC CTGGATGAGC TGATCACCCA GTGCAAGACT
GCGCGGGCAG CCGGCAAGTC GTACTTCGCA GTGGCAGGAA CCTCCGGCGC CAACACGGGC
CTCCAGGCAA TGCAACTCGC GGCATCACTG GTTTACGCCA AGGACCCCCA GTGGGACGCC
AAACGCGCCA AGGAGGAAAC CACCTTCGCT GACTCGGACT GGAAGAAGGT GCTGGAGCAG
ATCGTGAAGT TCAAGGACGC CGGCTGCTAC CAGGACGGTG CCGCGGGTGC CGGCTTTGAC
CAGCTTTTCC CGTCAGTTGC CCAGGGCAAG GTGGCGGCAG CATTCGCCCC CGCCGGCGCT
GTTGCCGCAC TCCGGGCGCA GGTCAAGGAC GGATCCTTCG ATGTAGCTGT TCTGCCTGGC
GAAACGGCTG AAGACAGCCG GCTGATCGCA AGCCCGGGCA ACGCCATGGC CGTCAACGCT
GCCGGCAAGC ACAAGGGTTC CACCCTGAAG TTCCTGGAAT TCCTGGCCCA GCCCGCCAAT
CAGGATGCCT TGGCGAAAGC AAACGGAAAC GTATCTGTCA CCTCGGCACT TTCAGGTACC
GTGCCCGAGC AGTTCCCGCT GCTGGAACCG TACTTCTCTG AGCCCGAGAA AAAGATCGTC
ACCCAGCCCA ACTACCTCTG GCCCAACAGC GGCGTCTACG ACTCGCTCGG TACCGGCATC
CAGGGACTTC TGACCGGGCA GGCCACCCCT GACCAGGTTC TGAAGACCAT GGACGAGCAG
TACGACCGCG GCGCCTAG
 
Protein sequence
MKGHIMKKAP SPAGRRFLSV AALASVTALA LSACGGGDTN SNAPIAEETG PVEITLATPA 
FTGGGAGNPY LTLIDAFKAK NPNITVKLVE SPNDQHGQTM RTQLQAGNAP DIFYVTAGRG
NNQSFASLAE AGYLQDLTDQ KWAADAIPAS AKNLYYDDDK VFAVPADLAP ITMLQNTGVL
KELGLQEPAT LDELITQCKT ARAAGKSYFA VAGTSGANTG LQAMQLAASL VYAKDPQWDA
KRAKEETTFA DSDWKKVLEQ IVKFKDAGCY QDGAAGAGFD QLFPSVAQGK VAAAFAPAGA
VAALRAQVKD GSFDVAVLPG ETAEDSRLIA SPGNAMAVNA AGKHKGSTLK FLEFLAQPAN
QDALAKANGN VSVTSALSGT VPEQFPLLEP YFSEPEKKIV TQPNYLWPNS GVYDSLGTGI
QGLLTGQATP DQVLKTMDEQ YDRGA