Gene Achl_0623 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_0623 
Symbol 
ID7292053 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011886 
Strand
Start bp669799 
End bp671085 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content62% 
IMG OID643589021 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002486710 
Protein GI220911401 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.58509 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACCC CTAGATTCCT TTTGCCGGTT GCCACTGCCG GCGTTCTGGC CCTTTCCCTG 
TCCGCCTGTG CCGGCGGAGG AGGTGGCGGA ACCTCGGGCG GCGGCAGCGA CGCTGAAGCC
AATCTCGACA GCCGTGGCCC CATCACCTAC GTGCAGGGCA AGGACAACAG CAACGTTGTC
CGTCCGCTGA TCGAAAAATG GAACGCTGCG CACCCCGACG AAAAGGTCAC TTTCAAGGAG
CAGACGGACA ACGCCGACCA GCAGCACGAT GACCTGGTCC AGAACTTCCA GGCAAAGAAC
GCGGACTATG ACGTAGCCAG CGTGGACGTC GTCTGGACGG CCGAGTTCGC CGCCAAGGGC
TGGCTCCAGC CGCTCAAGGA CAAGATGGCC ATCGACACCA AGGGCATGCT GGAGCCCACC
ATCGAGGCCG GCTCCTACAA GGGCACCCTC TATGCGGCTC CCGTTTCCTC CGACGGCGGC
ATCCTGTACT ACCGCAAGGA TCTGGTGCCC ACACCGCCCA AGACCTGGGA CGAGATGATG
GGCATGTGCT CCATCGCCAA GCAGAACAAC ATGGGCTGCT ACGCCGGCCA GTTCAAGCAG
TATGAGGGCC TCACCGTCAA CGCCTCGGAA GCAATCAACT CCGCCGGCGG ATCCGTCCTC
GACAAGGACG GCAAGCCGAG CCTGAACACC CCCGAGGCCG AAGCAGGCCT GGACAACCTG
GTGAAGGCTT TCAAGGACGG CAACATCCCG GCTGAAGCCA TCACCTACCA GGAAGAGGAA
AGCCGCCGTG CGTTCCAGGA CGGCAAGCTC CTGTTCCTCC GCAACTGGCC TTACGTCTAC
AACCTGGCAA CCACTGAAGG TTCCTCCAAG GTCAAGGACG TTCTGGGCAT GGCGGCACTT
CCGGGCAAGG ACGGCCCCGG TGCTTCTTCC CTCGGTGGCC ACAGCGCAGC CGTCAGCGTC
TACTCCGACC ACAAGGCCAC GTCCCTGGAC TTCGTGAAGT TCCTGGTTGA AGAAGAGCAG
CAGAAGTTCT TCGCAACCCA GGGTTCGCTT GCCCCGGTCC TCGGTGACCT GTACGAGGAC
CAGGAACTGG TTGCAAAGCT GCCTTACCTG CCGGTCCTCA AGACCTCCAT CGAAAATGCT
GTTCCCCGGC CGGTAACCCC CTTCTACCCT GCAGTCACCA AGGCCATCCA GGACAACGCC
TACGCGGCGC TGAAGGGTGA AAAGCCTGCC AAGGATGCGC TCTCCGACAT GCAGAAGTCC
ATCGAGACCG CCGGCGCAGG ATCGTAA
 
Protein sequence
MKTPRFLLPV ATAGVLALSL SACAGGGGGG TSGGGSDAEA NLDSRGPITY VQGKDNSNVV 
RPLIEKWNAA HPDEKVTFKE QTDNADQQHD DLVQNFQAKN ADYDVASVDV VWTAEFAAKG
WLQPLKDKMA IDTKGMLEPT IEAGSYKGTL YAAPVSSDGG ILYYRKDLVP TPPKTWDEMM
GMCSIAKQNN MGCYAGQFKQ YEGLTVNASE AINSAGGSVL DKDGKPSLNT PEAEAGLDNL
VKAFKDGNIP AEAITYQEEE SRRAFQDGKL LFLRNWPYVY NLATTEGSSK VKDVLGMAAL
PGKDGPGASS LGGHSAAVSV YSDHKATSLD FVKFLVEEEQ QKFFATQGSL APVLGDLYED
QELVAKLPYL PVLKTSIENA VPRPVTPFYP AVTKAIQDNA YAALKGEKPA KDALSDMQKS
IETAGAGS