Gene Achl_1199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_1199 
Symbol 
ID7292644 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011886 
Strand
Start bp1320192 
End bp1321490 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content65% 
IMG OID643589604 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002487279 
Protein GI220911970 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0000000829014 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGCGTC ACAGCACCCA CCGCCTCCGC AAATCCGTCG CAATGCTCAG CGCGGCATGC 
CTGCTTGGGC TGACAGCCGC GTGTTCCGCA CCGTCGGGCG GAGGAGGCGA CGGGCCGGTG
GAGATCCGGT TCGCCTGGTG GGGCAATGCG GGCCGCGCAG AGCTCACCAA CAAGGCCATC
GCCGAATTCG AAGCCGCGAA CCCTGACATC AAGGTGAAGC CCGAGTTCGG GGACATCGGC
GGTTACTTCG ACAAACTCGC CACCCAGGTG GCCGCGAACG ATGCACCGGA CGTCATCACC
ATGGGCGGTG CCTACCCGGC GGAGTATGCC AACCGTGGGG CGCTGCTGGA CCTCTCAACG
GTCCGGGGTC AACTCGACCT CAGCAAGATG GACCAGGGGG CGCTGGACAA CGGCCAGGTG
CAGGGCAAGC AGTACGGCAT TTCCACCGGT GCCAACGCCC TCGCCATCGT GGTGAACCCC
GCCGTTTTTG CGGCGGCCGG CGTTCCCCTC CCGGACGATG CCACCTGGAG CTGGGAGGAC
TTTGCTGAAA CCGCCGCGAG CGTGACGGCG AAGAGCCCCA AGGGCACCTC TGGGACGGCA
ACGGTCCTCA CCCACGACTC CCTGGACGCC TTCGCACGGC AGCGGGGGGA GTCCCTCTAC
ACCCAGGACG GCCAGCTTGG TCTCACCAAG GAGACAGTCC AGGACTACTT CGACTTCTCC
CTCAAACTCA GCGAGTCCGG CGCTGCGCCC AACGCCTCCG AGACAGTGGA AAAGCTCAGC
GTCAGCACCG AACAAACACT CATGGGCATG GGCCAGGCCG GCATGATGCT CACGTGGAGC
AACTCTTTGA CGGCGCTCAG CAAGGCCTCC GGAGCCGAAC TGAAACTCCT CAAGCTCCCC
GGCGAGAAGC CCACACCGGG CATCTGGCTC CAGTCATCGC AGTTCTACAC CATTTCCGCC
CGGAGCAAGC ACACCGAAGC CGCGGCCAAG CTGGTGAACT TCCTGGTCAA TAACCAGGCC
GCCGCCAAGA TCATCCAAAG CGACCGCGGC GTGCCCAGCA ACCCCGAAAT GCGCACGGCC
ATCCAGGACC TCCTGACGCC GCAAGGCAAG GTCGAAGCTG CCTACATCGG TGAGGTCGGC
AAGATGGACT TCGCGCCCAC CTACATCGGG CCCACGGGGT CGACGGCGGT CTCAGAGATC
ACGGCGAGGA TCAACACCGA CGTCCTGTTC AAGCGGTTGA CCCCCGAAAA GGCGGCCGAA
CAGTGGATCA GTGAAAGCAA GGCCGCTATC GGCAAGTAG
 
Protein sequence
MTRHSTHRLR KSVAMLSAAC LLGLTAACSA PSGGGGDGPV EIRFAWWGNA GRAELTNKAI 
AEFEAANPDI KVKPEFGDIG GYFDKLATQV AANDAPDVIT MGGAYPAEYA NRGALLDLST
VRGQLDLSKM DQGALDNGQV QGKQYGISTG ANALAIVVNP AVFAAAGVPL PDDATWSWED
FAETAASVTA KSPKGTSGTA TVLTHDSLDA FARQRGESLY TQDGQLGLTK ETVQDYFDFS
LKLSESGAAP NASETVEKLS VSTEQTLMGM GQAGMMLTWS NSLTALSKAS GAELKLLKLP
GEKPTPGIWL QSSQFYTISA RSKHTEAAAK LVNFLVNNQA AAKIIQSDRG VPSNPEMRTA
IQDLLTPQGK VEAAYIGEVG KMDFAPTYIG PTGSTAVSEI TARINTDVLF KRLTPEKAAE
QWISESKAAI GK