Gene Achl_2663 details

Gene Information

Locus tag: Achl_2663
Symbol:
ID: 7294139
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter chlorophenolicus A6
Kingdom: Bacteria
Replicon accession: NC_011886
Strand:
Start bp: 2990860
End bp: 2992266
Gene Length: 1407 bp
Protein Length: 468 aa
Translation table: 11
GC content: 66%
IMG OID: 643591073
Product: extracellular solute-binding protein family 1
Protein accession: YP_002488717
Protein GI: 220913408
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
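
The Start bp, End bp, Gene Length and Protein Length fields above agree with one another, which a short sketch can confirm. It assumes the coordinates are 1-based and inclusive and that the CDS ends with one untranslated stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2990860
end_bp = 2992266

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes one terminal stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1407 468
```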


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCTGCG CCCGCAGTAC CCCACCGGCA CCACAGAAAG GCACAACCTT GGCACGAAAT 
ATCCTCACCT CCCCGGTGGG CCGGCGGCTC TTCCTCTCCC TGGCCGGCGC CGGGGCGGGC
GCGGCGGCCC TCACCGCGTG CGGCGGTCCC TCCACGTCGG CGGGCTCCGA ACAGACCACC
GCGGCGCTTG ACTTCGACGG CGTGAAGCCC GCCGCATCAT TCGACTTCTG GTCCAACCAC
CCCGGCAAGT CGCAGGACGT GGAGAAGAGC ATCATCGCGA AGTTTGAGGC CAAAAACCCT
GGCATCAAGG TCAACCTGGT CACGGCCGGT GCGAACTATG AGGAGATTGC CCAGAAGTTC
CAGACCGCAC AGGCCGCCAA ATCGGGCCTT CCCGCGCTGG TGGTCCTCTC CGACGTGTGG
TGGTTCCGCT ACTACCTGAA CGAAAGCATC ATTCCGCTGG ACGCCCTCAT CAAGCAGCTG
GACGTCAAGC TGGACGATTT CCGTACGTCG CTGGTGGACG ACTACAAATA CGACGGCCAG
CAGTGGGCGC TCCCCTATGG CCGTTCCACC CCGCTGTTCT ACTACAACAA AGACCACTTC
GCGGCCGCCG GCCTTCCGGA CCGCGCGCCC GCCACCTGGC AGGAATTCGC CGAGTGGGCG
CCGAAGCTCA AGGCAGCCAC CGGCGCCCAG TACGCCTTCA TGCACCCGGC CCTGGCCGGC
TACGCTGGCT GGACCCTGCA GAACAACCTG TGGGGCGAGG GCGGCGGCTG GTCCAAGGAC
TGGGACATCA CGTGCGACTC GCCCGAGTCG GTAGCCGCGC TCCAGGCGGT GCAGGACTCG
GTCTACAAGG ACAGCTGGGC CGGGGTGTCC TCGAAGGAGT CTGCTGACGA CTTCGCTGCG
GGCCTCGCAT CGGCCACTCT GTCCTCCACG GGCTCGCTCA TCGGCATTCT GAAATCTGCC
TCTTTCAACG TGGGCGTCGG ATTCCTGCCG GGCGGCTCCA AAGCCAAGAC AGGCGTGTGC
CCCACCGGCG GGGCGGGCCT GGGCATCCCC AGCGGCGTGA CCCGCGAAGA ACAGCTCGCA
GCCGCAATGT TCCTCCAGTT CGTCACCGAA CCGGAGAACA CCGCTGAGTT CTCCGCTGCC
ACCGGCTACA TGCCCACGCG CACGTCAGCG GACATGACCG CGGTGCTTGC CAAGACACCA
CAGATCAAGA CTGCCATGGA CCAGCTGGCC GTCACCCGCG TCCAGGACAA CGCCCGCGCG
TTCCTGCCCG GCGCCGACCA GGAAATGGCC AAGGCCGCCG CGAAGATCCT CACCCAGCAG
GCCGATGTCA AAGCCACCAT GACGGAGCTG AAGGCCACCC TTGAGGGCCT GTACACCAAG
GATGTGAAGC CCAAGCTGAA GGCATAG
 
Protein sequence
MGCARSTPPA PQKGTTLARN ILTSPVGRRL FLSLAGAGAG AAALTACGGP STSAGSEQTT 
AALDFDGVKP AASFDFWSNH PGKSQDVEKS IIAKFEAKNP GIKVNLVTAG ANYEEIAQKF
QTAQAAKSGL PALVVLSDVW WFRYYLNESI IPLDALIKQL DVKLDDFRTS LVDDYKYDGQ
QWALPYGRST PLFYYNKDHF AAAGLPDRAP ATWQEFAEWA PKLKAATGAQ YAFMHPALAG
YAGWTLQNNL WGEGGGWSKD WDITCDSPES VAALQAVQDS VYKDSWAGVS SKESADDFAA
GLASATLSST GSLIGILKSA SFNVGVGFLP GGSKAKTGVC PTGGAGLGIP SGVTREEQLA
AAMFLQFVTE PENTAEFSAA TGYMPTRTSA DMTAVLAKTP QIKTAMDQLA VTRVQDNARA
FLPGADQEMA KAAAKILTQQ ADVKATMTEL KATLEGLYTK DVKPKLKA
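
The sequences above are printed in space-separated blocks of 10 residues, so the grouping has to be removed before the lengths or GC content can be compared with the Gene Information fields. The following is a minimal sketch of that step. The helper names `clean` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of 10."""
    return "".join(seq.split())


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a nucleotide string."""
    dna = clean(dna).upper()
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, copied from the listing.
first_line = "ATGGGCTGCG CCCGCAGTAC CCCACCGGCA CCACAGAAAG GCACAACCTT GGCACGAAAT"

print(len(clean(first_line)))             # number of bases on this line
print(round(gc_fraction(first_line), 2))  # GC fraction of this line only
```

Passing the full gene sequence to the same helpers gives values that can be compared with the Gene Length and GC content fields.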