Gene Achl_1103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_1103 
Symbol 
ID7292548 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011886 
Strand
Start bp1215643 
End bp1217217 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content67% 
IMG OID643589509 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002487184 
Protein GI220911875 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTTC TTCCTCAAGG ACGTGAGATT TCCCGCCGCC GCCTGCTGCA GTTCGGCACG 
GCCGCAGGGT TTCTGCTGGG CACCGGCAGC CTGGCCGGCT GCGCCGGTCC CACCGGCCTC
CCCGGACCCA GCACCCTGAC CCTGGCCCTG AACCGCTCCC TGGTCAGCCT GGACAACAAG
CTCAACCAGT TCGATGCCGC CGTCACCGTA CAGCGCTCCG TCCGCCAGGG CCTCACCGCC
ATCGGCCCCG AAACCAAGCC CGTCCTGGTG CTGGCCGAAC GCTTCGAAAT GACCGGCCCC
ACCGAATGGA CCGTCACCCT CCGCGAAGGC ATCCGCTACT CGGACGGCAG CCCCGTGCAG
ATCGAGGATG TGGCCACCGC CCTGAAGATG TACAAGCAGG TGCAGGGCTC CTTCGTAGCA
GGCTTCTTCC CCGAATTCCC CGAGGTTGTC CCCGTGGACA ACCGCACGTT CAAGATGGTG
TCCAAGAACC CCGTCCCCAT CCTGGACAGC CTCATGAGCA TGATCCTGAT TACCCCGGCC
GCACAGAACA AGCCGGAGGA ACTCCAGGAA GGCGTGGGCA CCGGCCCCTA CAAGGTCACC
AAGTTCAACC GCGGCGCCGG CACCTACAGC CTGGCACGCA ACGAGAACTA CTGGGGCCCG
GCGCCGGAGA TCGAGAACGT GGAAGTCCGG TTCCTCCCCG AGGAATCCAG CCGCGTCATC
GCCCTGCGCA GCGGCGAGGT GGACATCATC GACTCCATCA CCCCGGACTC CCGCGAACAG
CTGGCCGGCC TCCCCGGCGT CCAGCTGGCC GAGGCGTCCA GCCTGCGGCT CAACCAGATC
TTCTACAACT TCCGCAAGCC CGCCGGCCAC CCCCTGGCCG ATGTCCGCGT CCGTGAAGCC
CTCAGCTGGG CCATCGACGG CGAATCCCTG GTCAAGGACG TGCTGGTGGA CTCCGTCAGC
GCCGCCGAGG GCGTCACGCC CGGCAGCCTC ACCGGCTACC ACAAGACCGG CACCTACACC
TACGATCCGG AGAAGGCCAA GGCCCGGCTC GCCGAGCTCG GCGTCAAGGA CCTCACCCTG
AAGATCATCT GGGAAACCGG AGAATTTGCC TCCGACACCT CCGTGATGGA AGCCCTGGTG
GAAATGTTCG GCAAGATCGG CGTCAAAACC GAACTCCAGC AGTTCGAACC CGGCGGCAAC
ATCCTGGCCT GGCGCCAGGG CAAGCAGGGC GACTGGGACC TGCTGGGCAA CGGCTTCTCC
AGCCCCACCG GCCTGGCCAT CACCATGATG CAGGGCATGT ACGCCGGCAC CCCGGAGAAG
GAAAAGACCC GCGACACCTA CCAGGGCTAC GTCATCCCCG AGGTGCAGGC CAAGATCCAG
GCCGCCTCCT CCGAGGTGGA CGCCACCCGC CGGCAGGAAC TGCTGGCCGA CGCGCAGCAG
GCCATCTGGG ACACCTGGCC CTGCGCCTGG GCGTTCGTGC CCAAGTCCGT CTTGGCCCAC
CGGAACCGGG TCTCCGGCAT CAACCTGGCA CCCACCAACT CCTACCCGCT CGTCGACGCA
CGGCTGGAGG CCTAA
 
Protein sequence
MTVLPQGREI SRRRLLQFGT AAGFLLGTGS LAGCAGPTGL PGPSTLTLAL NRSLVSLDNK 
LNQFDAAVTV QRSVRQGLTA IGPETKPVLV LAERFEMTGP TEWTVTLREG IRYSDGSPVQ
IEDVATALKM YKQVQGSFVA GFFPEFPEVV PVDNRTFKMV SKNPVPILDS LMSMILITPA
AQNKPEELQE GVGTGPYKVT KFNRGAGTYS LARNENYWGP APEIENVEVR FLPEESSRVI
ALRSGEVDII DSITPDSREQ LAGLPGVQLA EASSLRLNQI FYNFRKPAGH PLADVRVREA
LSWAIDGESL VKDVLVDSVS AAEGVTPGSL TGYHKTGTYT YDPEKAKARL AELGVKDLTL
KIIWETGEFA SDTSVMEALV EMFGKIGVKT ELQQFEPGGN ILAWRQGKQG DWDLLGNGFS
SPTGLAITMM QGMYAGTPEK EKTRDTYQGY VIPEVQAKIQ AASSEVDATR RQELLADAQQ
AIWDTWPCAW AFVPKSVLAH RNRVSGINLA PTNSYPLVDA RLEA