Gene Achl_3798 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_3798 
Symbol 
ID7295286 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011886 
Strand
Start bp4238774 
End bp4240447 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content64% 
IMG OID643592208 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002489840 
Protein GI220914531 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACAAC CCCGATTCTT CCGGGCTGCC CGGATCACGG CTGCGGGCCT GGCCGTGGGA 
GCGATGCTGC TGACCGGCTG CGCGGCCAAC ACCAACAAGG CAAACAGCAC CGACGCCGGC
TCCAACGCGA GTGCGCTGCT CACCATCCCG CGTGAGGACA TGGGCACGTT CGTGCAGAAC
TTCAACCCCT TCGCCCCCAC AGTGAACCCC ATGGTCCAGC AGTCCATCTA CGAATCGCTG
CTGATCTTCA ACCCGGCCAA CGGGGACACG GTGCCGTGGC TCGCCACCGA GTGGAAAGCC
GCGGACGACG GCAAGTCCGT CACGTTCACG CTGCGCGACG GCGTGAAGTG GTCCGACGGG
CAGCCGCTGG TGGCAGACGA CGTGGCCTAC ACGTTCGAGC TGCAGAAGAA GATCAAGGGC
GGGTTCGAAT ACCTCGACGG CGTCACCGCC GAAGGCAACA AGGTCACGTT CAACTTCAAC
AAGCCGTGGT CCCCGGCTCT TTACGACGTC GGCCAGCTCA CCATCCTGCC CAAGCACATC
TGGTCCACGC TCGCCGACCC CGAAAAGGAA GCGAACGCCA AGCCCGTGGG AACCGGTCCC
TACACCGAGG TGGACAGCTT CCAGGCGCAG TCCTTCGTGC TGAAGAAGAA CCCCAACTAC
TGGCAGCCGG AGAAGCAGAA GATCGCCGGC ATCAAGATGC TCGCCTTCGC CGGCAACGAC
GGCGCCAACC TCGCAGCGGC CAACGGGGAT GTGGACTGGG CACCGCAGTA CATCCCGAAC
ATTGAAAAGA CGTTCGTCTC CAAGGACAAG GACCACCGCC AGTACTGGTT CCCGCCCACG
GGCGCCATGA TCAACTGGCA GCTCAACACC ACCAAGGCGC CGTTCAATGA TGTTGACGTC
CGCAAGGCAC TGAGCATGGC CGTGGACCGG GACCAGGTCA CCAAGATCGG CATGAGCGGC
TACGCCCAGC CCGCCGACTG CACGGGCCTG TCCGGTAACT ACGAGACGTG GAAGAGCAGC
GAGGTTAAGG ACAACTGCAC CTGGACCAAC CACGACGTGC AGAAAGCCAA CGAGCTGCTG
GACAAGGCCG GCTACGCCAA GGGCGCGGAC GGCAAGCGCA CGCTCAAGGA CGGCAAGCCG
TTCGAGTTCA AGATCTCCGT GGGAGCCTCC TCCTCCGACT GGCTGTCCGT AGCCAACGTG
ATCGCCCAGA ACCTTGCAGA GGTGGGGGTG ACCGCCAAGG TGGATTCCCC TGACTGGGCC
GCCGTGGTGG CCGGCTACGA AACGGGCACC TTCGATTCCG GCATCGTCTG GAGCGCCAAC
GATCCCAGCC CGTACAAGTA CTTCAACACC GCCATGGGCT CGGCAACCGT GAAGCCGGTG
GGCACCAAGA CGTTCGACAA CTACCACCGC TTCGGCGACG CGAAGGCCGA CGCCCTGTTG
GCCCAGTTCG CCGCGGAATC CGACGAGTCC AAGCAGAAGG ACCTCGCCAA CAAGCTCCAG
GAAGAGTACA GCGACGCCGC GCCGCTGGTG CCGCTCTTTT CCGGCCCGGA GTGGGGCGCC
TTCAACGACA CCCGCTTCAC CGGCTGGCCC ACCCAGGACA ACCCCTACGC CACCCTCTCG
GTCCGCGCAC CCACCACGGT GCTGGTGCTG ACCTCGCTGG AACCGCGCAA GTAA
 
Protein sequence
MTQPRFFRAA RITAAGLAVG AMLLTGCAAN TNKANSTDAG SNASALLTIP REDMGTFVQN 
FNPFAPTVNP MVQQSIYESL LIFNPANGDT VPWLATEWKA ADDGKSVTFT LRDGVKWSDG
QPLVADDVAY TFELQKKIKG GFEYLDGVTA EGNKVTFNFN KPWSPALYDV GQLTILPKHI
WSTLADPEKE ANAKPVGTGP YTEVDSFQAQ SFVLKKNPNY WQPEKQKIAG IKMLAFAGND
GANLAAANGD VDWAPQYIPN IEKTFVSKDK DHRQYWFPPT GAMINWQLNT TKAPFNDVDV
RKALSMAVDR DQVTKIGMSG YAQPADCTGL SGNYETWKSS EVKDNCTWTN HDVQKANELL
DKAGYAKGAD GKRTLKDGKP FEFKISVGAS SSDWLSVANV IAQNLAEVGV TAKVDSPDWA
AVVAGYETGT FDSGIVWSAN DPSPYKYFNT AMGSATVKPV GTKTFDNYHR FGDAKADALL
AQFAAESDES KQKDLANKLQ EEYSDAAPLV PLFSGPEWGA FNDTRFTGWP TQDNPYATLS
VRAPTTVLVL TSLEPRK