Gene Achl_3708 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_3708 
Symbol 
ID7295190 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011886 
Strand
Start bp4124756 
End bp4125820 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content67% 
IMG OID643592114 
Productextracellular solute-binding protein family 3 
Protein accessionYP_002489752 
Protein GI220914443 
COG category[E] Amino acid transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones116 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATTCT TCGCATCCCA CATGCCCGGC GCCCGGCGTG CGCGCACTTT GGCGGCCCTG 
CCCGCCGTCG TCCTCCTCGG AACCGCAGCG CTCGCAGGCT GCGCCGATCC CGGCGCCTCC
GCGTCCGGCG ATGCGTCCGG AGCGGCGCAG ACGACGGCGG CACGCAACGG CGTGGTCTAC
AACACTTCGC CGGACCAGCA GCGGATCCGC GGCGAGAAGG ACGCGGCCCT CGCCGCCAAG
GTCCCCGAGC TGATCGGCAA GGACGGCAAG CTGACGGTGG CCACCACGGC CGGGTCCATC
CCCCTGTCCT TCCATGCCAC GGATGACAAG ACCCCGATCG GCTCCGAGCT GGACATCGCC
CAGCTGGTGG CGGACAAGCT GGGCCTGGAA CTCGATGTGC AGGTCACATC CTGGGAGAAC
TGGCCGCTGA AGACCCAGTC CGGCGACTTC GAAGTGGTCT TCTCCAACGT GGGCGTCAAC
AAGGACCGTG TCAAGCTGTT CGACTTCGCC AGCTACCGCG CAGCGTTCAT GGGCTTCGAA
GCCAAGAATT CAGCCACCTA CGACATCAAG GGCGCGGACG ACATCTCCGG ACTGCGCGTC
TCCGTTGGCT CCGGCACCAA CCAGGAAAAG ATCCTGCTGG CCTGGAACAA GGAGCTCGAG
GAAAAGGGCA AGGCGCCGGC CGCCCTGCAG TACTACCAGT CCGAGGCGGA CACCATCCTG
GCCCTGTCCT CCGGCAGGAC AGACCTCAAC ATCGCCCCCT ACCCGTCCAC CGTGTACCGG
GAAAACACCC GCGATGACCT GAAAGTCGTG GGGAAGGTGA ACGCCGGCTG GCCGTCCGAG
ACGCTGGTGG CTGCCACCTC GCTCAAGGGC AACGGGCTGG CGCCGGTGAT CACCGAGGCC
TTGAACTCCG CCATCAAGGA CGGCTCCTAC GGCAAGGTAC TGGAACGCTG GGGCCTGTCC
GAGGAGGCGC TGCCGGAAGC CAAGACCATC ACCGAGGAGA ACTACGCGGC CACCCAGGCC
ACGGCCACCG CCACCGCATC CGCCGAAGCC TCGAAGAAGT CCTGA
 
Protein sequence
MAFFASHMPG ARRARTLAAL PAVVLLGTAA LAGCADPGAS ASGDASGAAQ TTAARNGVVY 
NTSPDQQRIR GEKDAALAAK VPELIGKDGK LTVATTAGSI PLSFHATDDK TPIGSELDIA
QLVADKLGLE LDVQVTSWEN WPLKTQSGDF EVVFSNVGVN KDRVKLFDFA SYRAAFMGFE
AKNSATYDIK GADDISGLRV SVGSGTNQEK ILLAWNKELE EKGKAPAALQ YYQSEADTIL
ALSSGRTDLN IAPYPSTVYR ENTRDDLKVV GKVNAGWPSE TLVAATSLKG NGLAPVITEA
LNSAIKDGSY GKVLERWGLS EEALPEAKTI TEENYAATQA TATATASAEA SKKS