Gene Achl_3304 details

Gene Information

Locus tag: Achl_3304
Symbol:
ID: 7294785
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter chlorophenolicus A6
Kingdom: Bacteria
Replicon accession: NC_011886
Strand:
Start bp: 3662296
End bp: 3663666
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 64%
IMG OID: 643591714
Product: General substrate transporter
Protein accession: YP_002489353
Protein GI: 220914044
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
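
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes (this is not stated in the record) that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(3662296, 3663666)
print(length)                     # 1371
print(protein_length_aa(length))  # 456
```

Under those assumptions both results agree with the Gene Length and Protein Length fields of the record.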


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 65
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCACG AAACAACACC GCCCCAGGCG GGAGTCATCG CCAAGGATGC CGATGGTTCC 
GCCCTGGCAG GCGGCGGCGT TAGTCCCCTA ACGCCCAGCA AGGACGTACG ACGCCGGGTG
GTTACCGCAA GCTTTATCGG CAACTTCGTC GAATGGTTCG ATTACGCCGT CTACGGCTAC
CTCGCCGCCG TCATCTCATC GGTCTTCTTC CCGGAAGCGG AACGGCAGAC AGCGCTGCTG
GCCACCTTTG GCGTCTTCGC AGTCTCGTTC TTCGTCCGGC CGCTGGGCGG ATTTGTCTGG
GGCCACATCG GCGACAAGCT CGGCCGGCGG AAGGCCCTGT CCTTGTCCAT CGTCATCATG
TCCGTCTCAA CGTTCTGCAT CGCGCTGATT CCCGGCTACG CATCCATCGG GCTGATGGCT
CCGGTCCTGC TCCTCCTCGT CCGCATCGTC CAGGGCTTCT CAGCAGCCGG CGAGTATGCG
GGCGCCTCGG CCTTCCTGGT GGAGTACGCC CCGGCGAACC GGCGCGGCCT CTATGCAGCA
GTGGTTCCGG CCAGCACCGC AGCCGGCCTG CTCCTGGGCT CCCTTATCGC AGCGCTCCTG
AGCTCGGTAC TCACCGCGGA CCAGCTGCAC GAGTGGGGAT GGCGGCTGCC GTTCCTGCTG
GCTGCCCCCA TGGGCCTCAT CGGACGCTAC ATCCGCACCA AACTCGAGGA CACCCCGGCC
TTCCGGGAAT TGGCTGCGAA GGAAGGCACC GAAGAGAAGG CCCCCGCGCT GGCCATGTTC
AAGACCTACC GGAAGCAGCT CGTCATCGCC TGCGGCGCGG TGATGCTCAA CGCCGTTGGC
TTCTACGTCA TCCTCAGCTA CATGCCCACC TACCTTTCCG AGGAACTGGG CTTCGGCCCC
ACCGAGTCCT TCCTGGCCAC CACCATTGCC CTGGCCAGCT ACATCGGGTT CATCTTCCTT
ACCGGCATGG CCTCGGACGT CTTTGGCCGC AAGCGGATGC TCATCACGGC ATCCATCCTT
TTCATGGTCC TTACCGTTCC GGCGTTCATG CTGCTGGAAA CCGGTGATTT CCTGGTCATC
GTCCTGGTCC AGATCCTCCT GGGCGGCATG CTCACACTGA ACGACGGAAC ACTGCCGAGC
TTCTTGGCCG AGCTGTTCCC CACCAAGGTC CGCTACAGCG GGTTCGCCGT CAGCTTCAAC
CTCTCCAACG CCCTCTTCGG CGGGACCGCG CCGTTCATGG CCACCCTGCT GATCGCCATG
ACCCAGAGCA AGATCGCCCC GGGCTGGTAC CTGGTGGCGG CTTCAGCGGT GTCCCTGGCG
GCAGTCCTGT TCGCCACTGA GACGTCGCGA AAGCCCCTGA AGCACCTCTA A
 
Protein sequence
MSHETTPPQA GVIAKDADGS ALAGGGVSPL TPSKDVRRRV VTASFIGNFV EWFDYAVYGY 
LAAVISSVFF PEAERQTALL ATFGVFAVSF FVRPLGGFVW GHIGDKLGRR KALSLSIVIM
SVSTFCIALI PGYASIGLMA PVLLLLVRIV QGFSAAGEYA GASAFLVEYA PANRRGLYAA
VVPASTAAGL LLGSLIAALL SSVLTADQLH EWGWRLPFLL AAPMGLIGRY IRTKLEDTPA
FRELAAKEGT EEKAPALAMF KTYRKQLVIA CGAVMLNAVG FYVILSYMPT YLSEELGFGP
TESFLATTIA LASYIGFIFL TGMASDVFGR KRMLITASIL FMVLTVPAFM LLETGDFLVI
VLVQILLGGM LTLNDGTLPS FLAELFPTKV RYSGFAVSFN LSNALFGGTA PFMATLLIAM
TQSKIAPGWY LVAASAVSLA AVLFATETSR KPLKHL
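
The sequences above are printed in space-separated groups of ten. As a rough sketch, the following standard-library Python shows one way to strip that grouping, compute GC content, and translate the gene sequence for comparison with the protein sequence. All function names are hypothetical. The codon table is the standard one written in NCBI's TCAG order; translation table 11 uses the same codon-to-amino-acid assignments and differs only in which codons may act as start codons, which this sketch does not model.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first, second, third base over TCAG).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used to group the printed sequence."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three groups of the gene sequence above.
fragment = clean_sequence("ATGAGTCACG AAACAACACC GCCCCAGGCG")
print(translate(fragment))  # MSHETTPPQA
```

Applied to the full gene sequence, `gc_percent` and `translate` could be compared against the GC content field and the protein sequence listed in the record.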