Gene Achl_1030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_1030 
Symbol 
ID7292472 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011886 
Strand
Start bp1129819 
End bp1131078 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content68% 
IMG OID643589435 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002487113 
Protein GI220911804 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAGA TGTTTCGCGC CCTCGAGAAC CGCAACTACC GGATCTGGGC CGGCGGTGCC 
CTGGTCTCCA ACGTGGGCAC CTGGATGCAG CGCATCGCCC AGGACTGGCT GGTACTCACC
GTCCTGACCA ACCATGACGG CGCCGCCGTC GGCATCACCA CCGGCCTGCA ATTCCTGCCG
ATGCTGCTGC TGGGCCCTTA CGGCGGAGTC CTGGCAGACC GCTACCGCAA ACGCGTCATC
CTGCTGTGGA CCCAGCTGGC CATGGGCTTC ACCGGCCTGG CCATCGGCCT GCTGGTGGTC
ACCGGCACCG CCCAGCTGTG GCATGCCTAC GTCGCCGCCT TGTGCCTGGG CATTGCCAGC
GCCATTGACG CCCCGGCGCG GCAGTCCTTT GTCTCGGAAC TGGTGGGCCA GGACAACATC
TCCAATGCCG TGGCCCTGAA CTCGGCATCC TTCAACACTG CCCGCCTCAC GGGCCCGGCC
GTCGCCGGCG TCCTGATCGC CTGGGTGGGC ACCGGGCCGG TGTTCCTGCT CAACGCCGCC
AGCTACGCCG CAGTGATCTG GTCCCTGTTC CTGATCCGCA CCTCCGAGCT TGTGCCCACC
GTGCGGGCAG AGCGCGGCAA ACACCAGGTG ACGGAGGGCA TGCGGTACGT GAAGCAGCGG
CCGGACCTCG TCCTGATCAT GGTCCTGGTG GGCATCCTCG GAGCCTTCGG CATGAATTTC
CCCATCACGA ACGCCCTCAT GGCCACCACC GAATTCCACG CCGGGCCGGG CGAGTTCGGC
CTGCTGGGCT CCATCATGGC CGTTGGCACC CTGGCCGGCG CACTGCTGGC CGCCCGGCGC
GCGCGGCCGC GACTGCGGTT CCTGCTGGGC GGCGCCCTTG GCTTGGGGAT CTTCACGCTG
GTAGCCAGCG TGGCGCCGTC GTTCTGGCTG TATACCGCAG TGCTGATTCC GGTGGGCCTG
GCATCCATCA CGTTCCTGAA CAGCTGCAAC ACCAGCATCC AGCTGTCCGT GGAGCCGCAG
TTCCGCGGCC GGGTACTTGC CCTCTACCTG GCCATCCTGC AGGGCGGCAC AGCCGTGGGA
TCGCCGCTGA TCGGGTGGGT GGGCAGCGAA TTCGGCGCCC GCTGGTCCGT GGCGGTGGGT
GGCCTGGTGG TCCTGCTGAC CGGGCTGGCC GCCGTGATTG TGGTCAGCCG CCGGAGCAAG
CTCAGCCTCC GCAGGGCGGT CCGTATCGCG TTCAGCGGCA AGCGCGAAGC CGCCGCCTGA
 
Protein sequence
MSQMFRALEN RNYRIWAGGA LVSNVGTWMQ RIAQDWLVLT VLTNHDGAAV GITTGLQFLP 
MLLLGPYGGV LADRYRKRVI LLWTQLAMGF TGLAIGLLVV TGTAQLWHAY VAALCLGIAS
AIDAPARQSF VSELVGQDNI SNAVALNSAS FNTARLTGPA VAGVLIAWVG TGPVFLLNAA
SYAAVIWSLF LIRTSELVPT VRAERGKHQV TEGMRYVKQR PDLVLIMVLV GILGAFGMNF
PITNALMATT EFHAGPGEFG LLGSIMAVGT LAGALLAARR ARPRLRFLLG GALGLGIFTL
VASVAPSFWL YTAVLIPVGL ASITFLNSCN TSIQLSVEPQ FRGRVLALYL AILQGGTAVG
SPLIGWVGSE FGARWSVAVG GLVVLLTGLA AVIVVSRRSK LSLRRAVRIA FSGKREAAA