Gene Arth_1788 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1788 
Symbol 
ID4445687 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2003000 
End bp2004301 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content59% 
IMG OID639689606 
Productmajor facilitator transporter 
Protein accessionYP_831278 
Protein GI116670345 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.158886 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGATTG TTGAGCAGAA CCCAGCCCAA CCCCAGACAG CCCCCGGACG CATCCGGCTT 
CAACGTAAAT CCTTGCTCGC CACGGGCGTG GGTAACCTGC TCGAATGGTT CGACTGGACC
ATTTACACGG TTGCGTCCGT GTACCTAGCG GGCAGTCTCT TTAACTCCGG CAACCCGATG
TCGTCACTGC TCAGCACACT GGCTGTCTTT GCCGTCGGCT TTCTAATGCG GCCCATCGGT
GGACTCGTTT TTGGTCCCCT GGCGGACAAA TGGGGGCGCC GTAAAGTGCT GCTCACCACC
ATGTTCCTCA TGGCCGGTGC CAGTTTGGGA ATTGCCCTGA TTCCGTCCTA CGCGTCGATC
GGCAGTTGGG CCTCCTTCCT GCTGCTAGTG GCACGGCTGG TCCAGGGCTT TGCCCACGGC
GGAGAGGCAA CGACGTCGTA CGCATATATC GCGGAGATTG CCCCGCCCAA GCGACGCGGC
CTGTGGTCCA GCACAGTCTT CATAGCCGTA GGCTCCGGTT CCCTACTCGC CACCTTCTTC
ATGGCACTCC TTACTGGCGT CCTCAGCAAG ACTGAAATGA TGGAGTGGGG ATGGCGGTTA
CCCTTCGCCG CTGGTGCCTT GCTCGCTGTG GCTGCATTGT GGTTGCGCCG GGGCATGATG
GAAAGCGAGC ACGTGGCCAC TGCCCCCGGC GGCAGCGCGG TGACGCCATG GAGTCCCCGT
CAAGTCTTCC AGGCCGGGGT GAAGCTGTTC CTGTACGAGG CAGGCTCCAC TTTGACCTAT
TACACCTGGG TGACCTCGGC GGCGATCTAT GCCATTGGCG TCAAGGGGAT GGATCCGGGT
CAGGCTTTCT TCATGAGCGT GATCGCACAA GTGGTGTACA TTGCGTTCCT GCCGGTTTCG
GGATGGATCT CGGACCACTG GGGCCGCAAG GCAACGACCC TGATCTCCCT GGTAGGTATT
GCAGCCACCG TTTTCCCCCT ATGGGGTTTG ATGTCGAGTG AGCCCTGGAC GTTGCTGGTG
GCTCAGACCG TCGGGCTGTT GCTGGTTGCG TTCATCACAG GGTCTAAACC AGCCGCCATC
TCCGAGCAGA TCCCGACACG ATACCGCACC CGCATCTTCG GAGTCTCAAT CTCACTGGGC
GTTGCAGTCT GCGGCGGAAC GGCGTCCTAC CTGAGTACAT GGTTGTACTC CATCGGATCC
GGTTGGATAT TCAACGTCTA CGTCATCGCT GTCGCAGCAG TATCCAGTGC TGTCGTTCTT
ACTTGGAAAA ACAACAAAGG CGTCCCATTG GATCAGATTT AG
 
Protein sequence
MTIVEQNPAQ PQTAPGRIRL QRKSLLATGV GNLLEWFDWT IYTVASVYLA GSLFNSGNPM 
SSLLSTLAVF AVGFLMRPIG GLVFGPLADK WGRRKVLLTT MFLMAGASLG IALIPSYASI
GSWASFLLLV ARLVQGFAHG GEATTSYAYI AEIAPPKRRG LWSSTVFIAV GSGSLLATFF
MALLTGVLSK TEMMEWGWRL PFAAGALLAV AALWLRRGMM ESEHVATAPG GSAVTPWSPR
QVFQAGVKLF LYEAGSTLTY YTWVTSAAIY AIGVKGMDPG QAFFMSVIAQ VVYIAFLPVS
GWISDHWGRK ATTLISLVGI AATVFPLWGL MSSEPWTLLV AQTVGLLLVA FITGSKPAAI
SEQIPTRYRT RIFGVSISLG VAVCGGTASY LSTWLYSIGS GWIFNVYVIA VAAVSSAVVL
TWKNNKGVPL DQI