Gene Arth_1989 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1989 
Symbol 
ID4445468 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2243691 
End bp2244869 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content68% 
IMG OID639689798 
Productmajor facilitator transporter 
Protein accessionYP_831470 
Protein GI116670537 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.143925 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGATCGGTG AACTGGGCCG CCGTTCAGCG CTGGCACTTC TCCTTCACTC GACTCTGATC 
CAGGCCGTGA CCTTCCTGGT CCGGCCGGCA ACCACTTACC GGGCGCTGGA ACTGGACGTG
CCGGGATATG CCCTGGGCCT CCTCGCGGCC AGCTACGCCG TTTTTCCCCT GCTCCTGGCA
GTCCCAACCG GCGCCCTCGT GGACCGGTTA GGTGAGAGGC GGCTCATGGT GACCGGGTCC
GCCGTCGTAC TTGGTTGTTC CCTTTTCCTG CTGTTTTGGG GGACCTCAGT CCCTGCGTTG
GTTGGCGGTA CGGCGCTGCT TGGCGCAGGA CAGCTGGCGT GTGTCGTGGG GCAGCAGGCT
GTCGTCGCAA ACAATGCCGT GGCGTCAGGG CTTGACTCCG CGTTCGGATA CCTGACCTTC
GCGGCGTCGT TGGGCCAGGC GCTGGGCCCG CTGGCAATCT CCGTGGTTGG TGGAGCCTCC
GTCCGTCCGG ACACGCAGGC AATCTTCTTC CTCTCGGCCG GCATGAGCCT GGTTCTCTTC
CTCACCACCT TCCTCATCTC GACGAAGGCC ACCGGAAGGA AGTCCGGTGC CGCAGCAACA
GACGGCCCTA AAGGAAGTGT TGCATCGCTG CTCAGGACGC CGGGGCTGGT CCGGGCACTG
GCTACCAGCG CCACGGTGCT GGCCGTGGTG GACCTGACCG TGGTCTACCT GCCGGCACTC
GGCACGGAGC GGGGGTTCAG TGCGGCCGCA GTGGGCCTCA TGCTCGCGGT GCGGGCCGGG
TTCTCGATGG TTTCGAGGCT GGGGCTCGGG CGCCTGTCCC GGAGGTTCGG CCGGGGGCGG
CTCATGGCTT CAAGCCTCGC GCTTTCCACC GTCGCACTGG CCGTCGCCGC GATTCCGATG
CCGCAGTGGC TCCTCTTCAT GGTGATGGCC GGGCTGGGCC TCGGGCTGGG CATCGGCCAG
CCGCTGACCA TGTCCTGGCT TTCGGCGCAG GCCCCGGACG GCCAGCGCGG AAAGGCCCTG
GCCCTGCGGC TTGCCGGGAA CAGGGTGGGC CAGGTGGTCC TGCCCAGCGC CATCGGGGTT
GTGGCGGCTG GGCTCGGGGC GGCCGGGGTG TTCCTGGCGT CAGCGGTCGT TGTGGGCGGA
ACGCTTCTGC TGGTCCGCGG CGTGCGCCTG GACGACTAA
 
Protein sequence
MIGELGRRSA LALLLHSTLI QAVTFLVRPA TTYRALELDV PGYALGLLAA SYAVFPLLLA 
VPTGALVDRL GERRLMVTGS AVVLGCSLFL LFWGTSVPAL VGGTALLGAG QLACVVGQQA
VVANNAVASG LDSAFGYLTF AASLGQALGP LAISVVGGAS VRPDTQAIFF LSAGMSLVLF
LTTFLISTKA TGRKSGAAAT DGPKGSVASL LRTPGLVRAL ATSATVLAVV DLTVVYLPAL
GTERGFSAAA VGLMLAVRAG FSMVSRLGLG RLSRRFGRGR LMASSLALST VALAVAAIPM
PQWLLFMVMA GLGLGLGIGQ PLTMSWLSAQ APDGQRGKAL ALRLAGNRVG QVVLPSAIGV
VAAGLGAAGV FLASAVVVGG TLLLVRGVRL DD