Gene Arth_3164 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3164 
Symbol 
ID4444224 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3552774 
End bp3553991 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content71% 
IMG OID639690990 
Productmajor facilitator transporter 
Protein accessionYP_832642 
Protein GI116671709 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.761338 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCTCCG CACAAGCCAC CAAACCCTTC AGCCTCCGCA GCATTGCCGT CCCCGCGTTC 
GGACCGGCGC TGCTGTTCTG CATCGGCGAA GGGGCGGTGC TTCCGGTAGT GGCGCTTTCC
GCGCGCGACC TCGGCGCGTC CGTGGCGGTG GCCGCGCTGA TCGTCACTCT GATCGGCCTG
GGCTCATGGT TCTTCAACCT GCCGGCCTCC CTGATCACCC TCAAATTCGG CGAACGCTGG
TCCATCGTCG GGGCCGCCGC GGCCGGTGCC CTGGCGCTGG CGGCGGCAGC GCTGTCCTCG
GTGATTCCCG ACGGACTGTG GCTGCTCGCG GCGGCGATGG TCGTCGTCGG GATGGCCGCC
AGCGTCTTCA GCCTGGCCCG GCAGAAATAC CTGACCGAGG CGGTGCCCGT GGCCTTCCGC
GCCCGCGCCC TGTCCACGCT GGGCGGCGTG AGCAGGATCG GCATCTTCAT CGGCCCGTTC
GTGGGCGCCG GCGTCATGCA GTTTGCCGGG ATCAGCGGCG CGTACTGGGT GGGCGTTGCG
GCCATGGCAG CGGCCGCCAT CCTGTCCGTC ACCATCCCGG ACCTGCCGCC CGCGCCGGGA
TCCGCCGACG GGAACCGCGG ACCGGAGCCC ACCATGCGGG GCATTGCCGT GTCCCACGCC
GGCGTGTTCC TTACCGTGGG CGCCGGGATC CTGCTGCTCA GCGCCCTGCG CGCCTCCCGC
CAGGTGGTCA TCCCGCTGTG GGCGGACAAC CTGGGCATGG ACGCCACGCA CGCCTCGCTG
ATCTACGGAC TCTCCGGGGC AATCGACATG CTGGTGTTCT ACCCGGCCGG CAAGCTCATG
GACCGCAAGG GCCGGCAATG GGTGGCCATC CCGTCCACGG TAATCATGGG CACCGCCCTG
ATGCTCATCC CGATCACGGG CACCTTCGTG GGCCTGCTGC TGGCGGCGCT GCTGATCGGG
TTCGGCAACG GCATCAGCTC CGGCCTGATC ATGACCCTCG GCGCGGACTT CTCCCCGGAC
CGCGGCCGCG GCCAGTTCCT GGGACTCTGG CGGTTCATTG CCGACGCCGG CGCCACGGGC
GGCCCGGTGC TCCTCTCCGG CGTCACCGCC GCCGTCTCAC TGGGGGCCGG CGTGTGGGCC
ACCGGCGTGC TGGGGTTCGC CGCCGCCGTC GTCTTCGCCA TCACGATTCC GCGGCTCAAA
CACCGCCGGA ACTACTAG
 
Protein sequence
MTSAQATKPF SLRSIAVPAF GPALLFCIGE GAVLPVVALS ARDLGASVAV AALIVTLIGL 
GSWFFNLPAS LITLKFGERW SIVGAAAAGA LALAAAALSS VIPDGLWLLA AAMVVVGMAA
SVFSLARQKY LTEAVPVAFR ARALSTLGGV SRIGIFIGPF VGAGVMQFAG ISGAYWVGVA
AMAAAAILSV TIPDLPPAPG SADGNRGPEP TMRGIAVSHA GVFLTVGAGI LLLSALRASR
QVVIPLWADN LGMDATHASL IYGLSGAIDM LVFYPAGKLM DRKGRQWVAI PSTVIMGTAL
MLIPITGTFV GLLLAALLIG FGNGISSGLI MTLGADFSPD RGRGQFLGLW RFIADAGATG
GPVLLSGVTA AVSLGAGVWA TGVLGFAAAV VFAITIPRLK HRRNY