Gene Arth_2289 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2289 
Symbol 
ID4445332 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2574632 
End bp2576008 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content65% 
IMG OID639690098 
Productmajor facilitator transporter 
Protein accessionYP_831769 
Protein GI116670836 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.21518 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAATCTG CACCCGCAAC CCGAATCCCG CCACATGACG CCCAGGGCAC GACACCGCGC 
AAAACCCCGG GCAAGGCAGC CTTGGCGTCA TTCCTCGGCA GCACTCTGGA GTACTACGAC
TTCTTCATCT ACGGAACCGC CGCCGCCCTC GTGTTTCCGC ACCTTTTCTT CCCTTCCGCC
GACCCCGCCA TCGGCCTGAT CGGCGCCTTC GCCACCTTCG GCGTCGCCTA CGTAGCCCGG
CCGGTGGGCG GGCTGGTGAT GGGCCACTTC GGCGACAAGC TGGGCCGGAA GAAGATCCTG
CTGCTGACCT TGGGCATCAT GGGCCTGGCA TCGCTGGGCA TCGGATTCCT GCCCACCTAC
GAACAGGTCG GCGTCTGGGC ACCGGTCCTC CTGGTGGCGG GGCGGCTGGC ACAGGGCTTC
TCCGCCGGTG CGGAGTCGGC CGGCGCTTCC ACCCTCACCC TGGAACACTC GCCCGAAGGT
AAACGCGGCT TCTTCACCAG CTTCGTGATG ACCGGCTACG CCTCCGGCAT GGTGCTGGCC
ACCCTCGTGT TCATTCCGGT CACGGCCCTG CCGCAGGAAG CAATGATGAG CTGGGGCTGG
CGCATCCCGT TCTGGCTCTC CATCGTGGTC CTGGCCATCG CCTACTGGGT GCGGACGCAC
CTGGACGAAA CTCCGGTCTT CGAAGAGGCC CAGGAACACC GGAAGGTCGC CCCGATGCCG
CTCAAGGAAG TGCTTAAGTT CCAAGGCCCT GATGTGATGC GCGTTGTGGG AATGTCGATC
ATGTCCGTCA TGCAGACCAT CTTCACCGTT TTCGGCCTGG CGTATGCCAC CTCCACGGCA
GGCTTTGACC GGGCCTCCAT CCTGACCGTC AACGCCGTCG CCATCGGGCT GTCCATGTTT
GCCATGCCGG TGGCGGCCAG ACTTTCGGAC CGGATCGGCC GCCGGCCCGT GCTGCTTACG
GCCGCGTTCG GGTGCTCAGC CACGATCTTC CTGTACTTCC TTGCACTGTC CTCCGGCAAC
ATCGTGCTGG TCTTCCTGGC GGCTTTCCTG AACATGACGC TGCTGTACTC GGGCTTCAAC
GGCATCTGGC CCGCATTCTT CGCGGAACAG TTCGCCGCAC CGGTCCGCTA CACAGGCATG
GCGATGGGAA ACCAGCTGGG ACTCGTCCTG GCCGGCTTCG CCCCGATGAT TGCCGGCCTG
CTCCTGACCC CGGGCGTCAC CGGCTGGGTT CCCGTGGCTG TGTTCGGCAC GGTGTGCATG
CTCATAGCTG CAGCCTCGGT GTACTACTCC CGTGAGACGT TCAAAACGCC GATCGGGGAG
CTGGGTGCTC CGTACCTGGC CGGTACCGCC GCCCGGCGGG ATCTGCAAAA CCATTGA
 
Protein sequence
MQSAPATRIP PHDAQGTTPR KTPGKAALAS FLGSTLEYYD FFIYGTAAAL VFPHLFFPSA 
DPAIGLIGAF ATFGVAYVAR PVGGLVMGHF GDKLGRKKIL LLTLGIMGLA SLGIGFLPTY
EQVGVWAPVL LVAGRLAQGF SAGAESAGAS TLTLEHSPEG KRGFFTSFVM TGYASGMVLA
TLVFIPVTAL PQEAMMSWGW RIPFWLSIVV LAIAYWVRTH LDETPVFEEA QEHRKVAPMP
LKEVLKFQGP DVMRVVGMSI MSVMQTIFTV FGLAYATSTA GFDRASILTV NAVAIGLSMF
AMPVAARLSD RIGRRPVLLT AAFGCSATIF LYFLALSSGN IVLVFLAAFL NMTLLYSGFN
GIWPAFFAEQ FAAPVRYTGM AMGNQLGLVL AGFAPMIAGL LLTPGVTGWV PVAVFGTVCM
LIAAASVYYS RETFKTPIGE LGAPYLAGTA ARRDLQNH