Gene Arth_1817 details


Gene Information

Locus tag: Arth_1817
Symbol:
ID: 4445646
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 2035498
End bp: 2036718
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 68%
IMG OID: 639689635
Product: major facilitator transporter
Protein accession: YP_831307
Protein GI: 116670374
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00880] Multidrug resistance protein
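
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes that the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single untranslated stop codon.

```python
# Values copied from the record above; the variable names are illustrative.
start_bp = 2035498
end_bp = 2036718
gene_length_bp = 1221
protein_length_aa = 406

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinates match Gene Length
print(remainder == 0)                            # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop match Protein Length
```

Under these assumptions all three checks agree with the values listed in the record.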


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000443507
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTATTG GCCTGTTAGC CCTAGCCCTC GGCGGGTTCG GTATCGGACT CACCGAGTTC 
GTGATCATGG GCCTGCTGCC CGAGGTCGCC GCAGACTTCA GCGTCAGCGA GGCCACGGCC
GGCTGGCTGA TCTCCGGCTA TGCGCTCGCC GTCGTCGTCG GCGCCCTGCT GCTCACGGCG
GCCGTGACAC GCTTCGAACG CAAGCCGGTC CTGGCCGTCC TGCTGGTGCT GTTCATTGCC
GGCAACCTGG TCTCCGCCAT CGCCCCCGAC TACTCCATGA TGATGATCGG CCGGATTGTG
GCCGCCCTGG CCCACGGCGC GTTCTTCGGC ATTGGGGCAG TCGTGGCCGC GGACATGGTG
GCCCCCACTA AGAAAGCCGG CGCCATCGCC ATCATGTTCA CCGGACTCAC CGCCGCCAAC
GTCCTGGGCG TGCCGTTCGG CACCATGCTC GGCCAGGCCG CCGGCTGGCG CTCCACCTTC
TGGGCCATCA CGGGCATCGG CGTCCTGGCC CTCGTCGGCA TCCTGACCCT GGTCCCTAAG
ACCGGCCGCG GCGACACCGC CCCCGGGAGC CTCCGCAGCG AACTGCGGGC CTTCCGCTCC
GGCCAGGTCT GGCTGTCCAT CCTCGTCACC ATCCTCGGCT ACGGCGGCAT GTTCGGCGCC
TTCACCTACA TCGCCTACAC CCTCACCGAG GTCACCGGCT TCGCCGCCTC CACCGTGCCC
TGGCTCCTGA TCCTCTTCGG CATCGGACTG TTCATCGGCA ACACCGTGGG CGGCAAGGCG
GCGGACCGGA ACGTGGACCG CACCCTTCTG GTGGTCCTGG CTGTGCTCGT GGCGGTCCTC
GTGGGGTTCG CGCTGACCGC CGGCAACCAG CCCCTCACCA TCGCCTCCAT AGTCCTGCTC
GGCGGCTTCG GCTTCGCGAC GGTCCCCGGA CTGCAGATGC GGGTCATGAA ATACGCCCAC
AGCGCCCCCA CTTTGGCCTC CGGCGCCAAC ATCGGCGCGT TCAACGTCGG CAACGCCCTC
GGCGCCTGGC TCGGCGGCGT GACCATTACC GCCGGCCTCG GCTACACCTC ACCCATCTGG
GCCGGAGCCG GCATCACCCT CCTGGGCCTC GGCGTGATGG CCATCGCCGC AGCCGGCGCC
AAACGCTCTA AAACGGCGGC CATTATTGGC GACAACACCT CTCAAACCGT GACTGACGCC
GTCGTAGAAG CATCAATCTA G
 
Protein sequence
MPIGLLALAL GGFGIGLTEF VIMGLLPEVA ADFSVSEATA GWLISGYALA VVVGALLLTA 
AVTRFERKPV LAVLLVLFIA GNLVSAIAPD YSMMMIGRIV AALAHGAFFG IGAVVAADMV
APTKKAGAIA IMFTGLTAAN VLGVPFGTML GQAAGWRSTF WAITGIGVLA LVGILTLVPK
TGRGDTAPGS LRSELRAFRS GQVWLSILVT ILGYGGMFGA FTYIAYTLTE VTGFAASTVP
WLLILFGIGL FIGNTVGGKA ADRNVDRTLL VVLAVLVAVL VGFALTAGNQ PLTIASIVLL
GGFGFATVPG LQMRVMKYAH SAPTLASGAN IGAFNVGNAL GAWLGGVTIT AGLGYTSPIW
AGAGITLLGL GVMAIAAAGA KRSKTAAIIG DNTSQTVTDA VVEASI
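
The gene and protein sequences above can be related to each other, and to the GC content field, with a short script. The following is a minimal sketch rather than the method used to build this record. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The codon table is the amino acid assignment of NCBI translation table 11, written in TCAG codon order. Table 11 also allows alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG.

```python
BASES = "TCAG"
# Amino acid assignments of NCBI translation table 11, codons in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks that group residues in tens."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied from the record above.
first_line = "ATGCCTATTG GCCTGTTAGC CCTAGCCCTC GGCGGGTTCG GTATCGGACT CACCGAGTTC"
print(translate(first_line))
print(gc_percent(first_line))
```

Translating the first line gives the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` and `translate` allows the same comparison against the GC content and protein sequence fields.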