Gene Arth_1959 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1959 
Symbol 
ID4445508 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2209094 
End bp2210560 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content64% 
IMG OID639689769 
Productmajor facilitator transporter 
Protein accessionYP_831441 
Protein GI116670508 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.789773 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTGCAG AGGGAGTCGG CTTCCGCTCG AAGCGCGGTC CTATACTAAT TTCCCTGATG 
CTCTCTACGG GCTTGGTGGC GATCGATTCG ACGATCGTCG CGACCGCGGT GCCGTCGATC
GTACATGACA TTGGCGGCTT TGCGTCCTTT CCATGGCTCT TTTCTGCCTA CCTGTTGGCG
CAGGCCGTGT CCGTCCCCGT CTACGCGAAG CTCTCCGACG TCGTCGGCCG CAAACCGATC
ATCCTGATCG GCATCGGGCT ATTCCTATTC GGTTCGATCC TTTGCGGAGT GGCATGGAGC
ATGCCTGCCC TCATCGCGTT CCGTGTGCTG CAGGGCCTGG GCGCGGGAGC AGTCCAGCCG
ATGGCGATCA CGATCGCGGG CGATATCTAT TCGCTGACTG AGCGGGCGAA GGTGCAAGGC
TACCTTGCGA GCGTATGGGC AGTGTCCTCG GTCGTTGGCC CGACGCTCGG GGGCGTGTTT
TCAGCAATGG GCATGTGGCG TGGAATCTTC CTCGTGAACA TCCCGCTGTG CCTCATCGCA
GGGTGGATGC TGAGCCGGAC GTTCCACGAG AAGGTCGAAC GCGCCAGGCA CCGAATCGAC
TATCTGGGGG CGGGCCTGCT GACTGGCTCG CTGACGCTGA TCATTCTTGG GGCCCTCGAA
GGCGGCCAGG CGTGGGGATG GACCTCCGCC ATCAGCGTTG CGGTGTTCGC CGGCGGAGCG
CTCTTGTTCG CCGTGTTCAT CCTCGTCGAG CGCAGGGCCG CCGAGCCAGT CCTGCCGCCA
TGGGTCGTCT CCCGGCGGCT GCTGGCGACG ACGGCACTGA TCTCCTTTGG CGTTGGAGCG
GTCATGCTTG GCCTCACCTC TTACGTTCCC ACGTTCCTCG AAGGAGCCCT CTCGACCTCC
CCGATCCTGG CCGGGCTCGC GCTGGCAGCA TTGACGATCG GCTGGCCGAT CAGTGCGTCA
CAATCGGGCC GGTTCTATCT TCGGTTGGGG TTCCGGAAGA CCGCGATGAT CGGCATTACT
ATCACCGTCA TCGGCACAGC GGTACTTGCG CTCACCGCCT CCGCACCCAG CGTTCTCCTG
GCGGCGGCGA GTTGCTTCAT CGTAGGGCTA GGGCTCGGAC TGGTTGCCAC CCCGAGCCTT
ATCGCCGCCC AGTCCAGCGT CGACTGGAAC GAGCGCGGAG TTGTCACCGG TACCAATCTT
TTCGCGCGGT CGATCGGTAG TTCCCTCGGT GTCGCGGTTT TCGGGGCGGT TGCGAACGCA
ATCTACGCAG GCACTCCGGG CGGCAATACG GACCCGCACA CAGTCGTCGT GGCCTCCGGG
GCTGTTTTCA TAGCCGTACT GGCCGTCGCC GTACTCACCG TCGTCGCCGT CATCGCGATG
CCCGCTACAG ACAACGAGAC CCCCACCCGC TCGACTGCTG ACCCTGTGGT GGCCGCCGCG
CACAGTACGG CATCCTCACC GGATTAG
 
Protein sequence
MPAEGVGFRS KRGPILISLM LSTGLVAIDS TIVATAVPSI VHDIGGFASF PWLFSAYLLA 
QAVSVPVYAK LSDVVGRKPI ILIGIGLFLF GSILCGVAWS MPALIAFRVL QGLGAGAVQP
MAITIAGDIY SLTERAKVQG YLASVWAVSS VVGPTLGGVF SAMGMWRGIF LVNIPLCLIA
GWMLSRTFHE KVERARHRID YLGAGLLTGS LTLIILGALE GGQAWGWTSA ISVAVFAGGA
LLFAVFILVE RRAAEPVLPP WVVSRRLLAT TALISFGVGA VMLGLTSYVP TFLEGALSTS
PILAGLALAA LTIGWPISAS QSGRFYLRLG FRKTAMIGIT ITVIGTAVLA LTASAPSVLL
AAASCFIVGL GLGLVATPSL IAAQSSVDWN ERGVVTGTNL FARSIGSSLG VAVFGAVANA
IYAGTPGGNT DPHTVVVASG AVFIAVLAVA VLTVVAVIAM PATDNETPTR STADPVVAAA
HSTASSPD