Gene Arth_3829 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3829 
Symbol 
ID4447668 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4314623 
End bp4315912 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content68% 
IMG OID639691653 
Productmajor facilitator transporter 
Protein accessionYP_833304 
Protein GI116672371 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCCCG GAAACACCTC CCCGAAACAT CAGCCCGGCA CTAGGGGTGC CATAACCCTG 
GGGCTGCGGC AGAATCTTGC CCAGTTCATG ATCCTGGTGG CCGTCAATGC CCTGGTTGGC
GGCACGCTGG GCCAGGAGAG GACCGTCCTG CCCCTCTTGG CCGGTGAGGT CTTCAAGCTG
GACCTGTACA CCAGCGCCTT GACGTACATC CTGGCGTTCG GGGTGGCCAA GGCGGCCACC
AACTACTTCG CCGGAACCCT GTCGGACCGT TACGGGCGCA AGCCGGTCCT GGTGGCGGGG
TGGCTGGTCG CGCTGCCCGT CCCGCTGATG CTGATCTTCG GGCCGTCATG GGGGTGGATC
GTCGCGGCCA ACATCGTCCT CGGCATCAGC CAGGGGCTGA CCTGGTCCAC GACGGTCATG
ATGAAGATGG ACCTCGTGGG TCCTTCACGC CGGGGCCTGG CCATGGGCCT CAACGAGGCC
GCCGGCTACC TCGGCGTCGC CGGGACCGCA CTGGCCACCG GCTACATCGC CTCCACTTAC
GGGTTGCGCC CCGGCCCGTT CCTGCTGGGT GCTGCCTTCA TCGCCGTCGG CCTGGGCCTT
TCCGTGCTCA CCGTGCGGGA AACGCACCAC CACGCCAGGG CTGAGGCCGC CAGCCACGTT
GCGGTCCACG AGGGGGCGCA CGGGCAGCTG AGCAACCGCG AGGTGTTCAC CCTGACGAGC
TTCCGTGACA AATCGCTGTC CGCTGTCAGC CAGGCGGGCA TGGTGAACAA CCTCAACGAC
GGCCTGGCCT GGGGCCTTTT TCCGGTCCTC TTTGCGGCCG CCGGATTGAC AATTGAACGC
ATCGGCATCC TCGCGGCGGT GTATCCGGCC GTGTGGGGAG CCGGCCAGCT GGTGACCGGG
GTGCTGTCGG ACCGGATCGG GCGCAAGCCC CTGATCGTGG GAGGCATGCT GGTGCAGGCC
GTCGCGCTCG GCATGGTCGC CTTTGGCGCC GCCTTTGAGA TCTGGCTGGC CGCGGCAGTC
CTGCTGGGGG CAGGCACGGC CATGGTCTAC CCGACGCTGC TGGCCGCCAT CGGCGACGTT
GCCCACCCCG AATGGCGGGC CAGGTCGGTG GGGATCTACC GGCTGTGGCG CGACGGCGGC
TTTGCGGTCG GCGCCCTGCT GTCCGGAATC ATTGCAGACG CCTACGGCAT TCCTGCGGCA
GTCGCCGTCG TTGCCGTCCT GACCGGAGTG TCCGGTGTTG TGGTGGCTGT CCGGATGCGC
GGCGCCGATC ACAAGCCCTC CTTCCGCTAG
 
Protein sequence
MTPGNTSPKH QPGTRGAITL GLRQNLAQFM ILVAVNALVG GTLGQERTVL PLLAGEVFKL 
DLYTSALTYI LAFGVAKAAT NYFAGTLSDR YGRKPVLVAG WLVALPVPLM LIFGPSWGWI
VAANIVLGIS QGLTWSTTVM MKMDLVGPSR RGLAMGLNEA AGYLGVAGTA LATGYIASTY
GLRPGPFLLG AAFIAVGLGL SVLTVRETHH HARAEAASHV AVHEGAHGQL SNREVFTLTS
FRDKSLSAVS QAGMVNNLND GLAWGLFPVL FAAAGLTIER IGILAAVYPA VWGAGQLVTG
VLSDRIGRKP LIVGGMLVQA VALGMVAFGA AFEIWLAAAV LLGAGTAMVY PTLLAAIGDV
AHPEWRARSV GIYRLWRDGG FAVGALLSGI IADAYGIPAA VAVVAVLTGV SGVVVAVRMR
GADHKPSFR