Gene Arth_4096 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_4096 
Symbol 
ID4447686 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4618002 
End bp4619201 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content70% 
IMG OID639691927 
Productmajor facilitator transporter 
Protein accessionYP_833571 
Protein GI116672638 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACCACAG CGCCAATCCC CACGCCGACC GCAGCCGTCA CCGCGCCACT GTACGCCGCA 
GGCTTTGTCA CAGCCTTTGG TGCCCACAGC ATCGCAGCCG GGATGGGCGC CCACAGCGGT
GATATCGGCC TGAGCCTGCT CAATCTGGGC GTCCTCCTGG CTGTTTACGA CCTCGCCGAG
GTGGTGCTGA AACCGGTCTT CGGAGCTTTG AGCGACCGCA TCGGCACAAA GCCGGTGGTC
GTGGCAGGGC TTTTCGCGTT TGCGCTGATG TCGTTGATCG GATTGTGGGG CTCCAACCCC
CTGATGCTCG GGCTCGCCCG GATCGGCCAG GGCGCCGCCG CCTCGGCGTT TTCCCCGGCG
TCCTCGGCGA TGGTGGCCAG GCTTGCCGGC CGCAACGCAG GAACGTATTT CGGCCGCTAC
GGCTCGTGGA AAAGCCTGGG CTACGTCGCG GGCCCGCTGA TCGGTGCCGG CCTGATCTTC
CTGGGCGGCT TCACCCTTCT CTTTGCCGCC CTGGCCATCC TCGCGGCGGC CACTGCGGTG
TGGGCGATGG TGACGCTGCC GCAGCTGGCC CCTCTGCCCC GCCCGCGGTA CACGCTGTTG
GATCTTGCCC GCCAGGTAAC CCATCGGAGC TTCCTCGTGC CTACGCTCGT TCTTGCGGCA
GCCACCGGGG CCCTGGGCAC AGCCATCGGC TTCCTCCCTG CGCTGGCAAC GCGGCACGGC
CTGGACCCTG TGGCGGCCGT TGCCGCGGTC AGCGTGCTGG CACTCGCGTC CGCTGCCACC
CAACCCTGGA TCGGCCGCCT GCGTGACGGG GGCCGGCTCC ATGACGGCCC CGGCATGACA
GCCGGGCTGC TGCTGACGGC GGCCGGAATC GCCGCGGTGG CACTGCTTCC GGGACCGGTC
ACCATTTTTT GCGCTGCGGC GGCCATCGGC ACGGGAATCG GTGTTGCCAC GCCGCTGGGC
TTCGCGCACC TTGCCGCCAC CACTCCGCCT GAGCGTTTGG GAAGGACCAT GGGAACAGCC
GAGCTGGGAC GGGAGCTTGG CGACGCCGGT GGTCCGCTCC TGGTTGGCGC CGTGGCTACA
GCTTCAGCTC TGCCGCTGGG CCTCGGAGTC CTTGCCGCGG CCGTCACCGC CGCGTCCCTG
CTCGGCGTCG GCAGCATCGG CCGCCGGGCG CCGTCGCCGG AACCGGCCGC CAAACCGTGA
 
Protein sequence
MTTAPIPTPT AAVTAPLYAA GFVTAFGAHS IAAGMGAHSG DIGLSLLNLG VLLAVYDLAE 
VVLKPVFGAL SDRIGTKPVV VAGLFAFALM SLIGLWGSNP LMLGLARIGQ GAAASAFSPA
SSAMVARLAG RNAGTYFGRY GSWKSLGYVA GPLIGAGLIF LGGFTLLFAA LAILAAATAV
WAMVTLPQLA PLPRPRYTLL DLARQVTHRS FLVPTLVLAA ATGALGTAIG FLPALATRHG
LDPVAAVAAV SVLALASAAT QPWIGRLRDG GRLHDGPGMT AGLLLTAAGI AAVALLPGPV
TIFCAAAAIG TGIGVATPLG FAHLAATTPP ERLGRTMGTA ELGRELGDAG GPLLVGAVAT
ASALPLGLGV LAAAVTAASL LGVGSIGRRA PSPEPAAKP