Gene Arth_0236 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0236 
Symbol 
ID4447292 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp249162 
End bp250502 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content63% 
IMG OID639688032 
Productmajor facilitator transporter 
Protein accessionYP_829737 
Protein GI116668804 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID[TIGR00893] d-galactonate transporter 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGCGC TCGGAAGTAA AGTTGCCAGT ACGTCATCCC CAAAAAATCC CCGCCTCATG 
ACCCGGAAGC GCTGGGTCAT TATCTGGCTT GCCTTCATCG GGCTGAGCAT CAACTACCTG
GACCGCTCCA GCCTCAGTGT TGCGTTGCCC TTCATGGGCA AGGACTTCGA ACTGACCGCC
ACCCAGCAGG GCCTGATCTT TGCCGCCTTC TTCTGGGCCT ACGACTTCTG CCAGCTGGCT
GCCGGTTGGT ACGTGGATAA GGTCGGACCG CGCAAGTCGT TCTCCTTGGC GGCGCTCTGG
TGGTCCGTGT TCACCATGGT GACCGCCGCG GCGACGAGCT TCTGGTCCCT CTTCGCGGCC
CGCTTCCTCC TGGGCGTCGG CGAAAGCCCC GCGCCCAGCA CGGCAGCCAA GGTGGTGGCC
ACCTGGTTCC CCGTCCGTGA ACGGGCTTTT GCCACAAGCA TCTGGGATTC GGGTTCCCGG
GTGGGGGCCG TCATTGCGCT TCCCATCGTT ACCCTGATCG TGGCGCTGAC GTCTTGGCAC
GCGGTATTCA TCATCATCGG CATCGCCGGC GTGATCTGGG CGGCCGTCTG GTGGAAGGTG
TACCGCAGCC CCCAGGAGCA CCCCGGTGCC AACGCCGCCG AAGTGGCGTA CATCGAAGAA
GGTGGCGCCC GCGGCGAAGG CAGCGACGAC GCCAACGCCG CCAAGCTGCC GTGGCGCTCG
CTCTTCAAGT ACCGCACCAT CCTCAGCATG ATGTTCGGCT TCTTCTGCTT GAACAGCGCC
ATCTACTTCT TCATCACCTT CTTCCCGAGC TACCTCGTGA AGGAACGCGG ATTCGACTTG
CTCAAGCTTG GCTTCTTCGG TGCCATCCCC GGCATCTGCG CGGTGCTGTG CGGCTGGCTG
GGCGGATACC TGGCGGACCG CGCGGTCCGG GCCGGGGCAT CCGTTACTAA GGTCCGCAAG
ACCGCTATCG CTGGCGGCCT CGCAGGTGGC TCGGTCATCA TGTTTGCCGC CCTCGTGCCG
GAAGCCTGGA TGGCCCTGGC GCTGCTTTCC GTTGCCTACT CAAGCCTTAC CGTCGCAGCG
ACCGGCATCT GGTCGCTACC CGCGGACGTT GCCCCGAGTT CCAAGCACGT CGGATCCATC
GGCGGACTTC AGAACTTCGC CTCCAACCTG GCCGGAATCT TCACCCCGAT CCTCATCGGC
GTGCTGGTGG ACCAGACGGG TTCCTTCGTG GCCCCGCTGG CGGTCATCGG AGCCATCTCG
CTCGTGGGTG CCGCCAACTA CCTCTTCGTC ATGGGCAAGA TTGAACCCCT GAAGGTCAAA
GAGCCCGTAG CGGTCGCATA A
 
Protein sequence
MTALGSKVAS TSSPKNPRLM TRKRWVIIWL AFIGLSINYL DRSSLSVALP FMGKDFELTA 
TQQGLIFAAF FWAYDFCQLA AGWYVDKVGP RKSFSLAALW WSVFTMVTAA ATSFWSLFAA
RFLLGVGESP APSTAAKVVA TWFPVRERAF ATSIWDSGSR VGAVIALPIV TLIVALTSWH
AVFIIIGIAG VIWAAVWWKV YRSPQEHPGA NAAEVAYIEE GGARGEGSDD ANAAKLPWRS
LFKYRTILSM MFGFFCLNSA IYFFITFFPS YLVKERGFDL LKLGFFGAIP GICAVLCGWL
GGYLADRAVR AGASVTKVRK TAIAGGLAGG SVIMFAALVP EAWMALALLS VAYSSLTVAA
TGIWSLPADV APSSKHVGSI GGLQNFASNL AGIFTPILIG VLVDQTGSFV APLAVIGAIS
LVGAANYLFV MGKIEPLKVK EPVAVA