Gene Arth_4045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_4045 
Symbol 
ID4447881 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4565059 
End bp4566324 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content66% 
IMG OID639691876 
Productmajor facilitator transporter 
Protein accessionYP_833520 
Protein GI116672587 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAACGA CGTCGTTGGA AAACAAATCA CGCTGGCCCG TCTGGCTCTG CTGGCTGGCC 
ATGGTCCTGG ACGGCTTTGA CCTGGTAGTG CTTGGCACCG TCATCCCCAC GCTCATCAAG
ACCCACGACC TTGGCTTCGA CGCCGTGGGC GCCACCTTTG CTGCGACTAT CTCCCTGGTG
GGCGTGGGCC TCGGAGCGCT GTTCATTGCA CCGCTTTCCG ACCGATTCGG CCGACGGAAC
CTGCTGGTTG CCTGCGTTAC GTGGTTCTCC ATCTTCACCA TTGCCGTGGT CTTTGCCCCC
AACGTGGCAG TCTTCAGCGC CTTCAGGCTG CTGGCGGGCC TGGGGCTGGG CGCCTGCCTT
CCCGCTGCTT TGGCCTACAT GAACGATTAC GCCCCCGCGG GATCCGCCGG CAAGTCCACC
ACCCGGACCA TGACGGGCTA CCACGCAGGC GCAGTGGCCA CCGCCTTCCT GGCGCTCATG
GTCATCCCTG ACTGGCGCAT CATGTTCGTA GTTGGCGGCC TTGCGGGCTT CGTGCTGGTC
CCGTTCCTGT GGTTCAAGCT GCCGGAAACG CTGCCCGCCG TCATCTCCCT TCCGGCGCCC
GGCAAGGCCG CGGCAAGAGA ACCCGCTCCT GCGGTGGAAG ACCGTGCCAG CTTCAAGGAC
CTCGGGCGGA AACCGTACCC GCTCGTGGCC GCCGGTGTGG CCGTGGCCTC GTTCATGGGC
CTGCTGCTGG TGTACGGACT GAATACCTGG CTGCCGCAGC TCATGTCGTC CGCCGGCTAC
ACGCTCAGCG CCGGGCTCTC CCTCCTGCTG GTCCTGAACG TGGGCGCCGT GGCGGGCCTG
GTAGTGGCCG GTATCCTGGC GGACAAGCAC GGAACCAAGA AGATCGTGCT TCTCTGGTTC
GGGCTCTCCG CCGTGTTCCT GGCAGTACTT AGCGTGAAAA TCCAGAACGA GCTGTTCCTG
AACGCGGCCG TCTTCGTCAC CGGGGTCTTC GTCTTCAGCT CACAGGTGCT GGTGTATGCC
TGGGTGAGCC AGCTGTTCCC GCCGCGGCTG CGCGGCACCG CGCTGGGCTT CGCCGCAGGC
GTCGGACGCC TGGGGGCCAT CCTCGGTCCG GCCGTGACAG GCACCCTTGT GGCCGCCGGA
ATCGCTTACC CCTGGGGCTT CTATGTCTTT GCCGCCGCGG CCGTTCTCGC CGTTGCAGCC
CTCGCCCTGG TCCCGCAGGC GGTCACCGCG GCGGCGGGCA AGCGGACCGC CGTCGGGCCT
TCCTAA
 
Protein sequence
MSTTSLENKS RWPVWLCWLA MVLDGFDLVV LGTVIPTLIK THDLGFDAVG ATFAATISLV 
GVGLGALFIA PLSDRFGRRN LLVACVTWFS IFTIAVVFAP NVAVFSAFRL LAGLGLGACL
PAALAYMNDY APAGSAGKST TRTMTGYHAG AVATAFLALM VIPDWRIMFV VGGLAGFVLV
PFLWFKLPET LPAVISLPAP GKAAAREPAP AVEDRASFKD LGRKPYPLVA AGVAVASFMG
LLLVYGLNTW LPQLMSSAGY TLSAGLSLLL VLNVGAVAGL VVAGILADKH GTKKIVLLWF
GLSAVFLAVL SVKIQNELFL NAAVFVTGVF VFSSQVLVYA WVSQLFPPRL RGTALGFAAG
VGRLGAILGP AVTGTLVAAG IAYPWGFYVF AAAAVLAVAA LALVPQAVTA AAGKRTAVGP
S