Gene Arth_3540 details

Gene Information

Locus tag: Arth_3540
Symbol:
ID: 4443763
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 3978445
End bp: 3979836
Gene Length: 1392 bp
Protein Length: 463 aa
Translation table: 11
GC content: 67%
IMG OID: 639691364
Product: major facilitator transporter
Protein accession: YP_833015
Protein GI: 116672082
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID:
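
The coordinate and length fields above are mutually consistent, which can be checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3978445
END_BP = 3979836
GENE_LENGTH_BP = 1392
PROTEIN_LENGTH_AA = 463


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


print(span_length(START_BP, END_BP))    # 1392
print(protein_length(GENE_LENGTH_BP))   # 463
```

Both results match the Gene Length and Protein Length values listed in the record.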


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCAACG TCCCAGTTCA GGCTTCCGGC GCAGCGCCGC GCCCGGGCAA GCCGATGCAC 
CCGAAGGGCC TGTATAAGGC CTTTGCCGCA AGCCTTACCG GCACCGCACT CGAGTGGTAC
GACTTCGCCG TCTACTCAGC CGCAGCCGCC GTCGTATTCC CCATCGTCTT CTTCCCGTCA
TCCGATCCCC TGACCGGCAC CATCCTGGCG TTTTCAACCT ACGCTGTGGG CTACGTTTCC
CGCCCCGTGG GCGGCATCAT CTTCGGCCGG CTCGGCGACC GGATCGGCCG CAAGAAGGTC
CTGGTCACCA CCCTCATGAT CATCGGCGTG GCCACCGTGC TGATCGGCGT GCTTCCCGGG
TACGGCAGCA TCGGCATCAC CGCCCCGATC ATCCTGGTGC TGCTGCGCTT CGCCCAGGGC
GTGGGCGTAG GCGGCGAATG GGGCGGCGCC GTGCTGCTCT CCAGCGAATA CGGGGATCCC
CACCGGCGCG GCTTCTGGGC ATCCGCCGCC CAGGTGGGCC CTCCCGCCGG CAACCTCATG
GCGAACGGCG CGCTGGCCGT CCTGACCCTC ACCCTGACCG AAGAGCAGTT CATCAGCTTC
GGGTGGCGCA TCGCCTTCCT GGTCTCGGCC GTGCTGGTCG GATTCGGGCT CTGGATCCGG
CTCAAGCTGG AAGACACTCC GATCTTCAAG GCCATTGAGG CCCACGGCGA ACAGCCCAAC
GCCCCGGTCC GGGAGGTCTT CAGCAAGGAA CTCCGGCCGC TCATCGCCGC CACGCTGTGC
CGGGTTGGTC CTGACGTGCT CTACGCCCTG TTCACCGTCT TCACCCTTAC CTATGGCATC
CAGGCCCTCG GCTACGAGCG CAGCCAGGTC CTCACCGCTG TGCTGATCGG CTCCGCATTC
CAGCTGTTCA TGATCCCGCT GGCCGGCGCC GTATCGGACC GCTTCAACCG CCGCCTGGTC
TACGGCACGG CCGCGGTGCT GGGCGCCGTC TGGACATTCA TCTTCTTCGG CATCCTCGGC
GGAGACAATG AGCCGATGCT GATCGCGGGC ATCGTCCTGG GCCTCATGGC ACACTCATTC
ATGTACGGAC CGCAGGCCGC CTTCATTGTG GAGCAGTTCT CCCCCAGGCT CCGGTCCACC
GGAAGTTCGC TGGCATACAC CTTCGCCGGC GTGATCGGCG GCGCGATTGC CCCGCTGATG
TTCACGCTGC TGCTGTCCCA GTTCGGCACC TGGATTCCGG TGGCCATCTA TGTTGCCGTG
GCCGCCGCCG TCACCGCAGT AGGCCTGGCG CTCGGCCGGG ATTCCAACAC AGTGGAGGAC
GAGGACTACC GCCTGCTGCT CGAAGGATCC GCAGCAGCGC GCCAGCCGTC CGCCGTCGCG
GAATCCCGCT GA
 
Protein sequence
MANVPVQASG AAPRPGKPMH PKGLYKAFAA SLTGTALEWY DFAVYSAAAA VVFPIVFFPS 
SDPLTGTILA FSTYAVGYVS RPVGGIIFGR LGDRIGRKKV LVTTLMIIGV ATVLIGVLPG
YGSIGITAPI ILVLLRFAQG VGVGGEWGGA VLLSSEYGDP HRRGFWASAA QVGPPAGNLM
ANGALAVLTL TLTEEQFISF GWRIAFLVSA VLVGFGLWIR LKLEDTPIFK AIEAHGEQPN
APVREVFSKE LRPLIAATLC RVGPDVLYAL FTVFTLTYGI QALGYERSQV LTAVLIGSAF
QLFMIPLAGA VSDRFNRRLV YGTAAVLGAV WTFIFFGILG GDNEPMLIAG IVLGLMAHSF
MYGPQAAFIV EQFSPRLRST GSSLAYTFAG VIGGAIAPLM FTLLLSQFGT WIPVAIYVAV
AAAVTAVGLA LGRDSNTVED EDYRLLLEGS AAARQPSAVA ESR
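
The sequences in this record are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that in plain Python and to recompute a GC percentage for comparison with the GC content field. All function names are hypothetical. The stop codons checked (TAA, TAG, TGA) are those of translation table 11, and the reading-frame check is only a basic sanity test, not a full gene validation.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean(seq: str) -> str:
    """Join a block-formatted sequence by removing all whitespace; uppercase it."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    gc = sum(1 for base in s if base in "GC")
    return 100.0 * gc / len(s)


def in_frame_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    s = clean(seq)
    return len(s) % 3 == 0 and s[-3:] in STOP_CODONS


# Toy input in the same block layout as the record; paste the full
# gene sequence here to compare against the reported GC content.
demo = "ATGGCC TGA"
print(clean(demo), round(gc_percent(demo), 1), in_frame_with_stop(demo))
```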