Gene Tpau_3358 details


Gene Information

Locus tag: Tpau_3358
Symbol:
ID: 9157532
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Tsukamurella paurometabola DSM 20162
Kingdom: Bacteria
Replicon accession: NC_014158
Strand:
Start bp: 3456875
End bp: 3458245
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: major facilitator superfamily MFS_1
Protein accession: YP_003648281
Protein GI: 296141038
COG category:
COG ID:
TIGRFAM ID:
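
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though the reported numbers are consistent with it) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3456875
end_bp = 3458245
gene_length_bp = 1371
protein_length_aa = 456


def span_length(start, end):
    """Length of a closed interval [start, end] (1-based, inclusive: an assumption)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks compare a derived value against the value reported in the record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```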


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCGACA GCGCGTCTCC CTTCGTCGAT CCGGACTATC GCCGCCTGTT CGCGGCGCAG 
GTGGTGGGCC TGTTCGGCTC CGGCTTGACG ACGGTGGCCC TGAGCCTGCT CGCGTATGAG
CTCGCCGGCG CCGATGCGGC GGGCGTGCTC GCCACCGCTC TCACCATCAA GATGGCGCTG
TACGTGGTGG TCGCGCCGCT CGTCGCCGCG CAGGCCTACC GGTTGCCGCG CCGCGCACTC
CTGGTGGCCC TGCACGTGAT CCGCGCCGCC GTCGTGCTCG CCCTTCCCGT GGTCACCGAG
ATCTGGCAGA TCTATCTGCT GGTCGCGCTG CTCCAGGCCG CGTCCGCGGC CTACACCCCG
ACCTACCAGG CCGTGATCCC CGATCTGCTG CCCGACGAGC GGCAGTACAC CCGCGCGCTA
TCGGCGTCGC AACTGGCCAC CACGATGGAG ACGTTGCTCA GTCCCATGTT GGCCGCGGCG
GCCCTGCTGG TGATGAGCTA CCAGTACCTG TTCGTGGGCA CCGCGATCGG CTTCCTGCTC
GCGGCGGCAC TGGTGCTGCG CTCGCGAGTG CCCAATCCCG ATGCGGCGCC GGAGGGCTCG
TTCCTGCAGC GGCTCGGCTC GGGGACGCGG ATCTTCGTGG CCACGCCGCG GCTGCGCGGG
CTGCTGGGTC TGAATCTCAC GGTGGCGGCG GCCGGTGCCA TCGTCATGGT GAACACGGTG
AACTTCACCC GTGACGAGCT CGCCGGGACC CAGGCGGATA TGGCGCTGGT ACTGGCCGCC
AACGGCCTCG GGACGATGGT GGTCGCGCTG ATCGTGCCGC GGCTGCTGGA CCGGACCCCG
ACCCGCACCG TCATGCTCAC CGGCGGTGCC GTACTGCCGC TGGCGTTGGC CGCCGCGGTC
GGATTGTCAC TGGCCGGTGA CGGCACCTGG CGGTGGGCGG CGCTCGCCAC GATCTGGTTC
GCGATCGGCG CCGGGACCGC CGCGGTGGTC ACACCGTCGG GACAGGTGCT GCGCCGCTCA
TCGAACGATG CCGACCGGCC CGCCGTGTTC GCGGCGCAGT TCTCGCTCTC CCACGTGGCC
TGGTTGATCA CCTATCCGCT GACCGGATGG CTCACCGGCT CGGCCGGGCT CACCGTGACG
TGGTCGGTGA TGCTGGTACT CGCCGTGACC GGCCTGGTCG TAGCAGCGCT CTCGTGGCCG
CGTCAGGATC CGGTGGAGAT CACCCACGCC CATCGCGAGG GCGACGTGGA CCCGGCCGTG
ATCGCCGATG CGACGCCGGT GGGGGACGGC TGGTTCGAGC ACACTCACGC CTACGTGATC
GACGGTGCGC ATCCTCGCTG GCCCGACCCC GACGGTCGGC TCATCGGCTA G
 
Protein sequence
MADSASPFVD PDYRRLFAAQ VVGLFGSGLT TVALSLLAYE LAGADAAGVL ATALTIKMAL 
YVVVAPLVAA QAYRLPRRAL LVALHVIRAA VVLALPVVTE IWQIYLLVAL LQAASAAYTP
TYQAVIPDLL PDERQYTRAL SASQLATTME TLLSPMLAAA ALLVMSYQYL FVGTAIGFLL
AAALVLRSRV PNPDAAPEGS FLQRLGSGTR IFVATPRLRG LLGLNLTVAA AGAIVMVNTV
NFTRDELAGT QADMALVLAA NGLGTMVVAL IVPRLLDRTP TRTVMLTGGA VLPLALAAAV
GLSLAGDGTW RWAALATIWF AIGAGTAAVV TPSGQVLRRS SNDADRPAVF AAQFSLSHVA
WLITYPLTGW LTGSAGLTVT WSVMLVLAVT GLVVAALSWP RQDPVEITHA HREGDVDPAV
IADATPVGDG WFEHTHAYVI DGAHPRWPDP DGRLIG
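
The sequences above are printed in space-separated blocks of 10 characters. A minimal sketch of how such a block could be cleaned, checked for GC content, and translated is given below. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard one, on the assumption that translation table 11 uses the same codon-to-amino-acid assignments and differs only in the start codons it permits.

```python
from itertools import product

# Standard codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(blocked):
    """Remove the spaces and line breaks used for 10-character blocking."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in an ungapped nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
first_blocks = "ATGGCCGACA GCGCGTCTCC CTTCGTCGAT"
print(translate(clean(first_blocks)))  # MADSASPFVD
```

Passing the full gene sequence through `clean` and then `translate` or `gc_fraction` gives values that can be compared against the protein sequence and the GC content field of this record.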