Gene Tpau_4220 details


Gene Information

Locus tag: Tpau_4220
Symbol:
ID: 9158408
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Tsukamurella paurometabola DSM 20162
Kingdom: Bacteria
Replicon accession: NC_014158
Strand:
Start bp: 4351634
End bp: 4352959
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: major facilitator superfamily MFS_1
Protein accession: YP_003649127
Protein GI: 296141884
COG category:
COG ID:
TIGRFAM ID:
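
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length_bp(4351634, 4352959)
print(length)                     # 1326
print(protein_length_aa(length))  # 441
```

Both results match the Gene Length and Protein Length fields of the record.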


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.665807
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACAGTGA GCGGTCCGGG GTCGGACCCG CAGCCCCAGC AGGCGGCGTG GCGCACCTTC 
CGCGACCTCC CCGATCTGCT GCGGCTGCTC GGCGTCCGGT TGCTGAGCCA GTACGCCGAA
GGCCTGTTCC AGGCGGCCCT GGGCAGCGCG ATCGTCTTCA ACCCGCAGCG CGGAGCGTCG
CCCGCGGCGA TCGCGGCCGG GCTCGCCGTA CTGCTCCTGC CGTACTCCGC CGTCGGGCCG
TTCGCCGGCG CGTTGCTCGA CCGGTGGGAC CGGCGCCGGG TTTTCATCGT CGCGAATCTG
ATCCGCGCCG CGCTCATCGT GGTGTGCGCG GCGATCCTCG CATCGGGCGC CGGGGAGACG
CCGATCTTCA TCGTCGCGCT CATCGTCGGC GGCGCCGGAC GCTTCGTGGC CTCCGGCCTG
TCCGCATCGC TACCGCACGT CGTCGACCGC GACCAGCTGG TCGCGATGAA CTCGGTGACC
ACCACCCTCG GCGCGGGCGC CACCGCACTG GGCGCCTCGA CCGCCGTAGG ACTGCGCGCG
ATCTTCGGCC CCGACGATGA GGGCAGTGCC GCCGTACTCG GCTGCGCAGC CCTGATCGCC
GTGGCCGGCG CGGCGCTCGC CTCCCGCTTC CCGGCCGGCG TACTGGGGCC CGACCACGAC
CCCGCGCTGC CCGCCGAGCG GACCTCCGCG TTCCACGACC TCGTCACCGG CCTCGCGCAC
GGCGCGGTCG CGGCCTGGCG CGCACCGTCG GTCACTGCGG CCCTGACCGG AATGGGCGCG
CACCGCACCG TCTTCGGGTT CAACACGATG ATGCTGCTCC TGCTGACCCG GCACCACTTC
ACCGACGGCA CCCTGGGCCT GGCCGGCTTC GGCGCCGTGG CCGGTGCCAC CGCCCTCGGT
ATGTTCGCCG CCGCGGTGAT CATCCCGTTC GCGGTCGCCA AGGCGGGCCG GCGCATCACG
GTGGTGGGAG CCCTGGCGAT CGCCTGCCTC ACCCAACTGA CGGTGCTCAC GCTCAACTTC
GCCGTGCTGG TGTGCGCGGC TGCGGTACTC GGCCTGGCGG GGCAGGTGGT GAAGCTCTCC
GCCGACGCAG CCATGCAGAT GGACGTGCCC GACGAGCGCC GCGGCCAGGT CTTCGCCTTT
CAGGACGCAC TGTTCAATGT GACCTTCGTG GCCGCCGTCG CCTTCGCCGC CGCTGTGGTC
CCGTACGACG GTGCCAGCCG ACCGCTCGCC CTCTTCGGGG CCGTGCTCTA CGCGGTGGCC
GTGGTGGTGG TGCTGGCCCT GTACCGGCGG ACGGGAACCG AGGTCCCGGC CGGCGCGTCC
AATTGA
 
Protein sequence
MTVSGPGSDP QPQQAAWRTF RDLPDLLRLL GVRLLSQYAE GLFQAALGSA IVFNPQRGAS 
PAAIAAGLAV LLLPYSAVGP FAGALLDRWD RRRVFIVANL IRAALIVVCA AILASGAGET
PIFIVALIVG GAGRFVASGL SASLPHVVDR DQLVAMNSVT TTLGAGATAL GASTAVGLRA
IFGPDDEGSA AVLGCAALIA VAGAALASRF PAGVLGPDHD PALPAERTSA FHDLVTGLAH
GAVAAWRAPS VTAALTGMGA HRTVFGFNTM MLLLLTRHHF TDGTLGLAGF GAVAGATALG
MFAAAVIIPF AVAKAGRRIT VVGALAIACL TQLTVLTLNF AVLVCAAAVL GLAGQVVKLS
ADAAMQMDVP DERRGQVFAF QDALFNVTFV AAVAFAAAVV PYDGASRPLA LFGAVLYAVA
VVVVLALYRR TGTEVPAGAS N
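
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the database's own method: it assumes the amino-acid assignments and alternative start codons of NCBI translation table 11, assumes the first codon is always read as M, and uses hypothetical helper names (`clean`, `gc_percent`, `translate_cds`).

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (same assignments as the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons accepted under translation table 11 (assumed from the NCBI definition).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading the start codon as M and stopping at the first stop."""
    seq = clean(seq)
    if not seq or len(seq) % 3 != 0:
        raise ValueError("CDS length must be a non-zero multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError("sequence does not begin with a table 11 start codon")
    protein = ["M"]
    for codon in codons[1:]:
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First display line of the gene sequence above.
first_line = "GTGACAGTGA GCGGTCCGGG GTCGGACCCG CAGCCCCAGC AGGCGGCGTG GCGCACCTTC"
print(translate_cds(first_line))  # MTVSGPGSDPQPQQAAWRTF
```

The printed residues match the first 20 residues of the protein sequence, including the GTG start codon being read as M. Passing the full gene sequence to `gc_percent` gives a value to compare against the GC content field.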