Gene Tpau_3599 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpau_3599 
Symbol 
ID9157778 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTsukamurella paurometabola DSM 20162 
KingdomBacteria 
Replicon accessionNC_014158 
Strand
Start bp3711238 
End bp3712395 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content67% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003648516 
Protein GI296141273 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.904243 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACTCCG AACCGCTCAC CTCGGCGATC GCCGAGGCCG AGGCTCTCGT CGCCGCCGCT 
GCGCACATCG AATCCGAGGC CGATCTCCTG GAAGGCCTGC AGTACCTGGC GCAGGGCGTG
GCCGCGTGCA TTCATGGCGC CTTCCATTTC GACAAGGACC ACCCGTTCCT GCTCAGCGGC
ACCGGACCGT TCACCAAGAT GGGGCTCGAC AATCCCGACA CCCTGTACTT CGGTGCACGC
GTGGACGGTT CCCACGAGTA CCTGGTCACC GGCCGTCGCG GTACCACCGC AGACATCAGC
TTCCAGGTAC TCGGAGGCGG CGAATACACC GACGAGAACG TGCCCGCCAG CACCGTCGCC
TTCGACGACC GCGAGCTCAC CATCGGCGCC GACGGCCGGT TCGCGGTGCG ATTCGGGCCC
GGCCGAGCCG GGCCGGACTA CTACCACCTG CCACCGGGTA AGGCACAACT GGTGATCCGC
GAAGTCTTCG ACGACTGGTC GGCCCAGCGC AGTACTTTCG CGATCACTCG CACCGACACC
ACCGGTACCG CCCCGCCGCC GCTCACCGAC GAGCTCATTC GCAAGCGCTA CGCCGCCGCG
GGCACCCAAC TGGTCAACCG CGTGAAGACC TGGCTGCAGT TCCCGCGGTG GTTCTACGAT
CCGCTGCCGG TGAACACCCT CTCCGCGCCG CGCCTCACCC CGGGCGGCCT CGCCACCCAG
TACTCGTCCG TGGGCCACTA CCATCTCGCC GACGACCAGG CGTTGATCAT CACCGTTCCC
CGTGGCGACG CGCCCTACGT CGGCTTCCAG CTCGGCAGTC TCTGGTACAT CTCGTTGGAC
TACATCAACC ACCAGACCTC GCTCAACGGC AGCCAAGCGC AGGTAGACCC GGATGGGAAC
ATCCGGATCG TGGTCTCCGG CAAGAACCCC GGCATCACCA ACTGGATCGA GACCGTGGGA
CACCGCCGCG GCTACCTGCA ATTCCGCTGG CAACGTACCT CCGGTCCGGT CACCGAAGGC
CCCACCGCGC ACGTGGTCCC GCTCGACGAC GTGGCGCGGC ATCTGCCCTT CCACGCGCAG
AACACGATCG ACGAGCACCG TTGGCGGGCG CGGATCGCGG AGCGGCAGCG CCTCATCGGT
GAGCGGATGG TGGGCTGA
 
Protein sequence
MYSEPLTSAI AEAEALVAAA AHIESEADLL EGLQYLAQGV AACIHGAFHF DKDHPFLLSG 
TGPFTKMGLD NPDTLYFGAR VDGSHEYLVT GRRGTTADIS FQVLGGGEYT DENVPASTVA
FDDRELTIGA DGRFAVRFGP GRAGPDYYHL PPGKAQLVIR EVFDDWSAQR STFAITRTDT
TGTAPPPLTD ELIRKRYAAA GTQLVNRVKT WLQFPRWFYD PLPVNTLSAP RLTPGGLATQ
YSSVGHYHLA DDQALIITVP RGDAPYVGFQ LGSLWYISLD YINHQTSLNG SQAQVDPDGN
IRIVVSGKNP GITNWIETVG HRRGYLQFRW QRTSGPVTEG PTAHVVPLDD VARHLPFHAQ
NTIDEHRWRA RIAERQRLIG ERMVG