Gene Tpau_0331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpau_0331 
Symbol 
ID9154466 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTsukamurella paurometabola DSM 20162 
KingdomBacteria 
Replicon accessionNC_014158 
Strand
Start bp340258 
End bp341997 
Gene Length1740 bp 
Protein Length579 aa 
Translation table11 
GC content71% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003645313 
Protein GI296138070 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGACG GTGCAACGAA CGTTCACCTG CCTTTTGCAG AGCAGAACAA GACTGAGGGC 
ACACCGGTGC CCACCACGCC GCAGTCCGTC GCGGGCGCGC TGGCGAGTAA TGCGGCGGCC
GTGCGACCCG TGGTGACCCC CGGGCCGGTG TCCGACGAGG GCACGGATTC GGCACCGTCG
ACCACGGAGT GGGAGGAACC CGAGCTCCCC GTCGTCGACG ACGCGGGCGA GGCCGGCCCC
GTGCCCACCG CCGCCGAGGA AGCCCCCGCC GCCGCGGTCG CCGACGCCGC CGCGGTCGCC
GACGCCGTCG CGTCCGACGA CGACCGACCC ACCCGGGAGC AGCCCGTGAT CGTCGACGAG
CCGGCCGGCC GGCCGGAGGC GGAGGCCGCC CCCACCGCCG ACACGGCGGC AGCGGGCGGG
CAGGCCTCGC CGCAGGGAGG CCCGCAGGTC ACCGGTCCGC AGCCGGCGGC GCAGGCCCCA
CAGCAGGCGC GGACCGTGCC CGTTCCGGCG CAGCAGCCCC AGCAGCAGTG GCAGGGCCAG
GCCCCGCAGG CGCAGCCCGG CCAGCATCAG TTCGCGCAGC CGCACACCGG TCAGTTCCAG
CAGCCGCAGT TCGCCGGCCA GCCGCCCACC GGGCAGTTCC AGGCTCCCGT GCAGGGCCGC
CCGCCGTCAC CGCTGCCGTT CGGCCACGCC GCACCCGGCC AGATCGCACC GCCGCAGCCC
CAGCACGCGG GTATGCCGAT GCGGCCGATG GGGGAGTCGG TCTCCGCGCC GATGTCCACG
GCCTCGTCAT CGTTCCAGCA GCACATCCGG ATGTCCGATT CGGCGATGCG GGTCACCGGC
TGGCGCGCGT GGCTCAAGCG CATGGGTATC GACGTGGGCC CCAGCACCTC CGATCAAGAG
CATGCGGAGC GGGTGCACCG GATCCGCGTC CCGAAGAACA AGTTCCACGT CACCAGCGTC
TTCGCCGATA GCGCGGGCGG CACCCTGCTG GTGGGCGTGC TGGGCCAGAT TCTGGAGCGC
ACCCGCGCCG ACAACGTGGT CGCGCTCGAC CTCGATCCCG ACGGCGGCGA TCTCGACCAG
GTGACCGCGT GGCATCAGGG CGGTTCGACC GCGCGCACCC TGATCCAGCA GAGTGATCTG
TCGGACCGCA ACCAGGTGGA CAAGCACCTG GCCGTCACGT CGACGAATCT GCACGTGCTG
CCCACCCCGT GGCGGTTCAA CGGCCGCGAC GTCGCCGATT ACGATGACGT GCTCGATCTG
TACTCGATAT TCCGCCCGCA CTACAGCCTG GCGCTCGTCG ATGCGGGCCG AGGACTCCAG
ACCGTCACGG GCACCGGCGT TCTGGAGATC TCTTCCGCGT TGATCCTTCC CGCCTCGGCC
ACGACCCGCG GCGTGCGCAA GGTGGCCGCG ACCATCGATT GGCTGCGTCA CCACGGGTGG
CACGGATTGC TCGCCAACAC CATCGTGGTC ATCAATCACA CCAAGAAGCG CGGCAGTGTC
ACCGTCGAGC AGTTCGACGA GCTGTTCCGC GCCGGCCAGA AGCTGCGGGT CCACGAGATC
CCCTACGACC CGCACCTCGA TGCCGACACC CCGATCGACC TCGATCTGCT CAAGCCGCGC
ACCGTGCGCG CGTTCGAGCT GCTCGCGGCC GACCTCGCCG ACACCTTCAA TTCCGGATAC
GAGCCGCCGG CCGCCACCAA GCTGGCCGAA CTGCAGGTGC GCTCGCACGA GGGACGCTGA
 
Protein sequence
MSDGATNVHL PFAEQNKTEG TPVPTTPQSV AGALASNAAA VRPVVTPGPV SDEGTDSAPS 
TTEWEEPELP VVDDAGEAGP VPTAAEEAPA AAVADAAAVA DAVASDDDRP TREQPVIVDE
PAGRPEAEAA PTADTAAAGG QASPQGGPQV TGPQPAAQAP QQARTVPVPA QQPQQQWQGQ
APQAQPGQHQ FAQPHTGQFQ QPQFAGQPPT GQFQAPVQGR PPSPLPFGHA APGQIAPPQP
QHAGMPMRPM GESVSAPMST ASSSFQQHIR MSDSAMRVTG WRAWLKRMGI DVGPSTSDQE
HAERVHRIRV PKNKFHVTSV FADSAGGTLL VGVLGQILER TRADNVVALD LDPDGGDLDQ
VTAWHQGGST ARTLIQQSDL SDRNQVDKHL AVTSTNLHVL PTPWRFNGRD VADYDDVLDL
YSIFRPHYSL ALVDAGRGLQ TVTGTGVLEI SSALILPASA TTRGVRKVAA TIDWLRHHGW
HGLLANTIVV INHTKKRGSV TVEQFDELFR AGQKLRVHEI PYDPHLDADT PIDLDLLKPR
TVRAFELLAA DLADTFNSGY EPPAATKLAE LQVRSHEGR