Gene Tpau_2142 details

Gene Information

Locus tag: Tpau_2142
Symbol:
ID: 9156298
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Tsukamurella paurometabola DSM 20162
Kingdom: Bacteria
Replicon accession: NC_014158
Strand:
Start bp: 2233137
End bp: 2234297
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: type I phosphodiesterase/nucleotide pyrophosphatase
Protein accession: YP_003647092
Protein GI: 296139849
COG category:
COG ID:
TIGRFAM ID:
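
The coordinate and length fields above agree with each other if Start bp and End bp are read as 1-based, inclusive positions and the CDS ends in a single stop codon. The record does not state either convention, so both are assumptions here. A minimal Python sketch of that arithmetic, with hypothetical helper names:

```python
# Values copied from the record above.
START_BP = 2233137
END_BP = 2234297


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Length in aa, assuming the CDS ends with one stop codon (an assumption)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under these assumptions the results match the Gene Length (1161 bp) and Protein Length (386 aa) fields.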


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.216003
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGACAGG TGGTCCCACA GTACGGGTCG GCCTCGCTCG CCGATGTCGT ACCGTCGGCC 
GCCGCCGCGC TCGGCGTTCG CGGTTTCGTG AACCAGCTCG GTTTCCGCCC GACGCGACGG
GTCTGCCTGC TGCTCGTGGA CGGCCTCGGA CATCTTCTCC TGGAACGGTA CGCCGCACAG
GCGCCGTTCC TCTCCGAGCT CACCGCCACC CGCATCTGCG CGGGCTTCCC GTCGACCACG
GCCACCAGCA TCTCCTCGAT CGGCACCGGA CTGCCCCCGG GGGAGCACGG CATCGTCGGC
CTGTCCTTCG CCGTCTGCGG TGACGGTATC GCCACGGGGA CCACCATCAA CGCTCTCGGC
TGGAATTCCT ACGGGGTCCG GCATGCGCGG GACCTCCGCG AGTCGGTGGT CCCGGAGCGG
GTGCAGCCGG AGCGCACACT GTTCGAGGCG ATGGCGGCCG ACGGCGTCGC CGTGACCACG
GTGACGCCGA AGGATCACGT GGGAAGCGGT CTGAGCCGCG CCGTCCTGCG GGGCGCGGAT
CCGGTAGCGG CGACCGCGCT GGGTGACATC GTCGGTCGCG TGGCGGCCGC CACCGCCACG
GGGACGGGCG AACGCGCCTT CTGCTACGCC TACCACGGCG ACCTCGACAT GCTGGGCCAC
GTCTACGGTC CCGGTTCGCT GCCGTGGCTG ATGCAGTTGC GGCAGGTCGA CACCCTGGCC
GAGTCACTGG CGATGGCGCT GCCCCCGGAC TGTCTGCTCG TGATCACGGC CGATCACGGC
ATGATCGAGG CTCCGGAGCA GTCCCGCATC GACTTCGACG CGGAGCCCGC GCTTCGGGCC
GGCGTCCGGC AACTGGCGGG AGAGCCGCGG GTCCGGCACG TGTACACCGC CGACGGTGCC
GTTACCGACG TCCGCGCGGC ATGGTCGGCA GTGCTCGGAG AACGAGCGTG GATCCATACC
CGGGACGAGG CCGCCGAGGC CGGCTGGTTC GGTCCGCGGG TGCTCGACCG CACCCGGGAG
CGGATCGGAG ACCTGGTCGT GGCGATGCGC GGTGCGCACA CCGTGGCCGT CCCGTCCGCG
GAGCCGGTCG TGTCGAACCT GCTCGGCCAA CACGGCTCAC TCACCGAAGA CGAGCAGCTC
GTCCCGGTCC TGGTGCGCTA G
 
Protein sequence
MGQVVPQYGS ASLADVVPSA AAALGVRGFV NQLGFRPTRR VCLLLVDGLG HLLLERYAAQ 
APFLSELTAT RICAGFPSTT ATSISSIGTG LPPGEHGIVG LSFAVCGDGI ATGTTINALG
WNSYGVRHAR DLRESVVPER VQPERTLFEA MAADGVAVTT VTPKDHVGSG LSRAVLRGAD
PVAATALGDI VGRVAAATAT GTGERAFCYA YHGDLDMLGH VYGPGSLPWL MQLRQVDTLA
ESLAMALPPD CLLVITADHG MIEAPEQSRI DFDAEPALRA GVRQLAGEPR VRHVYTADGA
VTDVRAAWSA VLGERAWIHT RDEAAEAGWF GPRVLDRTRE RIGDLVVAMR GAHTVAVPSA
EPVVSNLLGQ HGSLTEDEQL VPVLVR
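
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before use. The Python sketch below shows one way to strip the spacing, compute a GC fraction, and translate a coding sequence. The helper names are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares for internal codons. The sketch ignores the alternative start codons that table 11 allows; this gene starts with ATG, so that does not matter here.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; '*' marks a stop).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 0 and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
first_blocks = "ATGGGACAGG TGGTCCCACA GTACGGGTCG"
print(translate(first_blocks))
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared with the Protein sequence block and the GC content field.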