Gene Tpau_3737 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpau_3737 
Symbol 
ID9157917 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTsukamurella paurometabola DSM 20162 
KingdomBacteria 
Replicon accessionNC_014158 
Strand
Start bp3854387 
End bp3856087 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content66% 
IMG OID 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_003648654 
Protein GI296141411 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.871211 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCCCCT CTGTTTCCGC ACCCGTCGAT ACCGTCACCG TCGGCCCGAT CGAGGGATCC 
GAGAAGGGTT ATCGCGCTAT CGCGGGAAGC AATGGCACTC TGCGCGTGCC CTTCCGGCGG
ATCTCGCTGA CCAACGGCGA CCATCACGAC GTCTACGACA CCTCCGGTCC GTACACCCAG
TACGCCTCGG AGGATCAGCT GCACGACCTG CAGGCGGGCC TGCCGAAGAC ACGCGATGAA
TGGGCGAAAC CCGAGCCCGT GGCCACCGAG AACGGCGGTG CCGGCGCCCG CACCCAGCTC
GCCTGGGCAC GCGCCGGTGT GATCACCGAC GAGATCCGGT TCGTCGCTGC CCGTGAGGGA
TTCGACCCTG AGTTCGTCCG TGCCGAGGTG GCCGCCGGTC GCGCGGTGAT CCCGGCGAAC
CACAAGCACC CCGAACTGGA ACCCGCGATC ATCGGCAAGG CGTTCGCGGT GAAGATCAAC
GCGAACATCG GCAACTCGGC GGTCACCAGC TCGATCGGCG AGGAGGTCGA GAAGATGGTG
TGGGCCACCC GCTGGGGCGG CGACACCATC ATGGATCTCT CCACCGGCAA GGACATCCAC
GAGACCCGCG AGTGGATCAT GCGGAACTCA CCGGTCCCGG TGGGCACGGT GCCGATCTAC
CAGGCCCTGG AGAAGGTCAA GGGCGATCCC ACCAAGCTCA CCTGGGAGAT CTACCGGGAC
ACGGTGATCG AGCAGTGCGA GCAGGGCGTC GACTACATGA CCGTGCACGC GGGTGTGCTG
CTGCGCTACG TTCCGCTGGC CGCGAACCGG GTGACCGGCA TCGTCAGCCG CGGCGGTTCG
ATCATGGCGG CCTGGTGCCT CGCGCACCAC GAGGAGTCGT TCCTGTACAC GCACTTCGGT
GAGCTGTGCG AGATCCTCCG CGAGTACGAC GTGACCTTCT CCCTGGGCGA TGGCCTGCGC
CCCGGATCCA TCGCGGACGC CAACGACGAG GCGCAGTTCG CCGAACTGCG CACCTTGGGT
GAGCTGACGA AGATCGCGAA GTCGTATGGC GTGCAGGTGA TGATCGAGGG CCCCGGACAC
GTGCCGATGC ACAAGATCGT GGAGAACGTG CGCCTGGAGG AGGAGCTGTG CGAGGAGGCG
CCGTTCTACA CGCTCGGCCC GCTCGCCACC GATATCGCGC CGGCGTACGA CCACATCACC
TCGGCCATCG GCGCGGCGAT CATCGCGCAG GCCGGTACGG CGATGCTCTG TTACGTGACG
CCGAAGGAGC ACCTGGGCCT GCCGAACCGC GACGATGTGA AGACCGGCGT GATCACGTAC
AAGATCGCCG CGCACTCGGC CGATCTCGCG AAGGGCCACC CGGGCGCGCA GTCCCGCGAT
GACGAACTCT CCAAGGCGCG CTTCGAGTTC CGCTGGGTGG ACCAGTTCAA CCTATCGCTG
GACCCCGACA CCGCCCGTGA GTTCCACGAC GAGACACTGC CCGCCGAACC GGCGAAGACG
GCGCACTTCT GCTCGATGTG CGGCCCGAAG TTCTGCTCCA TGCGGATCAG CGCCGATGTG
CGCGCCTACG CGCAGGAGCA CAACCTGGTG ACGCAGGAGG ACATCGCCGC CAAGCTCGCC
GCCGATATGG CGGAGAAGTC GAAGGAATTC ACCGACGCCG GTGGCCGGGT GTACATCCCG
CTCGAGACCG CGCAGCGGTG A
 
Protein sequence
MSPSVSAPVD TVTVGPIEGS EKGYRAIAGS NGTLRVPFRR ISLTNGDHHD VYDTSGPYTQ 
YASEDQLHDL QAGLPKTRDE WAKPEPVATE NGGAGARTQL AWARAGVITD EIRFVAAREG
FDPEFVRAEV AAGRAVIPAN HKHPELEPAI IGKAFAVKIN ANIGNSAVTS SIGEEVEKMV
WATRWGGDTI MDLSTGKDIH ETREWIMRNS PVPVGTVPIY QALEKVKGDP TKLTWEIYRD
TVIEQCEQGV DYMTVHAGVL LRYVPLAANR VTGIVSRGGS IMAAWCLAHH EESFLYTHFG
ELCEILREYD VTFSLGDGLR PGSIADANDE AQFAELRTLG ELTKIAKSYG VQVMIEGPGH
VPMHKIVENV RLEEELCEEA PFYTLGPLAT DIAPAYDHIT SAIGAAIIAQ AGTAMLCYVT
PKEHLGLPNR DDVKTGVITY KIAAHSADLA KGHPGAQSRD DELSKARFEF RWVDQFNLSL
DPDTAREFHD ETLPAEPAKT AHFCSMCGPK FCSMRISADV RAYAQEHNLV TQEDIAAKLA
ADMAEKSKEF TDAGGRVYIP LETAQR