Gene Tpau_4033 details


Gene Information

Locus tag: Tpau_4033
Symbol:
ID: 9158217
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Tsukamurella paurometabola DSM 20162
Kingdom: Bacteria
Replicon accession: NC_014158
Strand:
Start bp: 4160973
End bp: 4162511
Gene Length: 1539 bp
Protein Length: 512 aa
Translation table: 11
GC content: 66%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003648943
Protein GI: 296141700
COG category:
COG ID:
TIGRFAM ID:
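
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(4160973, 4162511)
    print(length, "bp")                  # 1539 bp
    print(protein_length(length), "aa")  # 512 aa
```

Both values agree with the Gene Length and Protein Length fields in the record.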


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.104579
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTGATC GCCGCCTGAT CGTCACCCCG CTCGCCGCCG CACTGATCGG TGCCACACTG 
GTCATCGTCA GCGCTGGATG CTCTTCCGGT CGTGGCGACG ACGCCGCCAC GCTCGACCTG
ACCACCGGCG TGAACACGCC GCTCACGATC TCCGCCGGAC AGCTCGCGAA GCTCGCTGAC
GGCGGTGCGG TCGTCGGTGT CGAATCGTCG CCGAACGGCA CCGTCGAGGC AGGGAAGGGC
GGTGCCCTGG TGTTCAAGCC GAACAGCGGC TACACGGGGA CGGTGGAGCT CAAGGCCACC
GTGTCGCCTG CGGTCCAGCT GTTCTCCTCG GACATCCCGC CGCTCACCAC CGTCGGCGGG
GTCAGCGTGG ACGCGAGCGG CTACGGCTCG TCGTGGGTGC CCGTTCCCGG CGCACCGGAC
GAGTTCTACG GCCTCACCGA TCGCGGCCCC AATGTCGATG GTCCGCAGAA GGATCAGAAG
ATCTCGGTGA CACCCGATTT CACGCCGCAG ATCGGCCGGT TCAAGCTCGA ATCCGGGGTC
GCCAGGTTGT TGAGTGTGAT CACCCTCAAG GGCCCCGACG GCCGGCCGCT CAACGGCCGC
ACTGACACAG CGGCGCCCAC GGGCGAGAAG ATCATCGACC TGGACGGCCG GGAGATCCCG
CCCACGGATC ACGGCATCGA CTCCGAGGGC CTGGTCGCGA TGCCGGACGG ATCCTTCTGG
GTGTCCGACG AATACGGCCC GTTCCTCATT CACTTCGACT CCAACGGCCA GGAGCTCGAG
CGGCTCGCGC CGGGGCGCGG CCTACCCGAG GTGCTCAAGA ACCGCACCCC GAACCAAGGC
ATGGAGGGTC TGACCCTCAC TCCCGACGGT TCCAAGCTGG TCGGAATCAT GCAGTCGGCG
CTCAACCTGC CGGGGCTGAG CGGCAATGCC AAGGAGGTAC CGGCCACCCG GATCGTGACC
GTCGACCTCA AGACCAAGGC GACACAGCAG TTCGCGTACC TGCTCGACAA CCCCAAGGAC
ACCAAGAAGG CGGTTTCCGA GATCACCGCG ATCTCCAACA CCGAGTTCCT GGTCGACGAG
CGCGACGGCA AACTCGCCCC CAAGGCCAAT AAGACGATCT ACACGATCAG TCTCGACGGT
GCCACGCCGC TCACCGAGCA GCAGAATCTG GAAACGATCG TGGGGGTCAG CAATACCGCA
GCGGCGGAGA GCGCCCTCAA AGCCGCGGGC ATCACGCCCG TCCGTAAGTC GGTGGCGCTC
GACCTGAGCG GGCTCGTCGA CAAGCTCAAT CCTCGAGGTA CCTTCTTCGG CCATGACAAG
GTCGAGGGTC TGACCACCGT CGATGGCGGA AAGACCCTGT ACATCGCCAA CGACAGCGAT
TTCGGCCTGG CCGGTATCGC CGGCCCGAAG GTGCCCTTCC AGCTCAAGCC GAAGATGCTC
GCGAACGGCC TGCAGGACAG CCTGGAAGTG CTCCGCGTCG ACACGGCTCG GTTGAACGAG
GCGACCGCCA CCCGGACGAT CAAGGTCACC GTCAGCTAG
 
Protein sequence
MSDRRLIVTP LAAALIGATL VIVSAGCSSG RGDDAATLDL TTGVNTPLTI SAGQLAKLAD 
GGAVVGVESS PNGTVEAGKG GALVFKPNSG YTGTVELKAT VSPAVQLFSS DIPPLTTVGG
VSVDASGYGS SWVPVPGAPD EFYGLTDRGP NVDGPQKDQK ISVTPDFTPQ IGRFKLESGV
ARLLSVITLK GPDGRPLNGR TDTAAPTGEK IIDLDGREIP PTDHGIDSEG LVAMPDGSFW
VSDEYGPFLI HFDSNGQELE RLAPGRGLPE VLKNRTPNQG MEGLTLTPDG SKLVGIMQSA
LNLPGLSGNA KEVPATRIVT VDLKTKATQQ FAYLLDNPKD TKKAVSEITA ISNTEFLVDE
RDGKLAPKAN KTIYTISLDG ATPLTEQQNL ETIVGVSNTA AAESALKAAG ITPVRKSVAL
DLSGLVDKLN PRGTFFGHDK VEGLTTVDGG KTLYIANDSD FGLAGIAGPK VPFQLKPKML
ANGLQDSLEV LRVDTARLNE ATATRTIKVT VS
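
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short sketch. It assumes the space-separated 10-base blocks shown above and uses the codon assignments of NCBI translation table 11, whose sense-codon assignments are the same as the standard code. The helper names `clean`, `gc_content`, and `translate` are hypothetical, and only the first line of the gene sequence is used as input.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (NCBI layout shared by tables 1 and 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    first_line = ("ATGTCTGATC GCCGCCTGAT CGTCACCCCG "
                  "CTCGCCGCCG CACTGATCGG TGCCACACTG")
    print(translate(first_line))  # MSDRRLIVTPLAAALIGATL
    print(f"{gc_content(first_line):.0%}")
```

The translated fragment matches the first 20 residues of the protein sequence. Running `gc_content` over the complete gene sequence would be the way to compare against the GC content field.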