Gene Tpau_3527 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpau_3527 
Symbol 
ID9157706 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTsukamurella paurometabola DSM 20162 
KingdomBacteria 
Replicon accessionNC_014158 
Strand
Start bp3639335 
End bp3640645 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content67% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003648445 
Protein GI296141202 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGACC TCAGCAGACG CGGATTCCTC GCGACCAGCC TCGCCGCCAC GGCGGCCTTC 
GCCGCCGCCT GTGCGGGTTC CGGCGGGTCG GGCGGTGGCG GCAACTCGGG TGGCAGCGGC
GGCGGTATCA AGCTGCAGTT CCTCACCAAC CATCCGGGCA GCTCGAAGGC GATCGAGCAG
AGCATCATTG ACGAGTTCCA GAAGGCCAAC TCCGGCATCA CCGTCGAGCT GCTCGACGGC
GGCAAGGACT ACGAGGAGGT GGCGCAGAAG TTCCAGACCT CGCTGACCGG CGGCACGAAG
CCCGACATCA TCGTGGTCTC CGACGTCACC TGGTTCAACT TCGCGCTGAA CAAGCAGATC
GAGCCGCTCG ACGGCCTGTT CGCCGGTGCC GGCCTCAACC CTGCCGACTA CGTTGACTCC
CTGCTGGCCG ACGGCAAGTT CGACGGCAAG TACTACACCA TCCCGTTCGC CCGCTCGACG
CCGCTGTTCT ACTACAACAA GGACGTGTGG AAGAAGGCCG GCCTCGAGGA CCGCGGCCCG
AAGGACTGGG ACGAGTTCGT GGCCTGGGCG CCGCGCATCC AGGAGGCCAT CGGCGGCGAT
AAGAAGGCCA TCGTGCTGGC CGACTCGGCG AACTACATCG ACTGGGTCTT CGAGGGCTGG
AACTGGTCCA AGGGCGGTGC CTACTCGGAC GGCTGGGACC TGAAGTTCAC CACGCCCGAA
TCCGTTGCTG CCGCGCAGCA GCTCAAGGAC GTGATCGGCA AGTGGGGCCG GCTGACCAGC
AAGCCGGAGA ACGACTTCGG TGCCGGCCTC GCCGGCGTCA CCCTGCAGTC GACGGGCTCC
CTGAAGACGA TCACCACCAC CGCGAAGTTC GAGGTGGGCA CCGCCTTCCT GCCCGGCCCG
CAGGGCAAGT CCTGCCCGAC GGGCGGCGCC GGCGTGGCGA TCGCCGCGGG CATCTCCGAC
GACCGCAAGG CCGCCGCGAT GAAGTTCATC GAGTTCCTCA CCAATGCCAA GAACTCGTCC
ACCTTCTCGC AGGGCACCGG CTACATGCCG GTGCGCAAGT CCGCGGTCGA CGATCCGTCG
ATGAAGGAGT TCATCGCGAA GAACCCGAAT TTCGGGACCG CGGTCAAGCA GCTCCCGTTC
ACCCGCAGCC AGGACAACGC CCGGGTGTTC GTGCCGGGCG GTGGTCGCGA CATCGGGCAG
GCGTTGCAGC AGATCGCGAC CGGAGGTGAC CCGGCGGCGG TGCTCGGCGC GCTGCAGAGC
ACGATCCAGG GCAAGATCGA CTCGCAGATC ACGCCGAAGC TGCCGAAGTA G
 
Protein sequence
MADLSRRGFL ATSLAATAAF AAACAGSGGS GGGGNSGGSG GGIKLQFLTN HPGSSKAIEQ 
SIIDEFQKAN SGITVELLDG GKDYEEVAQK FQTSLTGGTK PDIIVVSDVT WFNFALNKQI
EPLDGLFAGA GLNPADYVDS LLADGKFDGK YYTIPFARST PLFYYNKDVW KKAGLEDRGP
KDWDEFVAWA PRIQEAIGGD KKAIVLADSA NYIDWVFEGW NWSKGGAYSD GWDLKFTTPE
SVAAAQQLKD VIGKWGRLTS KPENDFGAGL AGVTLQSTGS LKTITTTAKF EVGTAFLPGP
QGKSCPTGGA GVAIAAGISD DRKAAAMKFI EFLTNAKNSS TFSQGTGYMP VRKSAVDDPS
MKEFIAKNPN FGTAVKQLPF TRSQDNARVF VPGGGRDIGQ ALQQIATGGD PAAVLGALQS
TIQGKIDSQI TPKLPK