Gene Tpau_3915 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpau_3915 
Symbol 
ID9158096 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTsukamurella paurometabola DSM 20162 
KingdomBacteria 
Replicon accessionNC_014158 
Strand
Start bp4030907 
End bp4032538 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content63% 
IMG OID 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003648826 
Protein GI296141583 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGAGAA GACGAAGTAG GTCGACGGCG ACGCGGGCGA TCCTCGCGAT CGCGGTGGCC 
GCGAGCCTGG CGGTATCGGG CTGCAGCATG CAGGGGGCGC GTTCGCAGAA CACGGTGGGC
CCCGACGCGA TCATCCGGGT GAACGGCGGC GAACCGCAGA ACGGGTTGAT CCCGTCGGAC
ACCAGCGAGA ACATGGGCGG CCGCGTGGTC GACTCCCTAT TCACCGGTCT GTACAGCTAC
AAGGCCGACG GCACGCCCGA GTTGGCGAAC GCCGAGTCGG TCGAGACCAC CGACAACAAG
CGCTTCCTGG TCAAGCTCAA GCGGGACTGG AAGTTCACCG ACGGCACGCC GGTCAAGGCG
GAGAACTACG TGCGTGCCTG GAACTTCGGT GCCGCAGTGG GCAATCTGCA GAAGCAGCAG
AGCTTCTACG CTCCGATCGC GGGTTTCGAC GAGGTGTCCG AGAAGGGTGC CACGAAGACC
GAGATGCGCG GCCTGCAGGT GATCGACGAC TACACCTTCT CGATCGAACT GGCGGCGCCG
AACATCGACT TCAAGCTTGC GTTGGGGTTC ACACCGTTCG TTCCGCTCCC CGATGTCTTC
TTCACCGAGG GTAAGGAGAA GTTCGGCCAG AACCCGGTGG GCAACGGGCC GTACAAACTC
AAGCAGTGGC GGCACAACGT GCAGCTCGAG GTGGTGCGGT ACGAGGACTA CAAGGGTCCG
AAGCCGAAGA ACGGCGGGCT CACCTTCATC ATGTACGAGT CCTACGATCC CGCGTACATC
GACCTCACGT CCGGGAATCT CGACGCGCTC GACAACATCC CGAACAGCGC GCTGCGCTCG
TTCCAGAAGA CGCTGGGCAA GAAGGCGATC ATCAAGTCGA CCGCGCAGAC CCAGAACTTC
GTGATCCCGC AGTTCCTGGA GCACTTCGGC AGCGACGAGG AGGGCCGCCT GCGCCGGCAG
GCGATCTCCA TGTCGTTCGA CCGGCAGCAG ATCATCGATG TGGTGTTCCA GGGCTTCCGG
AATCCCGCGC TCGAGTTCAC CGCTCGCTCG ATCCCCGGGT GGGACGGAAA CATCCCCGGC
AATGGCAACG TCAAGTACAA CCCGGAGCTG GCCAAGCAGC GCTGGGCGCA GGCGAACGCG
ATCAAGCCGT GGACCGGATC GTTCACCATC GCCTACAACT CCGACGGCGA TCACAAGCAG
TGGATCGATG CGGTGACCAA CCAGATCAAG AACACGTTGG GCATCGATGC GGCGGGCAAG
CCCTACGCGA CGTTCAAGCA GATCCGCGAT GAGCTCACCA AGAAGACGAT CAAGTCCGCG
GGGCGCAGCG GTTGGCAGGG CGACTACCCG ACGCAGCTCA ACTTCTTGGA GTCCAACTAC
CTCACTGGCG CTGGGTCGAA TGACGGCGAC TACAGCAATC CCGCGTTCGA CGCCAAGATC
GCCGAGGCGC AGCAGGCGCT CGACCCGGTG CAGTCGACCA GGCTCGTCAA CGAGGCACAG
GCGATCCTTC TCAACGACTT GCCCGTGGTC CCGCTGTGGG ACTACAAGGC CGCGGCCGGC
GTCGGCGACG GTGTCAAGGG TGATCTCACC TGGAACGGCC GCTTCGACTT CACCAACATC
ACGAAGGAGT AG
 
Protein sequence
MVRRRSRSTA TRAILAIAVA ASLAVSGCSM QGARSQNTVG PDAIIRVNGG EPQNGLIPSD 
TSENMGGRVV DSLFTGLYSY KADGTPELAN AESVETTDNK RFLVKLKRDW KFTDGTPVKA
ENYVRAWNFG AAVGNLQKQQ SFYAPIAGFD EVSEKGATKT EMRGLQVIDD YTFSIELAAP
NIDFKLALGF TPFVPLPDVF FTEGKEKFGQ NPVGNGPYKL KQWRHNVQLE VVRYEDYKGP
KPKNGGLTFI MYESYDPAYI DLTSGNLDAL DNIPNSALRS FQKTLGKKAI IKSTAQTQNF
VIPQFLEHFG SDEEGRLRRQ AISMSFDRQQ IIDVVFQGFR NPALEFTARS IPGWDGNIPG
NGNVKYNPEL AKQRWAQANA IKPWTGSFTI AYNSDGDHKQ WIDAVTNQIK NTLGIDAAGK
PYATFKQIRD ELTKKTIKSA GRSGWQGDYP TQLNFLESNY LTGAGSNDGD YSNPAFDAKI
AEAQQALDPV QSTRLVNEAQ AILLNDLPVV PLWDYKAAAG VGDGVKGDLT WNGRFDFTNI
TKE