Gene Tpau_1639 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpau_1639 
Symbol 
ID9155789 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTsukamurella paurometabola DSM 20162 
KingdomBacteria 
Replicon accessionNC_014158 
Strand
Start bp1712084 
End bp1713475 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content64% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003646599 
Protein GI296139356 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.102178 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAGGA CGATGAACAA GACGTTTTCT CGATTCGTGA CAGCGGTCGC CGCGGTATCC 
GTGGTGACCT CCACTGCAGG GTGCGGTTCC GGTGGCACGG CGCCCTCGAA CGTGATCAAT
TACTGGCTGT GGGATAACAA TCAACAACCC GTCTATCAAA AGTGCGCTGA CGCTTTCGAA
GCGGCGCACC CGGGCCGGAA AGTCGCGATC ACCCAGTACG GCTGGAGCAG CTACTTCACG
AAGCTCGCAT CGGGCTTCAT CGCCGACACC GGACCGGACG TCTTCACCGA TCACGTGTCC
AAGCTCGGAC AGCACCTCGA CCTCGAGGTG CTCCAGCCCC TGGACGAACT CGCCGCCACC
AAGGGGATCA AGGACGAGGA CTACCTTCCT CAGCTCGCCG CGCTGTGGAA GGGGCCCGAC
GGTCGGCGGT ACGGCGTGCC CAAGGACTGG GACACCGTCG CGTACTTCTA CAACAAGGAC
GCCACCGCCG CCGCAGGCGT GACCGACGCC GAGCTCCAGA GCATGACCTG GAACCCCGAC
GACGGAGGCA CACTCGAGAA GGTCCTCGCC CGGCTCACCG TCGATGAGAA GGGTGTACGG
GGTGACCAGC CCGGCTTCGA CAAGACGCGA GTGAAATCGT ACGGTCTCGC GGGGACGGAT
TCGGGCTACG GCGGATTCGG GCAATCGCAG TGGTCGCCGT ACACCGGTTC GATCGGGTGG
AACTTCACCG ACAAGAATCC CTGGGGCGCG AGGTTCAACT TCGATGACCC CAAGGTCCAG
AAGACGATCG ACTGGTACTT CGGGTTGGCG AAGAAGGGGT TCATGGCCCC CTTCGCGGTG
GCCGGTAACA ACACGTCCGG GATCGGTGCC GACAAGCAGA TGAGCGCGGG CAACGCCGCC
ATGGCGCTCG CCGGATCCTT CATGATCTCG TCCTACTTCA AGCTCGTCGA TCCGCAGGGC
AAGCCCCTCC CGATCGGTTT GGCGCCCACG CCCGTCGGGC CGTCCGGGAA ACGGGCGTCG
ATGTTCAACG GACTCGCGGA CGTCGTCTCG AAGCAGTCGA AGAATCCGGA GCTTGCGGGG
GAGTGGGTGG CGTTCTTGGG AAGTGACGCG TGCCAGGACA TCGTCGGGGA CTCCGGTGCG
GTTTTCCCCG CCCGGCCCAA CGGTATGACC ATCGCGAAGC AGCGCCAGGC GGCGGCCGGT
GTGGACATCA CCCCGTTCAC GATGCATGTG GACGACGGCA CGACATTCAC GATTCCGGTG
ACCACCGATG CCGCCGACAT CGTCCCGCTC ATGCAGTCTG CGTTCGACCC GATCTATCTG
GGATCGGCGT CGGGATCGTC GCTGTCCACG CTGAATCGGC AGATGAACAG GCTGCTCGAG
AGCAACAGCT GA
 
Protein sequence
MSRTMNKTFS RFVTAVAAVS VVTSTAGCGS GGTAPSNVIN YWLWDNNQQP VYQKCADAFE 
AAHPGRKVAI TQYGWSSYFT KLASGFIADT GPDVFTDHVS KLGQHLDLEV LQPLDELAAT
KGIKDEDYLP QLAALWKGPD GRRYGVPKDW DTVAYFYNKD ATAAAGVTDA ELQSMTWNPD
DGGTLEKVLA RLTVDEKGVR GDQPGFDKTR VKSYGLAGTD SGYGGFGQSQ WSPYTGSIGW
NFTDKNPWGA RFNFDDPKVQ KTIDWYFGLA KKGFMAPFAV AGNNTSGIGA DKQMSAGNAA
MALAGSFMIS SYFKLVDPQG KPLPIGLAPT PVGPSGKRAS MFNGLADVVS KQSKNPELAG
EWVAFLGSDA CQDIVGDSGA VFPARPNGMT IAKQRQAAAG VDITPFTMHV DDGTTFTIPV
TTDAADIVPL MQSAFDPIYL GSASGSSLST LNRQMNRLLE SNS