Gene Tpau_1199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpau_1199 
Symbol 
ID9155339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTsukamurella paurometabola DSM 20162 
KingdomBacteria 
Replicon accessionNC_014158 
Strand
Start bp1224809 
End bp1226260 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content67% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003646169 
Protein GI296138926 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.420226 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCACCG GACCGAGATC AGGGTTCGCC CCTGGGCGTA CCGCGAAGCT GGCTGCGGCC 
GGAATCGCCG CCACCATGGG GGTGACGATC CTGGCGGGCT GCGGGGCCAA GGAGGACGGC
GTCACCCTCA ACGTGTACGC GCCCGCCGAC GGCGCCACCC TCGTCAAAGA GGTGGCCGGA
GACTGTTCGA CCAGCGGGTA CACCGTGGTC GGCCACGCCC TTCCCAAGAG CGCCGACGAT
CAGCGCCTGC AACTGGCCCG TCGGGTGACC GGTAACGACC GCACCATGGA CCTGATGGGG
CTGGACGTGA ACTGGACCGC GGAGTTCGCG GAGGCGGGCT GGATTCTCCC GCTGCCCGAG
AATCTGACCC GGACGGCGGA GAAGACCGTG CTCGCGGGCC CGCTCAAGAC CGCCATGTGG
CAGGACAAGC AGTACGCGGC GCCGGCCTGG ACCAACACCC AACTGCTCTG GTACCGCAAG
GACGCGCTGG AAAAGGTGCT CGGCCGCAAG ATCGGCCCCG GTGTCCCCAA GCTCACCTGG
GACCAGGTGG TGCAATACGC GGAGAAATCC GGTCAACTCG GTGGCCCGAC ACAGATCGAG
GCCCAGGCCG CGCAGTACGA GGGCGTGGTG GTGTGGTTCA ACTCGCTGCT CGAAAGCGCC
GGTGGCCGGA TGGTGGCCGA CGACGGAAAG ACGGTGACGC TCACCGACAC CCCGGAGCAC
CGCGCCGCCA CGGTCAAGGC GCTCTCGATC ATGAAGGCCG TGGCCACTGC GCCCGGCCGC
GATCCATCGT TCACCCAGCT CAAAGAGGGC GAGTCGCGCC TGGCGATGGA GTCGGGCAAG
GCGATCTTCC AAGTCAACTG GCCCTTCGTT TTCGCGGGCG TCAAGCAGAA CGCCGCGGCG
GGCTCGGTGC CGTTCCTGCC CGAGCTCACC AAGTACGACG CGTTGCTCAA TCCTCCCAAG
GACGAGAAGA ATCCGCCCGA GCCGACGGTC GCGCAGCTGG GCGAGATCAA CAACCTGACC
CGGCAGAAAT TCGACTTCGC CCCCTTCCCA TCGGTGATCC CCGGTAAGCC CGCGAAGACC
ACCGTGGGCG GCATCAATTT CGCCGTCTCG AAGACCACCC GGTACGAGAA GCAGGCCTTC
GAGGCGCTCG CCTGCCTCAC GAACGAGGCC GCCGAGCGGA AGTACGCCGT CAAGGGCGGT
ACCCCACCGG TCCTGCCGAA GCTCTATGAC GATCCCGAGT TCCGCAAGGC CTACCCGATG
GCCACCCTGA TCCGGGACCA ATTGCAGGAC AACACCGCCG CGGTGCGGCC GATCACACCC
CAGTATCAGG CGATGTCCAC GCTGCTCCAG GCCACGCTCG CCCCGGTGGG GGCGTGGGAT
CCCGAACAGC TCGCGGACCG GCTCGCTGAT GCGGCTGAGA AGGCCATGAA TGGAAAGGGC
CTGGTGCCAT GA
 
Protein sequence
MSTGPRSGFA PGRTAKLAAA GIAATMGVTI LAGCGAKEDG VTLNVYAPAD GATLVKEVAG 
DCSTSGYTVV GHALPKSADD QRLQLARRVT GNDRTMDLMG LDVNWTAEFA EAGWILPLPE
NLTRTAEKTV LAGPLKTAMW QDKQYAAPAW TNTQLLWYRK DALEKVLGRK IGPGVPKLTW
DQVVQYAEKS GQLGGPTQIE AQAAQYEGVV VWFNSLLESA GGRMVADDGK TVTLTDTPEH
RAATVKALSI MKAVATAPGR DPSFTQLKEG ESRLAMESGK AIFQVNWPFV FAGVKQNAAA
GSVPFLPELT KYDALLNPPK DEKNPPEPTV AQLGEINNLT RQKFDFAPFP SVIPGKPAKT
TVGGINFAVS KTTRYEKQAF EALACLTNEA AERKYAVKGG TPPVLPKLYD DPEFRKAYPM
ATLIRDQLQD NTAAVRPITP QYQAMSTLLQ ATLAPVGAWD PEQLADRLAD AAEKAMNGKG
LVP