Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_1001 |
Symbol | |
ID | 9155141 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 1025531 |
End bp | 1026814 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003645973 |
Protein GI | 296138730 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.628257 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGACCA CGGGAGCGGC GGGACTCAGC CGCCGGAGCG TGCTGAAAGG CGCACTCGCT GCGTCGACGT TGGCCGCACT CGCGCCGACC CTCGCATCAT GCGGTCGCTC GGACTCACCA CGCCCCCTCC ACCTGGGGTC GCGGAACTCG GACTCGAAGG CCAAAGCAGG TATGGCGGCG CTCGTGGGCG CATTCACCGA ACGTACGTCG ATTCCGGTAT CTGTGAACGT GGTCGATACG GTCTCGTTCC AGGAGAACCT CAACAACTAT CTCCAGGGTG CGCCGGACGA CGTGTTCACC TGGATGTCGG GTTACCGGAT GCGCTACATC GCCAATAAGA AGCTGGTCTC TCCGTTGGAT GCGGTCTGGC CCGCCATCGA CCGTGGCTTC GACAGTTCGT TCAAGGGCGC GGCGACCGAC AACGGTCACG CCTACCTCGT GCCCCTGACG TACTACCCGT GGGCGATCTA CTACCGCAAG AGTGTGTGGC AACGCTATGG CTACCAGCCG CCGCGCACCC ACGCCGACTT CATCGATCTG TGCAAGCGGA TGAAGGCGGA CGGCATCGTT CCGCTGGGAT TCGCCGCGCG GCAGGGGTGG ACCACGTTCG GCATGTTCGA CTACCTCAAT CTGCGGATCA ACGGTCCCGA TTTCCACCGC AGCCTGCTGG CCGGCGAGAT CTCCTGGACC GACAACCGGG TCTACGCGAC CTTCGACGCC TGGCGCGAAT TCCTACCGTT CCAGCAGGAA CAACCGCTGG GGCGCACCAT CGCGGAGGCG CACACCGCAC TGCTCAACCG CAAGGTGGGA ATGATGGTCA CGGGATTGTT CGTTTCGGAG CAGTTCCCCG CGGGCCCCGA CCTCGACGAC CTGGACTTCT GCCCGTTCCC GGAGTTCGAC AGCGCCATCG GCGCCGGAGC GGTCGAGGCA CCCCTCGACG GACTCATGAT GAGTGCGAAC CCCCGGAACC GGGAGGCCGC CACGCAGTTC CTCGAATACG CCGCCTCGGC GGAAGCGGGC ATCAAGTACA GCAGCGGGCT CGCGATGTCG ATTCCTGCTC ACAAGGACAT CTCGCTCGCC GACTACTCGC CGCTCATCCA GAAGGCAGCC GCGCTGGTGA AGAACAGCAA TTCGATCACG CAGTTCCTCG ACCGGGACAC ACGACCGGAC TTCTCATCGA TCGTGATCAT CCCTTCATTG CAACAGTTCA TCCGGGCCCC ACAGGATCTC CGTTCGGTGC TCGCGAGCAT CGAGAAGCAG AAGAAGGCGG TGTTCGACCG ATGA
|
Protein sequence | METTGAAGLS RRSVLKGALA ASTLAALAPT LASCGRSDSP RPLHLGSRNS DSKAKAGMAA LVGAFTERTS IPVSVNVVDT VSFQENLNNY LQGAPDDVFT WMSGYRMRYI ANKKLVSPLD AVWPAIDRGF DSSFKGAATD NGHAYLVPLT YYPWAIYYRK SVWQRYGYQP PRTHADFIDL CKRMKADGIV PLGFAARQGW TTFGMFDYLN LRINGPDFHR SLLAGEISWT DNRVYATFDA WREFLPFQQE QPLGRTIAEA HTALLNRKVG MMVTGLFVSE QFPAGPDLDD LDFCPFPEFD SAIGAGAVEA PLDGLMMSAN PRNREAATQF LEYAASAEAG IKYSSGLAMS IPAHKDISLA DYSPLIQKAA ALVKNSNSIT QFLDRDTRPD FSSIVIIPSL QQFIRAPQDL RSVLASIEKQ KKAVFDR
|
| |