Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_2330 |
Symbol | |
ID | 9156486 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 2424692 |
End bp | 2425798 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003647276 |
Protein GI | 296140033 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.489785 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTACGC ATAGATCAGG ATCCGGTTCT CGCGGCCTCG CACGCTGGGT GATCGCGGCG ATCGCGGCGG TCCTGGTGGT GATCGCGGTG GCCGGGGCGA TGCTCTGGTT GCTGGGCCGT TCCGAGCAGG AGGGCCGCGA TGCCGCCGCG ACCTGCGTCG AGGGCGATCT CCAGCTGAAG GTGGCCGCCG CGCCCGCGCT GGTCGAATCC CTGCGCCGCG TTGCCGACGG ATTCAACTCG TCGGGAACGG TCTCGAACGA CTACTGCCCG CGGGCGGAGG TCACGGGTGT CGACTCCCCC GTCGCACTGT CCGCGCTGGC GGGGACGTGG GATCCGAAAC TCGGTCCGGC ACCGGCGGTG TGGATCCCGG AGAGCAGCAT CTGGACGGCA CGCCTGGCCG CCGCGAAGCC GGCCGAACTG TCGGGGCAGC CCACGTCCAT CGCATCATCG CCGGGTGTCC TGGCGGTGCG TGGTTCAGCA CGGCAGGCCT TCGACGGAGT CCGCTGGGTC GACGTGCCCG CACGCCAAGC CGATCTCGGG ATCTCGTTGC CCACCGCCGG TTCCGGCGCC GACGGCACCT ATCTGGCGGC CCAGTCCGTG GCTGCCGCTG TTGCCCGCAC GGGTGGCGCC GCGATCGACG AGGAGGCCGC GCGCGGCCCG CTGGTGACCG GCACGTTGAA CCGCTGGGCG AGCGCGGCAC CGAAGACGGC GAACGCCACC GCGGCGCTGG AGGGCCTGAT GGTGCCGTCG GATTCGCTGC GCGCCGTCCC GGTCACCGAG CAGCAGTTGT ACGCCTTCGC CCGCGGTCGG GGCGAGACCG CTCCCGTGGC GGTGTATCCG GCGGGGCCGA CCGCCGCAGC CACCTATCCG GCCGCCGTGC TCGATCGCGA GGGCGTCACG GAGGCGCAGC GTCGCGCTGC GTCCGACTTC GTCGCCTACA TCGGCAAGGG CGAGAACGCG AAGCCGCTCG CGGAGGCCGG TTTCCGGGTC GCGGGTCAAC CCACGCCGGA CAAGACCTCG TCGGTGTCCT TCGGCACCGT CCAGCCACTC GCACCCGCGG CGAACCCAGC GGTCATGGCG ATCGCCGACG CGATCACCCC CAAGTGA
|
Protein sequence | MGTHRSGSGS RGLARWVIAA IAAVLVVIAV AGAMLWLLGR SEQEGRDAAA TCVEGDLQLK VAAAPALVES LRRVADGFNS SGTVSNDYCP RAEVTGVDSP VALSALAGTW DPKLGPAPAV WIPESSIWTA RLAAAKPAEL SGQPTSIASS PGVLAVRGSA RQAFDGVRWV DVPARQADLG ISLPTAGSGA DGTYLAAQSV AAAVARTGGA AIDEEAARGP LVTGTLNRWA SAAPKTANAT AALEGLMVPS DSLRAVPVTE QQLYAFARGR GETAPVAVYP AGPTAAATYP AAVLDREGVT EAQRRAASDF VAYIGKGENA KPLAEAGFRV AGQPTPDKTS SVSFGTVQPL APAANPAVMA IADAITPK
|
| |