Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_1589 |
Symbol | |
ID | 9155739 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 1660655 |
End bp | 1662751 |
Gene Length | 2097 bp |
Protein Length | 698 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003646549 |
Protein GI | 296139306 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.242692 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACCACGC CGCGCGATGA AGAACTGTTT AGCTTCAGCT GGCTTGCACT AAAACTTCTT GGACGGGGGC TCTATTCGAA TCCATGGAGC GCATTGTCCG AGCTCGTAGC CAACGGTCTA GACGCAAACG CTGAAACTGT CTATGTATAC ATCGATGCAT CCATCAAATC CAACGCAACG GTGGAAGTCA TTGACGATGG CGACGGTATG AGCAGGGACG AGATCGGCTT ATACGCCCAG GTCGGCAGAA ACAAGCGGAA CGACGAACCG GAAAACAACT CAGCAAAGAA TCCACCCAAA GGAAGAAAGG GAATCGGCAA ACTCGCAGCG TTGTACCTAT CTAACCACTT TTTCCTTCAC ACTCGACAAG TTGATAGCGC TACAAGTTGG GAACTCGACG CCCGAGATGA AAGGGTTTCG GAGGATGAAC ATCCGCGCCT TGTGGCCACG ACCGCAGGTC CCGAAACTCC AAACGAAACA CTTTGGAATT CCCTGGCTTC CGGGACCCGA ATCACCCTAC AGAATGTCGA TCTCACCGGG TACGGCGAAC AATCCATTAC CGCCCTTGGT TCAAGGCTCG CAAATCAATT TCTCCTTCCG GAAAGCAACT CAGGCGGGAC CATTCATATT TGGGTACGCA AGCGAGCCGA TGAAACAAAC CCCAGATACG GCCCCGTCGA AAAAACCGTC GCGTACAAGA ATCTTGTTGA GGTGTCGCAG AATTTCGATC CTGAAGATGT CGCTGACGCC CAGCCTTCTG ACTTTCAGCC GTCGGGCAGG TCTGTGCGAA TCCCGGCAAA GGGAGCTCCC GGCGGAGAAA TTTTGAAACC CGCCAAATTT ACAAAGTTTC AACATGACAC CCTTGAAGAC GAAGCATGGA AGGAAATTGA ATCAAGAGTC GATCGCAAAC GAAAGACTTA CGACAACGTC CCATTTGAGC TCACGGGCTG GATTGGTGTT CACGCCACCA TCGACAGTGC AGCTGCACGG GAGAACGACG CACGGTTTGT AAAAAATCGG TACTACAACC CCGCTCAGAT TCGCGTGTAC GTCCGTGGAA AGCTTGCAAG CGACCGTCTC CTGTCTCAGC TCGGACTAAC GGGGACATAC ACCAATTACA TTGAAGGTGA AATTAGCTTC GACATCCTAG ATGAAGACAC CTTGCCGGAT ATCGCGACAT CAAATCGACA AGACTTCGAT GAGACTGATG GGCGAGTCAC GTTGCTCAAG GCTTTGGTCC GACCAATAGT CCGCAGATTG ATGCTCAGCC GTCAGGAAAT TGCAACTGAG ATTGCTCGAG CTGTCACGGC CGAGAAGGAA CGTAGAGACA CTTCAAGCAA GCAGCAGTTC TCACATGAAG TCCAGGAAGA GCTCGCGCAA CGAACAGAAC TATCAGACAC CAGTCGCGCG GAGCTTCACA TGGTCATCAC CAATAAGATC CAGGGAGATG TCTCACCCAA GCAGAAGTAT CGAGTATTTA TTTCACATGC GCGGAAGGAT CGTGCGTTCG CGACAATGAT AGACGAGCTT TTGCAGCTAA AAGGCGCAAA AAAAGACGAA ATATTCTTCA CGTCACGTCC AGGCGATATT GAGTACGCCC TCGATGACCG CGCCCTGAGT ATGATTATCA AGGATAGCAT CACCCACTCA AATACGTTGA TCTTCTATCT GACGAGCAAG AATTTTCTTG CGAGTCAATA CTGCCTGTTC GAAGGTGGCG CGGGCTGGGC AACCCGATCG ATTGGCGACT ATCTCAAGCT CAACGTGGAT TACAAATCCA TTCCAGCCTT TCTCACGAAC GGGCGGTCCG AGGTGACCGT CCTGAATGGT ACCAGCATCG AGCTCACGGT AGATCTTCAT AACTACTTTA TCCGCGGAAT ACTCAATCCG ATGATCTCTC ACTTGAACCG AGGGCGAGAG ATCACGGGGG AGCCCCTGAT TTCGGAGTTC GAAATTCCGT CGGTGCCAAC TGAATTTGAA CTAAAGAAAC AGGGAAGAAC AGCCGCCGAC TTTTTCAACG AAGAGATCTC GGAATATTGG CAAATTCTCG TCGAAGATAC AATCGACGAG TACCTATCAG ATTACCCTAA GCCCTAA
|
Protein sequence | MTTPRDEELF SFSWLALKLL GRGLYSNPWS ALSELVANGL DANAETVYVY IDASIKSNAT VEVIDDGDGM SRDEIGLYAQ VGRNKRNDEP ENNSAKNPPK GRKGIGKLAA LYLSNHFFLH TRQVDSATSW ELDARDERVS EDEHPRLVAT TAGPETPNET LWNSLASGTR ITLQNVDLTG YGEQSITALG SRLANQFLLP ESNSGGTIHI WVRKRADETN PRYGPVEKTV AYKNLVEVSQ NFDPEDVADA QPSDFQPSGR SVRIPAKGAP GGEILKPAKF TKFQHDTLED EAWKEIESRV DRKRKTYDNV PFELTGWIGV HATIDSAAAR ENDARFVKNR YYNPAQIRVY VRGKLASDRL LSQLGLTGTY TNYIEGEISF DILDEDTLPD IATSNRQDFD ETDGRVTLLK ALVRPIVRRL MLSRQEIATE IARAVTAEKE RRDTSSKQQF SHEVQEELAQ RTELSDTSRA ELHMVITNKI QGDVSPKQKY RVFISHARKD RAFATMIDEL LQLKGAKKDE IFFTSRPGDI EYALDDRALS MIIKDSITHS NTLIFYLTSK NFLASQYCLF EGGAGWATRS IGDYLKLNVD YKSIPAFLTN GRSEVTVLNG TSIELTVDLH NYFIRGILNP MISHLNRGRE ITGEPLISEF EIPSVPTEFE LKKQGRTAAD FFNEEISEYW QILVEDTIDE YLSDYPKP
|
| |