Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_2904 |
Symbol | |
ID | 9157072 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 3012133 |
End bp | 3014550 |
Gene Length | 2418 bp |
Protein Length | 805 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003647841 |
Protein GI | 296140598 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.094756 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGATCG AATTCGAGCG TCCATCCACC CCCGCGCCCG AGAATCGCGC AGCCCGCCGC CGCTCCCGGG TGCCCCGCCG CCGCTCGGCA CGCCGTCACA CGGCGGCATG GCGCGGCGGA TTCGCCGCCG TCGCCGCGAC CGCCACGCTG ATGGGCACCG CAGCCGCCCC GATGGCCCTC GCGGCCGACA CCGCCGGCCA ACCCTTCGAC ATCACCATCG GCGGCACGAG TCTGCGCGAC GTGGGCATCG ACATCACCAA GCTCAACCTC TCCAATCCCG ACCTCAGCGG GATCGACTTC CAGAAGGTGC TCGGGCTCCT CGACAATCCA GCGATCTCGA AGATCGTCGA GTCATGCCGC ACCGGCGCTC TCGGTAGCGG TGGTGGCGGC TCGAACGTGG ACTGCGCGGG CGCCACCGGC ACCGGCGCCG CGGTCGTGCT CCCCGATCGA CTCGACCTCG CCGCGATCGC GGATCAGGTG ACGATCGACC TCGGCCCGCG GGTCGGAGCG CTCGGTGTCT ACGTGCCGAT GGGAGTGCAG ACACCGTTCG GGGTACTCAG TGTTCTCGCC GTGCCGCCGA CCGGCCTGGT CAGCCTCGGT CTCGGAGCAC TCGGGATCAA GGACATCGAT CCGACCGCCA TGGGCAAGTA CAAGACGCTC GACGACGTCA AGAACGGCGC CGCACTTCCG GTGAACCTGG TCCAGGAACG CTACTGCGAC AAGATCACCT GCATCTTCGG CGGCTGGAAG TACCGCGACG TGGATTCGAA CCTGGGCGCC CGGACCGAAG CGCTGAAGAT CGCCGGACAG CTGAGCGGCC GCACCTATGA CTTCACCAAG CCGACGGTCC TGGATCCGCA GATCCTGGTG CGGGGCACAT CGACGATTCT CGGCGACGGC TTCACCTTCG CGTTCTCCCG GAACGCGGGA ACAGCGTTGG CGGAGACCAA GAACAAGCTG GCCCTGGCAC TGGCCGGCGC GGACGGCGCC GACGCCGCGT CGAAGGCCTA TGCGAACCTC GGCCTGGCGC TCGCGCTGAA CATGAATACC AGCACGATGG GGCTTGACTG GTTCGGGTCG CCTCTGGCGT TCGACAAGAT CAAGGACTCG CAGGTCATCG ACAAGGTGCT CAACCTCGCG AAGACCTTCA ACATGTTGCC CGCAGGCATG AATCTCGATG CCGCCACGAT CAACAACGCG ATGGACACCA TCCAGAAGCT CAAGCTCCCC GACGTCAAGG AGGTCAGCTG CCTGGGCGTG GGAACATCGG CCTCGGCGTC GGGCCTCGGC GAATGCGGCA ACTACCTCGG CACGTTCGAC TACTACAAGG ACCTCCGCAA GACCGTCGAC GGCCAGAGCC GCCAGACGCA ATGGGGGCTC ACGGACCCCA CGTCGCTGGT GCTCGGCCAG GGCAACGCGC TCAGCCAGGT ACTCACGCCC GGCGTGCTCC AACTTCTCAC CGCCGTCGCC GGTGGCGATC TCAGCAAGGT GCCCTTCAAC GAGCTCACCG GCCTGCTGGA GAATCCGTTC GTCAAGGACT CGATCGCCGC GCTGCTCTCC GAGGAGAAGC GGCTCAAGCT GACCAAGGAC TTCGTCCGCT TCACCAAGGA CGTCAAGACC ACCGAGACCT CCCATCCGGT GGTCGACGAG AACGGGGTCC CGGTGCTGGG CCCGGACGGC AAGCCACAAG TCCAGACCTC GTCCACATCG AAGACCAGTT ACCTGCTCAC CAGTGACTAC GGCCTGCGGT CGCCGATCAC CGTGGACTGG TTGGGCTACC GACTGACGGT GTTCCCCACC GCCGAGGTGA ACGGCACCGT GCGCCCGAAC TACCTCGGCC TGCCCACCAT CACCAAGATC GCCGACGGCA CCCCGAGCCT GCTCCCGCGC ATCGGCCTGA TCGAGCTGGA CAACCCGTTC GGCCTGGGTA CTCTGCCGAT CCTGCCCTTC GATCCGCTCG GCGCCTTCAG CTCGTGGACC AAGAGCATCA CGCTCAAGGA CGATGTGAAC CGCATCCGCG AGGTGCTGCC CGCGGTCACG GCGGCGCTGC CGAAGACCCC GGCCCCCAAG ACCGAGACCG ACGGTGCCGT GCCGGTCACA GCCGTGGCCG CCGGAGCCTC CGCGTCGGAT GTGACGGCCG TTGCCTCCGA GCCGCGCGCG GGACGCTCGC CTCGATCGGT GGCCGAGGCT CCGGCTGAGG CGACGACGGC GCCGACGGCC GAGACGAGTA CGACGTCGAC CCCGCAGCCG GAGAAGCCCA CGAAGGAGAC CACGTCGCAG ACCCCGACGA CGGAGTCGAA GCCCGAGACC ACGCCCGCGC CGACGACCGA GTCCGCGACC GCCGCCGACG CGTCGTCGAC GACGACGGAT AAGGCAACGT CCGCCGACGC CGCGAGCACG GAGAAGGGCG CCGCATAG
|
Protein sequence | MSIEFERPST PAPENRAARR RSRVPRRRSA RRHTAAWRGG FAAVAATATL MGTAAAPMAL AADTAGQPFD ITIGGTSLRD VGIDITKLNL SNPDLSGIDF QKVLGLLDNP AISKIVESCR TGALGSGGGG SNVDCAGATG TGAAVVLPDR LDLAAIADQV TIDLGPRVGA LGVYVPMGVQ TPFGVLSVLA VPPTGLVSLG LGALGIKDID PTAMGKYKTL DDVKNGAALP VNLVQERYCD KITCIFGGWK YRDVDSNLGA RTEALKIAGQ LSGRTYDFTK PTVLDPQILV RGTSTILGDG FTFAFSRNAG TALAETKNKL ALALAGADGA DAASKAYANL GLALALNMNT STMGLDWFGS PLAFDKIKDS QVIDKVLNLA KTFNMLPAGM NLDAATINNA MDTIQKLKLP DVKEVSCLGV GTSASASGLG ECGNYLGTFD YYKDLRKTVD GQSRQTQWGL TDPTSLVLGQ GNALSQVLTP GVLQLLTAVA GGDLSKVPFN ELTGLLENPF VKDSIAALLS EEKRLKLTKD FVRFTKDVKT TETSHPVVDE NGVPVLGPDG KPQVQTSSTS KTSYLLTSDY GLRSPITVDW LGYRLTVFPT AEVNGTVRPN YLGLPTITKI ADGTPSLLPR IGLIELDNPF GLGTLPILPF DPLGAFSSWT KSITLKDDVN RIREVLPAVT AALPKTPAPK TETDGAVPVT AVAAGASASD VTAVASEPRA GRSPRSVAEA PAEATTAPTA ETSTTSTPQP EKPTKETTSQ TPTTESKPET TPAPTTESAT AADASSTTTD KATSADAAST EKGAA
|
| |