Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcur_1843 |
Symbol | |
ID | 8603170 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | - |
Start bp | 2158088 |
End bp | 2159728 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003299454 |
Protein GI | 269126084 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00395513 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACCT GGGCCGAACA CGTCAGCACG GCGCTGCTGG GCACCCGGCG GCGGCCCGTC CCGGCGCTTG CGGTCCGCCT CGCTCCAGAA GACGACGGGG AGAGCGCGCC GCGCTCTTCC CACGGCGGCG CGGGCGGGAT CCCCGCGGGG GCGGATCCCG CCGGTGCGCT GCTGGAGCAG GCCGCCGTGC TGACGGTGCA GCGGCGGGCG GGGCGGCGCG CGGGGAGCGC CGCCGAGCAC GCCGTGATCG CGCCCGCCCC GGCGGAAACG CTGCCGGTGG TGCCGCCGGC GGCGGCCCGG CGGCTGGCGC AGATCCTGGC CGGCGACCGC CTGCCGCTGC TGCCGCAATG GCTGCAGGCC GCCGCCGAAC GCGGCTTCAG GGTGCCCCCG GCGCTCCTGC CCGACCTGCT GGAACGGGGC CGCGCCGACC GGTCCCTGCG CCCGGCGATC ATGCGGGCGG CCGGGCGGCG CGGGGTGTGG CTGGCGCTGC ACAACACCGA CTGGGCGTAT CTGGTGAACG AAGGGGGCGA CCTGGGCGAC GACGACCCCC GGGTGTGGCG GACCGGCACC CGCAGCCGGC GGATCGCCTA CCTGACCCGG CTGCGCGGCC GCGACCCGGG GGCGGCGCGG CGGGCGCTGG CGGACAGCTG GCGGAGCGAA CCGGCCCCCG ACCGGGCCGC GTTCCTGGCC ACTTTCGCCC GCGGCCTGTC CCCAGATGAC GAGGAGTTCC TGGAGAAGGC CCTGGACGAC CGGGCCAAGG ACGTGCGGCA AGTGGCGGCC GACCTGCTGG CCGAGCTGCC CGGCTCGGCC TACGGACGGC GGATGGCCGA ACGGGCCAAA AGCTGCGTGC GGGTCGAAGA GCACGTGGCA GAGGGACGGC GGCACGTCCG GATCGTCGTC GAGCTGCCGC ACGCCCACGA CGAGGGCATG GCCCGGGACG GCATCCCGTT CCACCCGGCG GGCTCGTTCG CCCCGGCCGG CGGGTCGGGC GCCCCGGTGG GGACCCGGGC CGGGTGGCTG CGCGAGATCC TGGCGCGCAC CCCGCTGGAG ACCTGGACCG ACCTGCTGGG GATGCCCGCC TGCGAGGTGG TGCGCCTGCC GGTGACCGGG CCGGAGGCGA AAACCGGGCG GCGCCGGGGC AAGGCGGGGG CGAAGGACGA CTCCTGGGCC CGGGATGTGC ACATCGGCTG GGTTCGGGCC GCGCTCCGCC TGCGCGATGC GCAGTGGGCG CGGGCCCTGC TCGCCGACGG CGCGGTGCCG GCCGAGGAGG CCGCGGCGCT GGCCGATCTG GTGGGGCTGC TGCCCGCCGG TGAGCGCGAG CCGATGGCCG CCGCCTTGAT CCGGCGGCTG GGGGACGCCT CCTGGGCGCT GACGGCGCTG GAACGCATCC CCGGCCCGTG GGCCGGGGAG CTGGCCGACC TGGTGATCGA GCTGCTGGTG GCCGCGGCGC AGGAGGAGGA GCGCCGCCGC GGCGGGCGGG CCGGTCACCG GCTGGCGCCG CTGTGCAAGC TGGCCGGCAC CCACCTGGCC CCCGAGGTCG CTCCCCGGCT GGCGGCGCTG GGCCTCCCCG GCTCCTGGCC GGTCCAAGAA CTGATCGACA GCCTGCGATT CCGCCACCAG ATGCTGCGGG AGCTCGCCTG A
|
Protein sequence | MSTWAEHVST ALLGTRRRPV PALAVRLAPE DDGESAPRSS HGGAGGIPAG ADPAGALLEQ AAVLTVQRRA GRRAGSAAEH AVIAPAPAET LPVVPPAAAR RLAQILAGDR LPLLPQWLQA AAERGFRVPP ALLPDLLERG RADRSLRPAI MRAAGRRGVW LALHNTDWAY LVNEGGDLGD DDPRVWRTGT RSRRIAYLTR LRGRDPGAAR RALADSWRSE PAPDRAAFLA TFARGLSPDD EEFLEKALDD RAKDVRQVAA DLLAELPGSA YGRRMAERAK SCVRVEEHVA EGRRHVRIVV ELPHAHDEGM ARDGIPFHPA GSFAPAGGSG APVGTRAGWL REILARTPLE TWTDLLGMPA CEVVRLPVTG PEAKTGRRRG KAGAKDDSWA RDVHIGWVRA ALRLRDAQWA RALLADGAVP AEEAAALADL VGLLPAGERE PMAAALIRRL GDASWALTAL ERIPGPWAGE LADLVIELLV AAAQEEERRR GGRAGHRLAP LCKLAGTHLA PEVAPRLAAL GLPGSWPVQE LIDSLRFRHQ MLRELA
|
| |