Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcur_1872 |
Symbol | |
ID | 8603199 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | - |
Start bp | 2195587 |
End bp | 2198187 |
Gene Length | 2601 bp |
Protein Length | 866 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003299483 |
Protein GI | 269126113 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0047456 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGTTTG ACGACGACTA CAAGCGGGAG GTGCTGGAGC CCGCGCGGGA GGCGGGCGAT CAGCCCCCGG AGGACCTGCG GGTGCGCTAC CGGCTGCGCG AGCCGCTGGT GCCCGCCCAG GTCGCCGAGC AGGTCCGGCT GGTGCGCCAG TGCTGGCGGC GCGCCCGGGG GCAGCTGAAG TACCGCAAGC TGATCGACCG GCTGGAGGCC GAGCACCGGC AGCTGGCGCC GCTGTTCGCC GCCGCCGAGC GGGGCGATGT GCGCCCGCTG GCCGAGCGGC TGCGCGGCAG CCGGGAACGC AGCGCGCAGC GGCTGGCCGA CGCCCGCGCC CGCCTGACCG ACGCGGCCGG CGAGGTGCGG ATGGTCACCC CGGCCGAGGT GGAGGAGCTG GCGCGGGCGG CCGGCATCGC CCCCGGTGAA CTGGAGTCGG TGGCCAAGGT CGAACGGATC GAGATCCGCG ACCCGGACCG GCTGCCGGCC TCCCCGCCGT ACGCCGGCTA CCGCAAGGTG CGCGAGTCGC TGGATGTGCT GGGCCACCGG CATCTGGCCG AGTTCCTGTT CGGGGACGCG CTGGGCGGGC CGATGCGGGT GCTGGAGGGG TTCGCCGCCC CCGGCGTGCC GCCCGGCCCG CAGGCGCTGG CGCAGGCCGT GCAGCAGGCG GCCGAGCAGT GGGCGCGGCG CGCCCGCGAC AGCAGCAGCA CCCACGCCGG CACGGTGCTG GTGGCGCTGC GGGAGGCCCC ACCGGACGCG CTGATCTGCT ACGACCTGAC CGAGCGGCTG CGGGAACGGC ACCGCCAGCG GGCCTCCCAG GGCGCCCTGC TGCGGCACGC GGTGCAGGAC CTGGGGATCG AGCCGGCCGA TGCGCGCCGC CTGGTGTTCG CGGTGCTGCG CGAGGACGGC CCCGGCGGCG GCGTGGCGGC GCGGCTGCGG GCGCTGCTGG ACGCCGGGGA GGTGTACGCG GCGGCGCTGC TGGGCGAGAA ACTGGCCGGC TCCGAGCTGC CGGAGGAGGC CGAGCTGCTG GCCGAGGAGG CCCGGCAGCG GGTGGCGGCG GCCGTGCGGC TGCGGGAGGC GGCCACCGCC GAGACCGACG CCGACCGGGC CTGGCGGATG CTGGCCGACG CGCTGGCGCT GGTGCGCGAC CTGCCGGGCG CCGCCGAGCA CCAGCGGCGC CTGCCGCCCC GTCCGGTGCC GCGCCTGCGG GCGGTCGCCG AGGGGACGGG GGTGCGGCTG GACTGGGCGC CCAGCCCGTC CGCGGTGGGC GAGATCACCT ACCGGGTGGT GCGATGCCAA GGCCGCCCGC CGGGCGGGGA CGGCGACGGG GAGACGGTGG CCGCGGTCGC AGAGGCCACC GGGGCGTTCG ACGCGGACCC GCCGGTCAAC GTGCCGCTGT ACTACGGGGT GGTGGCCCGG CGCGGCGCCG CGGACGCGCC GATCACCTGC GCGGATCCGG TCGTGGTGCG CCCGGAGGTG CAGTCGCTGG AGCTGGTGGC CGGCGACGGG GTGGTGACCG GCCGCTGGAT CGCCCCGCCG GGCGCCGCGC GGATCGTGGT GCTGCGGGAG GGCCGGCCGG TCGCGGCCGA ACGCGACGGC TTCCGCGAGC AGGTGCCCAA CGGGGTTCCC TGCCACTACC GGATCGCCGC CGTCTACCTG GACGGCGAGG GCCGGGAGAC GATGACGCCG GGGGTGACGG CCTCGGTCAC CCCGAACGCC CCGCCCGAGC CGGTGCGCGA GTGGACGGTG GAGACCGACC CGGCGGACCC GGCGCGGACC CTGCTGTGCT TTCCCCACCC GCCCGGCGGC ACCGTAGAGA TCCTGATGCT GGAGGCGCCG CCGCCCTGGC CGGTGGGCAC GCTGCTGCCG GCGGCCAAAG CGCTGCAGGC GGGCCGCAGA GTGCCGGCGG CGCCCACCTC CCGGGGGCTG ATGCTGCGCC CGGACGGCGG CGGGTGGCTG CTGGCGGTCA CGGTGTCGGG GGACCTGGCG GCGATCGGCG CCTGCCACCG GCACGTCAAC CTGCCTCCGC CGGCCGCGCT GGTGGCCGAG CGGCGCGGGG AGAGGGTGCA CGTCGGGTTC GACTGGCCCG AGGAGGTGGC GGAGGTGGAG CTGACCTACC GGGTCGGCGC GTCCGCCCCC CGCGAGGAAC GGCTGACGGT CACCCGCGCC GCCTATGAGT CCGGCGGCGG GATCCACCTG CCGGTGCCGG CGGACCAGCC GGTCACGGTG GCGGTCGCGG CGGCCGGGAT GCGGCAGGGC GCGCGGGTGG TGGGACCGGC GGCGCAGACC ACGCTGCCGG CCCGCCGGCG GGTCCGCTAC GACCTGCGGC GCTCGGGGCC GCCGTGGCGG CGGTCGCTGA CGGTCCGGCT GTCGGCGCCG CACCCGCTGC AGGTGGCGCG CCTGACCCTG GTGCACCGGG ACGGCCAGGT GGAGCCGCAG CGGCCCGAGG ACGGCCGGGT GCTGGGGACC TGGGAGCAGG TGCCGGTCCC CGGTGAGCTT TCGGTGCCGG CGCCCGGCGG ATCGGGGCCA TACTGGCTTC GCTGTTTCGC CGACGATGAG TCCATCGAAC TGATCGACCC CCCGGTCCGC AGCCGCCAGA CCCTGAGGTG A
|
Protein sequence | MGFDDDYKRE VLEPAREAGD QPPEDLRVRY RLREPLVPAQ VAEQVRLVRQ CWRRARGQLK YRKLIDRLEA EHRQLAPLFA AAERGDVRPL AERLRGSRER SAQRLADARA RLTDAAGEVR MVTPAEVEEL ARAAGIAPGE LESVAKVERI EIRDPDRLPA SPPYAGYRKV RESLDVLGHR HLAEFLFGDA LGGPMRVLEG FAAPGVPPGP QALAQAVQQA AEQWARRARD SSSTHAGTVL VALREAPPDA LICYDLTERL RERHRQRASQ GALLRHAVQD LGIEPADARR LVFAVLREDG PGGGVAARLR ALLDAGEVYA AALLGEKLAG SELPEEAELL AEEARQRVAA AVRLREAATA ETDADRAWRM LADALALVRD LPGAAEHQRR LPPRPVPRLR AVAEGTGVRL DWAPSPSAVG EITYRVVRCQ GRPPGGDGDG ETVAAVAEAT GAFDADPPVN VPLYYGVVAR RGAADAPITC ADPVVVRPEV QSLELVAGDG VVTGRWIAPP GAARIVVLRE GRPVAAERDG FREQVPNGVP CHYRIAAVYL DGEGRETMTP GVTASVTPNA PPEPVREWTV ETDPADPART LLCFPHPPGG TVEILMLEAP PPWPVGTLLP AAKALQAGRR VPAAPTSRGL MLRPDGGGWL LAVTVSGDLA AIGACHRHVN LPPPAALVAE RRGERVHVGF DWPEEVAEVE LTYRVGASAP REERLTVTRA AYESGGGIHL PVPADQPVTV AVAAAGMRQG ARVVGPAAQT TLPARRRVRY DLRRSGPPWR RSLTVRLSAP HPLQVARLTL VHRDGQVEPQ RPEDGRVLGT WEQVPVPGEL SVPAPGGSGP YWLRCFADDE SIELIDPPVR SRQTLR
|
| |