Gene Information |
Locus tag | Tpet_1000 |
Symbol | |
ID | 5171712 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga petrophila RKU-1 |
Kingdom | Bacteria |
Replicon accession | NC_009486 |
Strand | - |
Start bp | 1030951 |
End bp | 1032696 |
Gene Length | 1746 bp |
Protein Length | 581 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640563518 |
Product | hypothetical protein |
Protein accession | YP_001244594 |
Protein GI | 148270134 |
COG category | |
COG ID | |
TIGRFAM ID | |
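
Note: the length fields above agree with the coordinates. The short sketch below checks this, assuming that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon. Neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1030951
end_bp = 1032696

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)      # 1746
print(protein_length)   # 581
```

Under those assumptions the result matches the listed Gene Length (1746 bp) and Protein Length (581 aa).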
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00918654 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAAAT GGATGAAAAA GTGGCAAGGG GTTATTATCT GGGCAGTCGC CATTGCGTTC GTGGCGGGTA TGATCTGGTG GTCGGTGTCG ATCAACCTCA GAAACACCCA GAACAACGTT AAATACACCC TCGAACAGAG TCTCGCTTAC ATCACAAAAG ACGGAACGGC TTTGAACGAT TCCACCTATT GGCTCATGCC GTGGGAAGTG AACGACTATT ACTCCAACCT GTTGAGTTCT TATCAGATCA TATCTCTAGA TCCACTTTTT GAGGAACCCA GACTCAAAGC TCTGATAGCG GACGTCTTTC TCCAGCAGAA AGTGGTTCTT TACTATGCGG AAAAGAACGA CATAAAGCCT TCCAAGAAAG AAATCAACCA GGAAGTGAAC AACGTTATCC AAACAATAAA GAACGATCAA AATCAGCTCA ACCGGATAGA GAGAACGTAC GGAAGTCTTT CGAACTACGA GAAGAACTAC CTGGAGCCTC AAATTCGAGT TCAGCTCACC ATCAAGAAGG TTCAGGAAAA AGTGGGAGTT GTCACCGAGG ATGAAATAAA GAAATATTTC GAGGAAAACA AGGAAGACCT TCAGAAGCAG TACGACAGAG TTGACATAGA AGCTGTGTCG TTCGACAGCA GTTCCACGGC TCAGGGGTTC ATTGCAAAAG CAAGTGAGGT GGGTTTCGAT GAAGCAGCTT CCAGTATGAA TGTTACTGTT CAACCTTTTT CCAACGCTAC AAGGGGGATT TTCCCGGATG AAATAGACAC GGCTCTGTTC AGTGCCACTC CAGGTTCGAT TGTTGGACCG TTCTTTTTCC TCGATCAGTG GTACGTGTTC AGGGTGAAGA CTTCCTCGGT TCTCACAGAT TTCAACGCCT TCGAGAACAG CGACGCGTAC AGTGATGTGA AAACGAAGCT GGAACAGGAA AAATTCCAGA AGTGGTTGGA AGAGTTCATG AAGGAAGAAA ATCTATCGTA CGCGTTCAAC GATCAGGTTC TTGAATACTG GTGGAAGTAC TTCAAGAACG AAGAAGACCT CTACGGAAAA CTGGCGAACC TTCTCTTCCA GGGAGAAAAC CTGGTAACAG AAACATCTGA TGAGCTGAAA TCTCTCTTCG TCCTTCTCTC CGACAGCAAG ATCCAGGAAC TCACCAAACA GATAGCCGAA TTGACTCAGT ACAGAACAGC GCTTGAAAAT TCTCAGGAAC CAGATGAGGA TCTTATAAAG AAGTACGGGA AACTTTCGAT CGAGGAAGTC GACGCAAGAA AAGAAGAGCT TGAAAAGAAA AAAGCAGAGT TGGAAAACAA AAAAAGAGCA GTTGTGGATT ATCTTTACGA GAACTACCCG TCTTCCACGT ATGTACTGGA ATACGCGTAC CGCTTGCACC CCAACGACAT AAACATCAAG TACAACTATT ACGCCAATCT TTACAATCAA ATCAAACCTT ATCTGTCAAC TGGAACCTAC GATCCGAACC AGATATTTGG GGTTCTTCTC GGTCTCTACA CAGTGGCCAA TGCAACGGAC GCTTCTACCA GCATACGTCT CGACTCTTAT TACATGCTCT ACGATATGAG CCTTGCACTG AACGATCCCA CGTCTGCGAA GTATTACCTT GATGAGATGA AAAAAATCGA TCCAAATTTC ATGGATTACG AGTCTGCCTA CAATCAGGTG GAATCAATTC TCGAAGCCAT GAAAGCATCA GAAGAATCTA CACCTTCTAC ATCGACAGGA GAATGA
|
Protein sequence | MRKWMKKWQG VIIWAVAIAF VAGMIWWSVS INLRNTQNNV KYTLEQSLAY ITKDGTALND STYWLMPWEV NDYYSNLLSS YQIISLDPLF EEPRLKALIA DVFLQQKVVL YYAEKNDIKP SKKEINQEVN NVIQTIKNDQ NQLNRIERTY GSLSNYEKNY LEPQIRVQLT IKKVQEKVGV VTEDEIKKYF EENKEDLQKQ YDRVDIEAVS FDSSSTAQGF IAKASEVGFD EAASSMNVTV QPFSNATRGI FPDEIDTALF SATPGSIVGP FFFLDQWYVF RVKTSSVLTD FNAFENSDAY SDVKTKLEQE KFQKWLEEFM KEENLSYAFN DQVLEYWWKY FKNEEDLYGK LANLLFQGEN LVTETSDELK SLFVLLSDSK IQELTKQIAE LTQYRTALEN SQEPDEDLIK KYGKLSIEEV DARKEELEKK KAELENKKRA VVDYLYENYP SSTYVLEYAY RLHPNDINIK YNYYANLYNQ IKPYLSTGTY DPNQIFGVLL GLYTVANATD ASTSIRLDSY YMLYDMSLAL NDPTSAKYYL DEMKKIDPNF MDYESAYNQV ESILEAMKAS EESTPSTSTG E
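
Note: the protein sequence can be related back to the gene sequence by codon-by-codon translation. The sketch below is a minimal illustration, assuming that the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand) and that the amino-acid assignments of translation table 11 follow the usual NCBI ordering over the bases T, C, A, G. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and not part of any database tool.

```python
# Amino acids for the 64 codons in NCBI order (bases T, C, A, G);
# translation table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the record's sequences into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA string; '*' marks a stop codon."""
    dna = clean(dna)
    return "".join(
        CODON_TABLE[dna[pos:pos + 3]] for pos in range(0, len(dna) - 2, 3)
    )


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks and last nine bases of the gene sequence above.
head = "ATGAGAAAAT GGATGAAAAA GTGGCAAGGG"
tail = "GGAGAATGA"

print(translate(head))  # MRKWMKKWQG
print(translate(tail))  # GE*
```

The translated head matches the first ten residues of the protein sequence, and the tail ends in a stop codon after the final G and E. Passing the full gene sequence to `gc_fraction` would give a value to compare against the listed GC content.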