Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TK90_1074 |
Symbol | |
ID | 8806833 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | + |
Start bp | 1133734 |
End bp | 1135707 |
Gene Length | 1974 bp |
Protein Length | 657 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | peptidase M48 Ste24p |
Protein accession | YP_003460322 |
Protein GI | 289208256 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.959288 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTTCT TTGAGCATCA GGACCGTGCC CAGCGGCGCA CGCGCTGGCT GCTGGGCCTG TTCATGCTGG CCGTGCTGGC GATTGTGGTG GTGATGAACC TGATCGCGCT GGTGCTGTTC GGCCAGGCGC AGGCGACCGC ACCGGGCGAA CCCTGGCTCA CCCGGGAGTT TCTTCGCGAC AACCTGGACG TGGTCGGCTG GACCAGTGCG TTCACCGTGG GGCTGATTGG CCTGGGCAGT CTCTATCGCA TGCTGAGCCT GCGCGACGGC GGTGGCGCGG TGGCGCGCGA GCTCGGGGGT ACGCGAATCG AGGGGGATAC CCGGGACCCG TTGCGTCGAC GCCTGATGAA CGTGGTGGAA GAGGTCGCGA TTGCCTCGGG CGTGCCGGTG CCCGAGGTCT ACGTGCTGGA GCAGGAGCCG GGGATCAATG CCTTCGCAGC CGGGTATTCG CCTTCGGATG CGGCGGTTGC GGTGACGCGC GGTGCGCTGG AACACCTGAA CCGAGACGAG TTGCAGGGCG TGGTCGCCCA CGAGTTCGGA CACATCCTCA ATGGCGACAT GCGCATGAAC ATCCGCCTGG TCGGGATGCT GTTCGGGATT CTGGTGATGG CGCTGATCGG GCAGCGCGTG CTTTTGGCCA TGCGCTTTTC ACGTAACAAC CGCAACGCCG GCGGGATCGT AGTCGCCGGC CTGGTGCTGA TGATCGTCGG TTACATCGGC CTGTTCTTCG GGCGCTGGAT TCGGGCGTCG GTATCACGTC AGCGCGAATA CCTGGCGGAC GCCTCGGCCG TGCAGTTCAC GCGCCAGCCG GAGGGCATTG CAGGGGCACT GAAGAAGATC GGGGCCAGCT ATTCCGGCCT CAACGCGGAC AGCGAAGAGG TCGGCCACAT GCTGTTCGTC AATGGCGGGC TGGGGCGGAT GTTCGCGACG CATCCGCCAC TGGAGGATCG CATCCGCAAG ATCGAGCCGG GTTTCGACCC CTCGGAGCTC GAGGCCGTGC AAAAACGGAT GCAGGAAGAT GCGCGGCGCC GGCGCGAGGA GCAGGAGGCC GAAGCCGCGC GCGAACAGGC CGAGCCGAAG GCCGGGGGTG GGCTGGGGCT ACACCCGGAG GCCCTGCTCG AGGCGATTGG CGACCCTGGC CTTGCGCATA TCCTCGGGGC GGGATTCTTG GTCGCCGCGG TGCCCAGCGC CCTGGAACGC TCGGTGCATT CCGACCGCTG GGCCGCCGAG GCCATGCTGT ATCTGCTTAT TTCGTCCGAT TCCGAAGTGC GCGATGCCCA GCTGATGCGC ATCCTCGAGA CGCGCGGCGA AGACAGCGAA AAGCGGGTAC GTGAACTGCT TCAGGAAGTG CCGGAACTCA GCCCGGATCT GCGCATCCCG CTGCTGGAGA TGGCCTTCCC CGTGCTGCGC CGACACCCGC CACAGGAACG TGAAGTCCTG TCGCGTCTGG TCGACGAGTT GATTCAGGCC GATGAACGGG TATCGGCCTT CGAGTATGCG CTCGGGCGGC TCGTCCGGCG CCAGCTGCAG GATATCGAAC GGCCCTCGCA GGCCCGTTCG GGAGGGCGCC GTCGGCTGGC CGACAACGGC GGGCAGGCGC GCTACGTGCT CGCGGTTCTG GCCCATCACG GCCATCCGGA TGATCCCGAT GCGGCGGTTA CAGCGTTCAA TGCCGGTGTG GCGGCGCTCA CGGGCATCAT GCCGCTGGAG TCGGAAGCCC CGCCGGAGCT CGAGCAGGCT ACTGCCGGCA CTGCCTGGGC ATCCGTGCTG GACAAGGCCC TGGAGCGGCT CGACCAGCTG CGGGTCAAGG ACAAGCGGAC ACTGGTGGAG GCGATGCTGG CGACGGTTCA GCGCGGCGGT GGTGTTCAGG TGGCCGAGAT CGAGCTGTTG CGCGCCATGG CCGCTGCATT GCACATCCCG CTGCCCATCG CCGAGATGGC GGACGGCTCG CAGGGGAGCG AGTCGCACGA CTGA
|
Protein sequence | MDFFEHQDRA QRRTRWLLGL FMLAVLAIVV VMNLIALVLF GQAQATAPGE PWLTREFLRD NLDVVGWTSA FTVGLIGLGS LYRMLSLRDG GGAVARELGG TRIEGDTRDP LRRRLMNVVE EVAIASGVPV PEVYVLEQEP GINAFAAGYS PSDAAVAVTR GALEHLNRDE LQGVVAHEFG HILNGDMRMN IRLVGMLFGI LVMALIGQRV LLAMRFSRNN RNAGGIVVAG LVLMIVGYIG LFFGRWIRAS VSRQREYLAD ASAVQFTRQP EGIAGALKKI GASYSGLNAD SEEVGHMLFV NGGLGRMFAT HPPLEDRIRK IEPGFDPSEL EAVQKRMQED ARRRREEQEA EAAREQAEPK AGGGLGLHPE ALLEAIGDPG LAHILGAGFL VAAVPSALER SVHSDRWAAE AMLYLLISSD SEVRDAQLMR ILETRGEDSE KRVRELLQEV PELSPDLRIP LLEMAFPVLR RHPPQEREVL SRLVDELIQA DERVSAFEYA LGRLVRRQLQ DIERPSQARS GGRRRLADNG GQARYVLAVL AHHGHPDDPD AAVTAFNAGV AALTGIMPLE SEAPPELEQA TAGTAWASVL DKALERLDQL RVKDKRTLVE AMLATVQRGG GVQVAEIELL RAMAAALHIP LPIAEMADGS QGSESHD
|
| |