Gene Information |
Locus tag | TK90_1688 |
Symbol | |
ID | 8807460 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | - |
Start bp | 1805140 |
End bp | 1807680 |
Gene Length | 2541 bp |
Protein Length | 846 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | protein of unknown function DUF214 |
Protein accession | YP_003460919 |
Protein GI | 289208853 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
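The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon is an untranslated stop codon. Neither assumption is stated in the record, but both are consistent with its values. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1805140
end_bp = 1807680

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# One codon per three bases. Assumption: the last codon is a stop codon
# and does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the Gene Length (2541 bp) and Protein Length (846 aa) rows, and the zero remainder agrees with an unspliced CDS.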
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.121796 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCTCGC TGCTGGTAAA GGCCTGGCTG CGCGACTGGC GCAGGCGCCC GCTGCAGCGG CTGCTGACGC TGGTGGGCGT GGTGATCGCC GTGGCCGTGG TGCTGGCGGT GGATCTGGCC AACCAGTCGG CGCAGCGCGC GTTCGACCGT TCGATGGACG AGGTGGCGGG CACGGCCACG CACCAGATTC TGGGGCCGGC CCGGGGTTTC GACGAGACAC ATTACCGTGA TCTGCGGCTG GCCGGCTGGC GCGATACCGC GCCGATCGTG GAGGGACCCG CGCGGCTGCC GGACGGGCGC ACGATCTATG TGCTGGGCGT CGATCCGCTG GCCGAGGGCC CGTTCCGCGG GGCGGCCTCG GATATCGGGG ATGCCCCGGT GCAGCGGCTG GTGACCGAGC CGGGCACGAT CCTGCCGCCG AGTGGCTGGT GGTTGGGCTC CGGGATGGCG CATGAGGACC GCCTGGCGCT CGAGACCGCC GCCGGGCGCT TCGAGGTCAC GCTGCTCGAA CGCGAGGCGG AGCGTGACGA ACCGGCGGCC GAGTCAGACG ACGAGGGTGG CTCGCTGTGG GTCACGGATA TCGCCACCGC ACAGACGTTG CTCGGCCGGC CGGGACGGCT CGATCGTGTC GACGTACGCC TCGATGACGC CGACATCGGC GCCTTCGCGG AGCGATTGCC GAATGGTCTG GAACTCGTGC CAGTGGCGGC GCGCGACCAG GCCTCGCGCG AGCTGGCGCA GGCCTTCCGC ATCAACCTGA CGGCGATGAG CCTGCTGGCG CTGCTGATTG CAGCCTTCCT GATCTACAAC ACCCAGACCT TTGCGGTGCT GCGCCGACGC GAGGTGCTGG CGAGTCTGCG CCTGGTGGGG GCCGGTCGGC GGGCGCTGTT CGGCGTGGTG ATGCTGGAGG CCTTGCTGGT CGGTGTGATC GGCACGCTGC TGGGGGTGCT GGTCGGGATC GGTCTGGCGC AGGTGCTGGT CGGCCAGGTC GCGCAGACGG TGACGGATCA TTTCTTCGTG GTGGCGGTGA CCGAGGTGAC GCCGCGCCCG CTGGCGCTGC TGGTGGTAAT CGCGCTGGGG GTAGGGCTGT CGCTGCTGGC GGCGCTGGCC CCGGCCTGGG AGGCGGCACG GGCGGCCAGC CCCGGCAGCC TTGGACGCGG GCGGCTGGAA CGCCGGGCGC GCCGGGGCCT GCGCGGGCTG GCCGTGGCGG CGGTCGTTCT GCTCGCGGCG GGTGGCGTGA TGCTGCTGCT CGGGGCCGGA GGGCTGGTGG GCAGCTTTGT CGCGCTGTTC CTGCTGATCA CTGGCTTTGT GCTGCTGCTC CCGCTGGTGC TGGCCGGAGT CCTGCGCGGT CTGGCGCGGC TCGGTCGCGG GCAGACGCTC CCGCGCATCG GGGTGCGCAG TCTCGAGCGG AGCCTGTCGC GCAGCGCACC GGCGACCTCC GCGCTGACGC TGGCCCTGGC GGCCGCGATC AGCGTGTCGG TGATGGTCGG GAGCTTCCGC GACAGCGTGG AGCAGTGGCT GGGGCAGGCG CTTACCTCGG ATGTGTACGT GGTGCCGGTG GCGCCGGCCG CGGCGCGCAG TGGGGCGCGC CTCCCCGCGG ACTGGGGGGA GACGCTCGCC CGGATCGCGG AGGCGGAGTC GGTCAGCACC GGCTCCAGCC AGGACACGCC GAGCGATGTC GGGATGATCG ATCTCTACGT GGTCGAGCCG CACCCGGTCT GGCTGGAGTA CCTGCCGCTG CTGGAGGCCG AGGCCACGGG CGAAGCACTC GACCGGCGCC TGCGAGAGAC CGATGCGGTA 
CTGGTCACCG AGCCCTTTGC GGCCCATCAC GACGTGGGTG TGGGCGACCG GCTGGAGATC CGCGTGGATG AGGGGCGGCG GGTGTTCGAG GTGGCGGGCG TGTATCGCGA TTACGGCAGT CCGCAGGGCA CGGTGCTGAT GCTGGGCGAC CGGTTCTACG CGGGCACGCC CGGCGCCGAG GAGGTGGGGT CGTTTGGACT GCGCCTGCCC GAGGGCGCGG ATGCCGACGC GGTGATCGAG CGCTTGGAGA CTTGGCTGGC CGAGCGCGAG ATGGCGGCCA TGGTCACGCG CCCCGGCGAT ATCGAGCAGG AGTCGATGGC GATCTTCGAC CGTACCTTTG CGATCACGCA TCTGCTGCGG GTGATCACGC TGCTGGTCGC GTTCATCGCG ATCCTCGGGG CGCTGATGGC CCTGCAGATG GAACGCGCGC GTGAATTCGC CACGCTGCGC AGCCTGGGCC TTCTGCCCGG CGGGGTGCGC GGACTGGTGG TGTTTCAGGG CGCGGTGCTG GGTGCTTTCG CCGCGCTGGC CGCGATACCA CTGGGTCTTG GCATGGGCTG GCTGCTGATC GAGGTGATCA ACCGCGAGGC GTTCGGCTGG GGCATGGACC TGCGCTGGCC TGGTCGCGAG ATTGCCGAGA CCGTTGCCCT GGGCGTGGGT GCGGCGCTGC TGGCGGCGCT GATCCCGGCC TGGCGCATGG CGCGCATGCG GCTGGTGCCC GCGTTAAGGG AGGTGCCCTG A
|
Protein sequence | MSSLLVKAWL RDWRRRPLQR LLTLVGVVIA VAVVLAVDLA NQSAQRAFDR SMDEVAGTAT HQILGPARGF DETHYRDLRL AGWRDTAPIV EGPARLPDGR TIYVLGVDPL AEGPFRGAAS DIGDAPVQRL VTEPGTILPP SGWWLGSGMA HEDRLALETA AGRFEVTLLE REAERDEPAA ESDDEGGSLW VTDIATAQTL LGRPGRLDRV DVRLDDADIG AFAERLPNGL ELVPVAARDQ ASRELAQAFR INLTAMSLLA LLIAAFLIYN TQTFAVLRRR EVLASLRLVG AGRRALFGVV MLEALLVGVI GTLLGVLVGI GLAQVLVGQV AQTVTDHFFV VAVTEVTPRP LALLVVIALG VGLSLLAALA PAWEAARAAS PGSLGRGRLE RRARRGLRGL AVAAVVLLAA GGVMLLLGAG GLVGSFVALF LLITGFVLLL PLVLAGVLRG LARLGRGQTL PRIGVRSLER SLSRSAPATS ALTLALAAAI SVSVMVGSFR DSVEQWLGQA LTSDVYVVPV APAAARSGAR LPADWGETLA RIAEAESVST GSSQDTPSDV GMIDLYVVEP HPVWLEYLPL LEAEATGEAL DRRLRETDAV LVTEPFAAHH DVGVGDRLEI RVDEGRRVFE VAGVYRDYGS PQGTVLMLGD RFYAGTPGAE EVGSFGLRLP EGADADAVIE RLETWLAERE MAAMVTRPGD IEQESMAIFD RTFAITHLLR VITLLVAFIA ILGALMALQM ERAREFATLR SLGLLPGGVR GLVVFQGAVL GAFAALAAIP LGLGMGWLLI EVINREAFGW GMDLRWPGRE IAETVALGVG AALLAALIPA WRMARMRLVP ALREVP
|
| |
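A figure such as the GC content row can in principle be recomputed from the Gene sequence field. The following is a minimal sketch, not the database's own method: `gc_percent` is a hypothetical helper, and the example input is only the first two 10-base blocks of the sequence, so its result is not expected to match the 72% reported for the full gene.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C among the A/C/G/T bases in seq.

    Spaces and line breaks (as used in the record's 10-base blocks)
    are ignored, as is any character other than A, C, G or T.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# First two blocks of the Gene sequence field, spacing kept as in the record.
fragment = "GTGAGCTCGC TGCTGGTAAA"
fragment_gc = gc_percent(fragment)
print(f"{fragment_gc:.1f}% GC over {len(fragment.replace(' ', ''))} bases")
```

To check the reported value, pass the complete Gene sequence field (both lines joined) to `gc_percent` instead of the fragment.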