Gene Information |
Locus tag | TK90_2434 |
Symbol | |
ID | 8808215 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | + |
Start bp | 2554905 |
End bp | 2556233 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003461660 |
Protein GI | 289209594 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
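The coordinates and lengths in the table above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2554905, 2556233)
print(length)                  # 1329
print(protein_length(length))  # 442
```

Both values match the Gene Length and Protein Length fields reported for TK90_2434.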
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000516263 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCACA ACACACACCT GCTCCGTCCC GCCCTGGGCG GACTGACCGC CCTGGCACTC GCCGTGCCGA CCGCCGCCAC CGCCCAGCAG TGGGAGACGC GCGCGGGAGA CCGCAACTGG GGGCTGGATA TCTCATTGAT CCTGGAAGGG GTCTACTACA ACGAGATGTC CCGCGGGATC AGCGACCCAG CCGGCTTCGA TGACGGCCAT GACCATGGGC ACGACCACGG AAACGGGCAT GACCACGGGT TCGACGACGG CTTCAACCTC GGGCACAGCG AGCTGGTGTT CGAGGCCAAC CTCGGCGATC TGTTCGATGG CGTGCTGATG ATCGGTTTCG ACGAGGACCA TGTTGAAGTC GAAGAGGCCT ACCTGACCAC CCGAGCCCTG CCCGGCGGCT TCCAGGTCAA GGCAGGTAAA TTCCTGTCGG ACATCGGCTA CATCAACAGC CGCCACGTCC ATGACTGGGA CTTCGTCGAC CGCCCGCTGG TCAACGAATA TCTGTTCGGC GACCACGGAC TGCAGGAAAC TGGCGTTCAG GCCACCTGGC TCGCTCCGAC CGAGACCTAC ATCCAGCTCG GGGTGGAACT GCTTCAGGGC GACAACAGCT TTATCCCCTA TCAGGGCGGC GGCGGCACGC ATACCACCAA CGCCGGGGAC GACATTCGAT CCTCGTTCTC GGATGTCTCG GGACCCCGGC TCGTTACCGC ATTCGCCAAG TTCGGCCCGG ATCTGGGACC GGATCACGCC ATGCAGTTTG GCGTCTCCGG CGGCTACGCA AACACCTGGC AGGAAGAGAC CGAGCATGGC TCCGGCCTGG AGGAATGGGA TGGCGATGCC TGGTTCGCCG GGCTGGACGC CGTCTACAAG TACAGCTCGG GCCGCGCCTA CGGGCACGGC GACTGGCGGC TGCAGGGCGA GTACTTCTAT CGCGAGATCG ACGTCGACTA CCAGCGTCGC GAAGACCAGA CCACCATCGC CGGCAACGCC CAGGGTAGCG GCAAGGCGCG CCAGGATGGT CTCTATATCC AGGGGGTGTA CGGCTTCGCC CCACGCTGGG AGGCCGGCCT GCGTTCCGAG GCCCTGGGCC TGACCAATCG CGGGCTGCCC ACCAACGAAG GCAGCGAGCG CAACTTTGCG GAGCCCTCTT TTGGCACGAG CTACCGCCAC TCCGCCCAGG TCACCTTCCG CCCCGTGGAG CCGGTATTCC TGCGCGCGCA GCTGTCGCAG AACGACTTCG TCGACGAGGA CGGTGATCGC GACCGCGGGC TGGAATTCAT GCTGCAGCTG AACGTGGCCC TCGGCGCGCA CGGCGCCCAC CGCTTCTGA
|
Protein sequence | MPHNTHLLRP ALGGLTALAL AVPTAATAQQ WETRAGDRNW GLDISLILEG VYYNEMSRGI SDPAGFDDGH DHGHDHGNGH DHGFDDGFNL GHSELVFEAN LGDLFDGVLM IGFDEDHVEV EEAYLTTRAL PGGFQVKAGK FLSDIGYINS RHVHDWDFVD RPLVNEYLFG DHGLQETGVQ ATWLAPTETY IQLGVELLQG DNSFIPYQGG GGTHTTNAGD DIRSSFSDVS GPRLVTAFAK FGPDLGPDHA MQFGVSGGYA NTWQEETEHG SGLEEWDGDA WFAGLDAVYK YSSGRAYGHG DWRLQGEYFY REIDVDYQRR EDQTTIAGNA QGSGKARQDG LYIQGVYGFA PRWEAGLRSE ALGLTNRGLP TNEGSERNFA EPSFGTSYRH SAQVTFRPVE PVFLRAQLSQ NDFVDEDGDR DRGLEFMLQL NVALGAHGAH RF
|
| |
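The sequences above are printed in blocks of 10 characters, so the block spacing has to be removed before they can be used elsewhere. A minimal sketch with hypothetical helper names follows. The GC fraction here is a plain count of G and C over the sequence length, which may differ from how the database derives its GC content field.

```python
def clean_sequence(raw: str) -> str:
    """Remove whitespace between the 10-character blocks and upper-case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence, copied from the record above.
fragment = clean_sequence("ATGCCGCACA ACACACACCT GCTCCGTCCC")
print(len(fragment))          # 30
print(gc_fraction(fragment))  # GC fraction of this fragment only
```

Applying the same two helpers to the full Gene sequence field would give a value to compare against the reported GC content of 65%.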