Gene Information |
Locus tag | TK90_1339 |
Symbol | |
ID | 8807105 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | - |
Start bp | 1425507 |
End bp | 1427987 |
Gene Length | 2481 bp |
Protein Length | 826 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | FimV N-terminal domain protein |
Protein accession | YP_003460582 |
Protein GI | 289208516 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
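The reported gene and protein lengths can be cross-checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon; both assumptions are consistent with the values in this record but are not stated by it. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1425507
end_bp = 1427987

# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "2481 826"
```

Both results match the Gene Length (2481 bp) and Protein Length (826 aa) fields.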
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.866454 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.334377 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCACAAGT TACTACTCGG GCTACTGCTC TTGCTGCTGG TTGCACCGGC ATCATCGAAT TCGGTGAGTT TTGGTGAGGT CAACCTGCAA TCCCATCTGA ACCAGACCTT GCGGGCGGAG ATTCCGCTGC GAGGCGGTGC CGCGGCCAGT GATTCGCTAC AGGTGCGTCA GGCTTCGGAA AACGAATACC GGCGCGCCGG CATGAGCCGC GGCAGCGTGC CGGGCGACCT GAACATCCAG GTTCAGGGTG AGGGGGCGGA TCGCCGGGTG ATGCTGACCA CGCGCCAGCC GGTGCGCGAG CCGTATGTGG GCATCCTCCT CGAGGCCCGC TGGGATGGCG GGCGCTCAAT GCGTGAAGTC TCCCTGTTGC TTGATCCGCC GGACACCATG CCTTCCGGGG CGGCCGCTCC GGTCCCGCGT GACACCGCCC GGGAGGCGCC GCGTCGACAA GAGGCGCCAA TGCCGCGTGA AGGTGAGGCC TACACCGTGC GTAGTGGCGA TACCCTTTAT CGCATCGTGG AGCGCGCGGG TCTGGCCGGC ATGGCGGATC AGGCGATGCT GGCGTTCCTG GAGGCCAACC CCGACGCGTT TTCTGACGAG AACATCAACA GCCTGCGCGC GGGCGCTGAA CTGACCGTCC CGTCGCGCTC GGAGCTGGAG TCGCGTAGTG CGGCCGAGGC GCGCCAGGAA GTGCAGCGAC AGGTTCAGGC CTGGCGCGAG GGAACGGTGA CCGCAGCTCG CGAGGAACCC CAGCCTGAGG AAGCCGCCGA GGCCGAGCCG GAAGTCGAGG CGGTCCCGGA AGAGGATGAG GCGGTAGCGG AGGACGATGA GGCCGTGTCC GAGGACGACG AGGCCGCCGC GGATGATGCC GTGGCCGCGA CGGACGAGGA CGATGAAGCG GCGGCGGCCG ACGACCGCCT GGAGATCGTG ACAGAACTCC TGCCGGAGGC GGATGAAGGG GCGGCGGCCG CGGCCGATCC CCCCGATCTG CTCCAGGAAG CCATGCTGTC CCAGCGCGCC GAGATGGAGG GGATGCGCGA GCAGATCACC GAACTGCGCG AGGAACTGGG TGAGCGTGGC CGCCTTGCGG AGCTTTCGAG CGAGAACATG GCGGAGCTTG AGGAGCAGCT GAGCCAGCTG CGGGCGGAAC GCAATGAACT GATGGCACGG CTCGATCGGG CAGATGCCGA ACGCAATGCG CCGCTGCACG AGCGGATCAT GAATGATCCG CTGCTGCTGA TGATGGCGAT TGCGCTGGTA ATCCTGCTGC TGCTCGTGTT GCTGGCCTTT GCGCGTGGTG GTCGGCGCGA AGTGGTGGTT GAAGAGCGTG CTTCCGGCAT GCCGGCGCGG GCGGACAACA GCGGTGTGTG GGCCGCGGCG GCGGATCCCG CCGGCTACGA GGCACGTGAG ATCGATAGTG CCGGCGCGGG ATCGGGCGGC GCCACTACTA CAGCCGTAGC CGGCGGGGCT GCTGCGGGCC TGGCGGCCGG AGCTGTGGCG GGCAAGGACG ATGATCGCGA ACCGCTGGCG GATTCGGACG AGCCAGAGGA GTCGGTGAGT GAGCCTGTAG TGGTGAGCTC CAGTGGCGAT GCCTCGGTGG ACGACGTGCT GGCCGAAGTG GATGTTTGCC TGGCCTACGG GATGAATGAT CAGGCCGAGG AGACCCTGAC CCAGGCCATC GAGGGCGACC CGGACAATAC CCAGTATCGT CTGAAGCTCG TCGAGGCGCG CGTGGCGCTG GGCGACGAAG AAGGCGCCCG CGAGGCGGCA TCCAACCTGC GCGAACGCCT CCCGGCGGAC 
GATGTTGAAA CGCGCAATCA TCTGGCGGAG CTGGAATCGC GCATCGGTGG TGCTGGTGAT GACAGCGCAG GGCCCGCGGG CGCTCCGGCG GGGGTAGAAG GTCTGGGCGC TGCGGTGGAG CCGCAGGCGG CAGATGACGG CGAAGCACCG AACGAGATCG ATTTCTCCGG CCTGGATCTC CCGGATGTGC AATCCGAAGC CGAAGATCTC GGTAACACCG CCGAGCCGCA GGAAGAGACG GACTCCGGGC TGACCTTTGA TTTTGACGAG ACCTTCGATG AAGGGGCTTC GACGCCAGCG GAGCAGACCC CTTCGGAGAC GGACGAAGCG CGCAGCGATG ACCTGAATGA CTTGTCGTTC GATATCGACG ACAGCGACCT GCCGAACCTG GATGAAGAAA CCCGGGAGGA TGCGGCCGGG AGTGATATCC CGGCACTGGA TCTGGGCGAC GAGGACCTGC CGGCTGCGGG CCAGGAAGAT GCCTTGCCGC GAGAGGACGG AGAGTCCTCG CCCATCGGCG CCGACGAAGA CAATGAAACG CGCCTGAGCC TGGCGCAGGC CTATGCGGAC ATGGGCGATG ACGAAGGTGC TCGCGAACTG ATCGACGAGA TCGTGGCCAC CGGGTCGGAG GATCAGAAGG CGAAGGCTGA GGCGATTCGT CAGCAGCTGG AGAGCTCCTG A
|
Protein sequence | MHKLLLGLLL LLLVAPASSN SVSFGEVNLQ SHLNQTLRAE IPLRGGAAAS DSLQVRQASE NEYRRAGMSR GSVPGDLNIQ VQGEGADRRV MLTTRQPVRE PYVGILLEAR WDGGRSMREV SLLLDPPDTM PSGAAAPVPR DTAREAPRRQ EAPMPREGEA YTVRSGDTLY RIVERAGLAG MADQAMLAFL EANPDAFSDE NINSLRAGAE LTVPSRSELE SRSAAEARQE VQRQVQAWRE GTVTAAREEP QPEEAAEAEP EVEAVPEEDE AVAEDDEAVS EDDEAAADDA VAATDEDDEA AAADDRLEIV TELLPEADEG AAAAADPPDL LQEAMLSQRA EMEGMREQIT ELREELGERG RLAELSSENM AELEEQLSQL RAERNELMAR LDRADAERNA PLHERIMNDP LLLMMAIALV ILLLLVLLAF ARGGRREVVV EERASGMPAR ADNSGVWAAA ADPAGYEARE IDSAGAGSGG ATTTAVAGGA AAGLAAGAVA GKDDDREPLA DSDEPEESVS EPVVVSSSGD ASVDDVLAEV DVCLAYGMND QAEETLTQAI EGDPDNTQYR LKLVEARVAL GDEEGAREAA SNLRERLPAD DVETRNHLAE LESRIGGAGD DSAGPAGAPA GVEGLGAAVE PQAADDGEAP NEIDFSGLDL PDVQSEAEDL GNTAEPQEET DSGLTFDFDE TFDEGASTPA EQTPSETDEA RSDDLNDLSF DIDDSDLPNL DEETREDAAG SDIPALDLGD EDLPAAGQED ALPREDGESS PIGADEDNET RLSLAQAYAD MGDDEGAREL IDEIVATGSE DQKAKAEAIR QQLESS
|