Gene Information |
Locus tag | TK90_2833 |
Symbol | |
ID | 8829245 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013930 |
Strand | + |
Start bp | 188841 |
End bp | 190691 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | |
Product | type II secretion system protein E |
Protein accession | YP_003494785 |
Protein GI | 290243115 |
COG category | |
COG ID | |
TIGRFAM ID | |
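
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes that the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. All variable names are illustrative.

```python
# Values copied from the Gene Information table above.
start_bp = 188841
end_bp = 190691
gene_length_bp = 1851
protein_length_aa = 616

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon
# is a stop codon, which does not contribute a residue to the protein.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, codons, remainder, expected_protein_aa)
```

Under these assumptions the span matches the reported gene length, and the codon count minus the stop codon matches the reported protein length.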
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.587179 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 0.68865 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCATGG ACGCAGCCGA ACAAACCCAG GAAGGGGTCG AGGACGTGAA TGTTTTCGAC GCCGAGGTAT CGATCGAGGA CGAGTGGCAG GAAGAGCCTG AGCCTAGCGG CGCGGGTCAG ATGCGTGATC CGTACGCGGC TTTCGAAGAT CCTGAAATTC TGAAGATGTC CGACCTTCCG GACTACGACT GGGTTCTGGA AGAAGGAGTG GGTGGAGTCA AGATTCCGGA CGGGATCGAG AAGTTCGCTT GTATTATCGC CCGTGTTCAG GCGTGGCACG GCGATTTCAG TCGAGAGGAC ATCCGTATTG CCAGTCTCCC GGATATGGTC GAGCAGGCGT CCGTTGACCT GGAAATCTGG CTGGTGGTGA CCGGGCGCGT CAAGGATCGG AACGCGCTGA AGGTGGCTGA AATGGTGCGA AGCCTTCGGA ATCGCTACCC ACGCGCCCGC CTGCATGCAA AGCCGAAAAT TGCCACGGCG GCCATCATCG AGGCGATGTA TGAGCGAAAC AATGAGGTCG TCAATGGTGC GATCGAAGAT GTTTCGGAAG AACAGCAGGA GTTCGACAAG CTCGGCCGAT ATGCCTTTGA TAACGGGGTG TCGGACATCC ATATCGAGGT TACATCGGAG CAGGCCCAGA TCTTGATGCG CCAGCACGGC CGGCTGCGGC ACTACAAGGA CCTGGCACCC GCCAAGGCCA CCGCCATTTG CTCGGCGGTC TACAACACCA TGTCGGAAGC AGGTTCGACC CGTGACTCTT TCAATGAGCG CAAATTCCAA AATGCGGTGA TTGACCGGCC GTACGATGAA GGGCGTGTCC GGTTTCGTTA CGCGTCCATG CCGGTCGCAC CCAATGGGTT CAATGTGGTC CTGCGGTTGC TTCCGGTAGG TGTGGAAAGC GCGCACAAGT CGTTCGAAGA CCTCGGTTAT GCCCCGGCGC ACACGCAGAG CATGCAGCGG GCAATGGCGC GCTCATCGGG CATGGTGATC ATCGCGGGGA CCACCGGATC CGGTAAGTCC ACGACCCTCA AGAATGCGAT GGAGGGCGTG GCAACAGCCA ATCCCGACCA GAAGATCCGA ACCATCGAAG AGCCGGTCGA ATACTCAATC CGGAATACGT CGCAAACTCC GGTGGTGCGG GACGATAAGG AGAAGGATGG GGCGAACTCC AGCAAACCGT TCGCTGATGC GATCAAGGCG GCCATGCGGG CCGACCCGGA CCGAATGCTC GTCGGGGAGA TTCGAGACAA GATCACGGCC GAGCTCGCGA TTCAGGCATC CCAGACGGGC CATGGCGTTG CCACGACCCT ACATGCGGAA TCGTGGAGCG GGATCTTCGA TCGACTGGGA TTGCTGGGGA TACAGATGGG CGTGTTGGCG CAGCCGGGCC TCATCGCAGG TTTGGCTTAC CAGAAACTGA TGCCGGTGCT GTGTGAGCAC TGCAAGATTA GTTTCCATCA GTGGGTGGAT GAACAGGCAG ACATGGAGAA TCCAAAGGAC TCAGGTTTTG TAGATCGAGT CCGTCGCGTC GTCCCCGACG CCGATCTGGT GATGGTCTAC ATGACAGGGC CTGGGTGCCG TCAGTGTAAA GGGACAGGGA TAACCGGCCA GACGGTTTGC GCGGAGGTCG TATTGCCAAC CAATGCCATG CTCGATGCGG TTCGTCATGG TGATGTCGTC AAGCTGAGGC AGCTCTGGAG GGAACAACGA AACGACTCGG ACCCAGATGA CATGAGCGGG CGGACTGCAT TCGAGCATGC CCTGCTGAAG ATGCGCCACG GGATCTGTGA TCCGAAAGAT 
GTGGAAAGCA AGTTCATGCG GTTGGACGAA GTGTTCCACC TCGACGATTA A
|
Protein sequence | MSMDAAEQTQ EGVEDVNVFD AEVSIEDEWQ EEPEPSGAGQ MRDPYAAFED PEILKMSDLP DYDWVLEEGV GGVKIPDGIE KFACIIARVQ AWHGDFSRED IRIASLPDMV EQASVDLEIW LVVTGRVKDR NALKVAEMVR SLRNRYPRAR LHAKPKIATA AIIEAMYERN NEVVNGAIED VSEEQQEFDK LGRYAFDNGV SDIHIEVTSE QAQILMRQHG RLRHYKDLAP AKATAICSAV YNTMSEAGST RDSFNERKFQ NAVIDRPYDE GRVRFRYASM PVAPNGFNVV LRLLPVGVES AHKSFEDLGY APAHTQSMQR AMARSSGMVI IAGTTGSGKS TTLKNAMEGV ATANPDQKIR TIEEPVEYSI RNTSQTPVVR DDKEKDGANS SKPFADAIKA AMRADPDRML VGEIRDKITA ELAIQASQTG HGVATTLHAE SWSGIFDRLG LLGIQMGVLA QPGLIAGLAY QKLMPVLCEH CKISFHQWVD EQADMENPKD SGFVDRVRRV VPDADLVMVY MTGPGCRQCK GTGITGQTVC AEVVLPTNAM LDAVRHGDVV KLRQLWREQR NDSDPDDMSG RTAFEHALLK MRHGICDPKD VESKFMRLDE VFHLDD
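
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is not part of the original record. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs only in which codons may act as initiators, and it simplifies initiation by reading the first codon of a full CDS as M (which is why the leading GTG appears as M in the protein). The helper name `translate_cds` and its `starts_at_initiator` parameter are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering of each base position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str, starts_at_initiator: bool = True) -> str:
    """Translate a coding sequence; spaces and line breaks are ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    # Simplifying assumption: the first codon of a CDS is a valid
    # initiator and is therefore read as methionine.
    if starts_at_initiator and residues:
        residues[0] = "M"
    protein = "".join(residues)
    # Drop a terminal stop codon, which is not part of the protein.
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases and last 12 bases of the gene sequence above.
head = translate_cds("GTGAGCATGG ACGCAGCCGA ACAAACCCAG")
tail = translate_cds("CTCGACGATT AA", starts_at_initiator=False)
print(head)  # MSMDAAEQTQ
print(tail)  # LDD
```

The two fragments agree with the start (MSMDAAEQTQ) and the end (...HLDD) of the protein sequence listed above.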
|
| |