Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TK90_2313 |
Symbol | |
ID | 8808094 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | - |
Start bp | 2431261 |
End bp | 2432658 |
Gene Length | 1398 bp |
Protein Length | 465 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | |
Product | neutral invertase |
Protein accession | YP_003461539 |
Protein GI | 289209473 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.314535 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCAAC TCCCGCCCGA TGCCCGCAAC CCGACCCTGG AAGCGGCGTT CCAGCTGCTG CGTGATGCCG AGGTGCGCTA CGAAGGGCGC ATCGTGGGCA CCGTGGCCAG CCTGGATACG CGTGCGCCAG CGGAGAACTA CGCCGACTGT TTTATCCGCG ACTTTGTGCC CTCGGGGCTC GTGTATCTGT TGCACGACGA GCCCGAAGTG GTGCGCAACT TCCTGTCGCT AATCTTGCAG ATCCGTGACA CCCAGGAAGA GATCGAGGGC CACCGCCGCC TGCCACGGGT GATGCCGGCC AGTTTCCGGG TATTCACCGA CGAGAACGGT CGTGAGGGGT TGGCGGCCGA TTTTGGGGAT CGCGCGATCG GTCGCGTGGC CCCGGTGGAC TCGATGATGT GGTGGGTGCT GCTCGCACGT GCCTACCAGA ACCGCACCGG GGACCACGAC TTTATTAAGA GCCCGGATGT GCAACGCGGC ATCCGGTTGA TCCTCAGCAT CTGTCTGCAG GATCGCTTCG AGGTGTTTCC GACCCTGCTG GTGCCGGATG GCAGCTTCAT GATCGATCGC CGGATGGGCG TGTTCGGCCA TCCGCTGGAG ATCCAGGCGC TGTTCTACGG GATGCTGAAG GCCTCGCTGG CGATGCTCGA ACCCTGCGAC ACCGACTCCG AGCAGCTCTG CGAACAGTCC GCGATCCGTA CCCGGCAGCT GTCCGACTAC ATCCGCCGTT ACTACTGGCT GGATCTGGAA CGTCTGAACG ACATCCATCG CTACCGCACC GAGCACTTCG GCCACGAGTC GGAGAACGCG CTGAACATCT ACCCCGAGAG CATCCCCGAC TGGCTGGTGG ACTGGCTGCC CTCGGAGAGC GGGTATCTCG TCGGCAACCT CGGGCCGGGG CGCATGGACT TTCGTTTCTT CAGCTTCGGT AATCTGCTGG CAGTGTTGTT CGGGCTGGCC GACGAGCAGG AGTCGCGTTC CATCATGCAG ACCTTCGAGC AGCGCTTCGA AGACCTGATC GGCACCATGC CGGTGAAGAT CTGTTACCCG GCGATGAGCG GCGAGGAGTG GCGACTGCTG ACCGGCTCTG ACCCCAAGAA TACCCCGTGG TCGTATCACA ATGGCGGCAA CTGGCCGGCC CTGCTGTGGG CGTTTACCGG TGCCGCGTTG CGGGTCGGGC GGCCGGACCT CGCACGCAGC GTACATGCGG TCGCGGCCGA ACGGCTGCAC CGCGACGACT GGCCGGAATA CTACGACGGC CGTCACGGTC GCCTGATCGG CCGCCGCGCA AACTATCAGC AGACCTGGTC GGCCACCGCT GTGCTGGTCT CGCAGGCGCT GCTCGACAAC CCCGAGACCA TGAGCCTGTT CGACAGCCCT GAACCCGAAT TGCGATGA
|
Protein sequence | MNQLPPDARN PTLEAAFQLL RDAEVRYEGR IVGTVASLDT RAPAENYADC FIRDFVPSGL VYLLHDEPEV VRNFLSLILQ IRDTQEEIEG HRRLPRVMPA SFRVFTDENG REGLAADFGD RAIGRVAPVD SMMWWVLLAR AYQNRTGDHD FIKSPDVQRG IRLILSICLQ DRFEVFPTLL VPDGSFMIDR RMGVFGHPLE IQALFYGMLK ASLAMLEPCD TDSEQLCEQS AIRTRQLSDY IRRYYWLDLE RLNDIHRYRT EHFGHESENA LNIYPESIPD WLVDWLPSES GYLVGNLGPG RMDFRFFSFG NLLAVLFGLA DEQESRSIMQ TFEQRFEDLI GTMPVKICYP AMSGEEWRLL TGSDPKNTPW SYHNGGNWPA LLWAFTGAAL RVGRPDLARS VHAVAAERLH RDDWPEYYDG RHGRLIGRRA NYQQTWSATA VLVSQALLDN PETMSLFDSP EPELR
|
| |