Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TK90_1221 |
Symbol | |
ID | 8806983 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | - |
Start bp | 1298784 |
End bp | 1300469 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_003460466 |
Protein GI | 289208400 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.240035 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCAGG ACAATCGACG CAGCCGCGTC GTCACCGAAG GGCCACGCCG CACCCCCAAC CGCGCCATGC TCAGGGCCGT GGGCTTCGGT GACGACGACT TCCAGCGCCC CATCGTTGGC ATTGCCAACG CCCACAGCAC CATCACTCCC TGCAACATGG GGATCGGGGC ACTGGCCGAA CGTGCGGAAT CGGCCGTACG AGCCTCGGGC GGCATGCCGC AGACCTTCGG CACCATCACC ATCTCTGACG GCATTTCGAT GGGCACCCCG GGCATGAAGT ACTCCCTGGT GTCGCGCGAG GTGATCGCCG ACGCGATCGA GACGGTCGTC AACGGCCAGA GTCTGGACGG GCTGCTGGCC ACCGGCGGCT GCGACAAGAA CATGCCCGGC GCCATGATGG CGATCGCCCG GCTTAACGTG CCCGCGATCT TCGTCTACGG AGGCACGATC AAGCCGGGAC GCTACAAGGA CCAGGACCTG ACGATCGTCA GTGCGTTCGA GGCCGTCGGA CAGCATGGGG CGGGAAAGCT GTCGGATGAA GACCTGGCCG GCGTGGAGCG CAATGCCTGC CCGGGAGCCG GCTCCTGTGG CGGCATGTTC ACAGCCAATA CCATGTCCAG CGCCTTCGAG GCGATGGGCA TGAGTCTGCC GGGCTCGTCG ACCCAGGCGG CCGAGGACCC GGAAAAGGCC GACTCCGCCG CCCGTTCCGC CGAGGTCCTC CTGGAGGCGA TCCATGCCGA CCGCAAGCCG CGCGACATCC TCACGCGCGA GGCCTTCGAG AACGCCCTGG CCGTGGTTAT GGCCGTCGGG GGATCGACCA ATGCCGTGCT CCACCTGCTC GCCATTGCCC ACTGTGCCAA CGTGCCGTTA GCTCTGGATG ACGTAGAACG GATCCGCCGG AAAACCCCGG TACTGTGCGA TCTGAAGCCC TCCGGTCGAC ACGTGACCAC GGAATTCCAT GCGGTGGGGG GCACACCGCA GGTCATGAAG ATCCTGCTGA ACGCCGGTCT TCTGCACAGG CACTGCCTGA CCATTACCGG CCAGACCCTA AGCGAGGTCC TCGCAGACAT CCCTGATGCA CCACCATCGA ACCAGACCAT CATCCGCCCG CTGGATGCCC CCCTGTACCC GCAGGGGCAC CTGGCGATCC TGAAGGGCAA TCTGGCGCCG GAAGGCAGCG TGGCCAAGAT CACCGGGCTC AAGTCCACTT CGATTCGCGG GCCCGCGCGC GTATTCGAAT CCGAGGAGGA ATGCCTGGAC GCCATCCTGG CGGGCGGGAT TCAGGCTGGC GACGTCATCG TGGTTCGTCA TGAGGGCCCC CGCGGCGGGC CCGGTATGCG CGAGATGCTC GCCCCCACGG CAGCGATCAT CGGCGCGGGT CTCGGCGACT CCGTTGCGTT GATTACCGAC GGACGCTTCT CCGGAGGCAC CTACGGTCTC GTCGTCGGAC ATGTCGCTCC CGAGGCGGCG CAAGGTGGGC CCATTGCCCT GGTGCGCGAG GGCGACATCA TCGAGGTCGA TGCCGACCGT AATCGTCTCG CGCTCGAGGT ACCAGAGGAC GAACTGGCGC GTCGCCGCAG TGCCTGGCAG GCACCCCCGC CGCGCTTTGA TCGAGGTGTG CTCGGGAAGT ACAGCCGTCA GGTCGGATCC GCCAGTCGCG GCGCGGTCAC CGACGACTTT CGCTGA
|
Protein sequence | MTQDNRRSRV VTEGPRRTPN RAMLRAVGFG DDDFQRPIVG IANAHSTITP CNMGIGALAE RAESAVRASG GMPQTFGTIT ISDGISMGTP GMKYSLVSRE VIADAIETVV NGQSLDGLLA TGGCDKNMPG AMMAIARLNV PAIFVYGGTI KPGRYKDQDL TIVSAFEAVG QHGAGKLSDE DLAGVERNAC PGAGSCGGMF TANTMSSAFE AMGMSLPGSS TQAAEDPEKA DSAARSAEVL LEAIHADRKP RDILTREAFE NALAVVMAVG GSTNAVLHLL AIAHCANVPL ALDDVERIRR KTPVLCDLKP SGRHVTTEFH AVGGTPQVMK ILLNAGLLHR HCLTITGQTL SEVLADIPDA PPSNQTIIRP LDAPLYPQGH LAILKGNLAP EGSVAKITGL KSTSIRGPAR VFESEEECLD AILAGGIQAG DVIVVRHEGP RGGPGMREML APTAAIIGAG LGDSVALITD GRFSGGTYGL VVGHVAPEAA QGGPIALVRE GDIIEVDADR NRLALEVPED ELARRRSAWQ APPPRFDRGV LGKYSRQVGS ASRGAVTDDF R
|
| |