Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TK90_0823 |
Symbol | |
ID | 8806578 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | + |
Start bp | 881147 |
End bp | 883054 |
Gene Length | 1908 bp |
Protein Length | 635 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | |
Product | protein of unknown function DUF255 |
Protein accession | YP_003460074 |
Protein GI | 289208008 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.302169 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.0427489 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCAGG ACGGACACAG AGCGCGACAT CCGGTCATTC GGGCCAGTTC CATGCTGGGC GTGATTACAG CCCTGTCCTT GGCTGTGGCC ATGCCGGTGC AGGCCAATCC CGAAATGGCA GAGAGTATTT CCCCGTACCT CCGGTTGCAT GTCGACGACC CGGTACAGTG GCGTGCCTGG GATCCTGCGC TGCTGGAGAA AGCCCGTGAG GAACAGCGCC CGCTATTGAT CTCGAGTGGG TACTACTCCT GCTATTACTG CCATGTCATG CAGGAGGAGA GTTTCCAGGA CGATGGCATT GCCGAGCGCC TGAACGAGCA TTTCATCCCG GTCAAGGTCG ATCGCGAGGT GAACGGCGCG CTCGATACCT ATCTGCTCGA GTTTCAGCGC GCGACCGCCG GGGCAACCGG CTGGCCGCTG AACGTGGTGG TCACGCCCGA GGGGTACGCG CTCACAGGCG TGGTGTATGC CCCGCGTGAT CAGTTCGCGG AGTTCCTGGA CGGCATTACC GATCGCCTCG CGGCAGACTA CGACAGGCTT CTGGAGCTCG CTCGCCGCGG CAGTGACGAG CTCTCCGAAC AACTGCGCAA GGGCGAAGAG CCCATGTCGG ACAGTCGGGC GGCGCGTCTG CCGCAGGTCC TCTGGAACGC AATGGAACGC GAGGCCGATG ACCTCCAGGG CGGGTTCGGG AGTTCGACCA AGTTCCCGCA TTCGCCGTGG CTGCGCGCGA TGCTCGAGGC CGAACAGGCT GGCAAGGCAC CGGACTGGGT CCCGGAATTT CTCGAGATCA CGCTGGACGA GATGAGCGGG TCCGGGCTGC GTGACGTCCT GGGTGGCGGG TTCTTTCGTT ATTCCGAATC GCCGGACTGG GACGAGCCGC ATTTCGAAGT GATGCTGGAG GACCAGGCCC AGCTGGCATG GATTTACCTC AAGGCGGGCA AGCACTTCGA TCGCGATGAC TGGGTCGATA TCGGCCTGGA GAACCTCGAT TTCGTGAAGC GCGACATGGC CCTGCGCGAG GATGACCTGG GCGAACAGGC CGACGTCGGT CCCGAGGGCT TTCGTGGATT CGCGGCCAGT CTCTCGGCCA TCGACGAGCA GGGGCGCGAG GGCGGTGCGT ACCTGTGGCC TGAGGAGGAC CTCGACGCTG CGTTGGCGGA GCACGAGCAC CCGGAACTGG TGCGGGCCTA TTTCGGGATG GAAGGGCCCT CGGTGTTCGA CCTCGGATAC CTGCCGCGTG GAGGCGATTC CATCACGCAA TTGGCCGAGC GCTTCGAGCT CGACGAGGAG GTGGTGCGGG AGCAGGTGGA CAGTGGCCGC GAGGCCCTGA TCCAGGCTCG CGAGGAGCGC GGTCTGCCGC GCGACGAAAA GGTCCTGACC GGGGCGCATG GGCTGCTGCT TTCCGCGCTG TCGCTTGCAG CCGACCACGA CGAGCGCTTC GAGGAGGCCG GAGAGGCCGT GCGCTCGTGG TTGCTGGAGG TGGCCGAGTC GCCCGACGAC TTGCCGACGC TGATGGGCCT GCCGAGCGAC GATGCCGGCG GTGCGACGCT GAACGACTAC GCCTATGTTG CCCAAGGGCT TCGTGACTGG GATTCGGTAC GCGACAGTGG TGATCGCGAC GCCGCGATCG CCTTGCTGGA GACGGCCTGG GAGCGTTTCC GTGATGACGA CGGCTTTCGG TCCGAGCCGG AACCCCCGCT GCCCGGGATG ATCTCGCAGC GCTTCCATCC GGCGGTGCAT CGGCCGTCGC CGACCACGCT GATCCTGCGG ATGACCGAAG AGTGGCTGGA GCACAGCGAG GCGCTCAGGA CGGCCTTCGA GGAATACGAC TTGCGCCCGG GGCGCGGAGT CGAAGGGCAG CCGAAGCATC ATGCGCGGCT GATCCTCTGG TTCGAGGAAC GCGACTGA
|
Protein sequence | MPQDGHRARH PVIRASSMLG VITALSLAVA MPVQANPEMA ESISPYLRLH VDDPVQWRAW DPALLEKARE EQRPLLISSG YYSCYYCHVM QEESFQDDGI AERLNEHFIP VKVDREVNGA LDTYLLEFQR ATAGATGWPL NVVVTPEGYA LTGVVYAPRD QFAEFLDGIT DRLAADYDRL LELARRGSDE LSEQLRKGEE PMSDSRAARL PQVLWNAMER EADDLQGGFG SSTKFPHSPW LRAMLEAEQA GKAPDWVPEF LEITLDEMSG SGLRDVLGGG FFRYSESPDW DEPHFEVMLE DQAQLAWIYL KAGKHFDRDD WVDIGLENLD FVKRDMALRE DDLGEQADVG PEGFRGFAAS LSAIDEQGRE GGAYLWPEED LDAALAEHEH PELVRAYFGM EGPSVFDLGY LPRGGDSITQ LAERFELDEE VVREQVDSGR EALIQAREER GLPRDEKVLT GAHGLLLSAL SLAADHDERF EEAGEAVRSW LLEVAESPDD LPTLMGLPSD DAGGATLNDY AYVAQGLRDW DSVRDSGDRD AAIALLETAW ERFRDDDGFR SEPEPPLPGM ISQRFHPAVH RPSPTTLILR MTEEWLEHSE ALRTAFEEYD LRPGRGVEGQ PKHHARLILW FEERD
|
| |