Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TK90_1902 |
Symbol | |
ID | 8807675 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | - |
Start bp | 2017351 |
End bp | 2019384 |
Gene Length | 2034 bp |
Protein Length | 677 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | protein of unknown function DUF255 |
Protein accession | YP_003461129 |
Protein GI | 289209063 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCGAC TGGCCGGAGC CAGCAGCCCG TATCTGCTAC AGCATGCCGA CAACCCCGTG GACTGGTATC CGTGGGGCGA GGACGCGCTG GAACGCGCCC GCCGCGAGGA CAAGCCGATC TTGCTGTCGA TCGGCTACTC GGCCTGCCAC TGGTGCCATG TGATGGCGCA TGAATCGTTC GAGGACCCGG CCACCGCCGA GGTGATGAAC CGTCGCTTCA TCAATATCAA GGTGGACCGC GAGGAACGTC CCGACCTCGA CCGCATCTAC CAGAATGCCC ACATGCTGCT GTCGCAGCGC CCGGGTGGCT GGCCGCTGAC GGTGTTCCTG ACACCCGACC AGGTGCCGTT CTTTGCCGGC ACGTACTTTC CGAGCACCCC GCGTCACGGG CTGCCGTCGT TCGTCGACCT GATGAATCGC GTGGCCGATT TTCTCGCCGA GCACCCGGAC GAGATCCAGC GCCAGAACGA GTCGCTGCAG CAGGCATTGG CGCGCATTTA TCGACCGGCA GGCGGGGCGA TCCCGGCGAT CGGGGTGCTG GACAAGGCGC GGGCCGAGCT GGCCCAGACC TTCGACGATC AGTTCGGTGG CTTTGGCGAT GCACCCAAGT TCCCGCACCC GGCGAGCCTG GAATGGCTGG CCTGGCACGC GGCGCGCCAC AATGATGCCG AGGCCGAGCG CATGCTCGAG CGGACGCTCG CGGCGATGGC CGCAGGCGGG ATCTTCGACC AGGTGGGCGG CGGCTTCTGC CGCTACTCGG TGGATGCGCG CTGGATGATC CCGCACTTCG AGAAGATGCT CTACGACAAC GGGCCGCTGC TGGGCCTGTA TGCCGAGCGC GCGGCCGCCG GCGACGACCG TGCCCGGCGT GTGGCCGAAC AGACGGTCGC CTGGCTGGAG CGCGAGATGC GCGACCCCTC CGGCGCCTTC TACTCCAGCC TCGATGCCGA TTCCGAGGGC GAGGAGGGCC GCTTCTACGT CTGGGATCCG GAGATGGTCG AGGGCCTGCT GCCGGAGGAC GAGTGGGTGG TCGCCAGCCG GGTCTGGGGG CTGAACGGCC CGGCCAATTT CGAGGGTCGC TGGCACCTGC ACGAGGTGGC CCCGATCGCG ACCGTGGCCG ATGCCCTGGG TATCGACGAG TCCGAGGCGG AAACGCGCCT GGGACGTGCC CGCGAGCGCC TGCTGGCCGC CCGCGAGCAG CGCGTGCGGC CGCATCGCGA CGACAAGATC CTGGGGGCCT GGAACGCGCT GATGATCAAC GGGCTCGCCC GCGCGGCGCG TGCGCTCGAA CGGCACGATT GGCTGGGACT CGCGCGCGCG GCGATGCGCG CGGTGCGTGA ACGGCTCTGG CACGACGGCC GGCTGTTTGC CAGCTTCCGC GAGGGCGCCA CCAGCGAACT GCCGCGCGCC TATCTGGACG ACCACGCCCT GTTGCTGGAG GCCACGCTGG CACTACTGGA GGTCGAGTGG GATGGCGACC TGCTGGGCTG GGCCACGACC CTGGCAGAGG CCCTGCTGGC CGACTTCGAG GATACCGAAC ACGGTGGCTT CTTCTATACC GCGCGCGATC ACGAGGCGCT GATCCAGCGG CCCAAGGTCT ACGCCGACGA TGCGATGGCC GCCGGTAATG GCATTGCCGC GCAGGCCCTG CAGAAACTGG GCTACCTGCT CGCCGAGCCG CGCTATCTGG AGGCCGCCGA GCGCACGCTG GCCAACGCCG GGCCGATGAT CGAACAGGCC CCGTTGGGCC ACATGAGCCT GCTGGTCGCA CTGGACATGC ACCAGCAGCC GCCGCCGCTG GTCGTGCTGC GCGGAGCGGC TGACGAACTT GCTCCGTGGC AGCAACGGCT GCGCGCGCAT GATGCTCCGA TGTGGGTCTT CGCGATTCCG GCGCAGGCGG ACGATCTGCC TCCCGCGCTG GCGGAGAAGG CCGCCCCGGA GACGGGTGTG CGCGCCTATC TCTGCCGCGG CCTGCACTGC GAAGTGCCCG TGACCGACCC CGCTGCCCTG GAAGGTGTGC TCGCCGCGGG TTGA
|
Protein sequence | MNRLAGASSP YLLQHADNPV DWYPWGEDAL ERARREDKPI LLSIGYSACH WCHVMAHESF EDPATAEVMN RRFINIKVDR EERPDLDRIY QNAHMLLSQR PGGWPLTVFL TPDQVPFFAG TYFPSTPRHG LPSFVDLMNR VADFLAEHPD EIQRQNESLQ QALARIYRPA GGAIPAIGVL DKARAELAQT FDDQFGGFGD APKFPHPASL EWLAWHAARH NDAEAERMLE RTLAAMAAGG IFDQVGGGFC RYSVDARWMI PHFEKMLYDN GPLLGLYAER AAAGDDRARR VAEQTVAWLE REMRDPSGAF YSSLDADSEG EEGRFYVWDP EMVEGLLPED EWVVASRVWG LNGPANFEGR WHLHEVAPIA TVADALGIDE SEAETRLGRA RERLLAAREQ RVRPHRDDKI LGAWNALMIN GLARAARALE RHDWLGLARA AMRAVRERLW HDGRLFASFR EGATSELPRA YLDDHALLLE ATLALLEVEW DGDLLGWATT LAEALLADFE DTEHGGFFYT ARDHEALIQR PKVYADDAMA AGNGIAAQAL QKLGYLLAEP RYLEAAERTL ANAGPMIEQA PLGHMSLLVA LDMHQQPPPL VVLRGAADEL APWQQRLRAH DAPMWVFAIP AQADDLPPAL AEKAAPETGV RAYLCRGLHC EVPVTDPAAL EGVLAAG
|
| |