Gene Information |
Locus tag | TK90_1066 |
Symbol | |
ID | 8806824 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | - |
Start bp | 1125600 |
End bp | 1128122 |
Gene Length | 2523 bp |
Protein Length | 840 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | protein of unknown function DUF214 |
Protein accession | YP_003460314 |
Protein GI | 289208248 |
COG category | |
COG ID | |
TIGRFAM ID | |
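The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes the stop codon. The record does not state either convention, but both are consistent with its values. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above (TK90_1066 on NC_013889).
length = gene_length_bp(1125600, 1128122)
print(length)                     # 2523
print(protein_length_aa(length))  # 840
```

Both results agree with the Gene Length (2523 bp) and Protein Length (840 aa) fields listed in the record.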
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACGTC TTTCCGGGCG CGGTGGTCCG CGCGAATGGC TTCGCGGTAC CTGGCGCGAG TGGCGCAGCG CCGAGTTGCG CGTGCTTGCC GCGGCGCTGG TCGTCGCGGT GGCGGCGGTC GCGGCGGTGG GTTTTTTCAC CGACCGGGTG GATCGGGGCA TGGCGCAGCA GGCGACCCTG CTGCTGGGCG GTGACATGGC CGTGGTCGGC GACCAGCCGA TCGCGGAGGA CTGGCTCGCG GAGGCCGAAC GCCGAGGCCT GCAGACCGCC CGCACCGCCG CGCTGCCCAG CGTGCTGTTT ACCCCCGAGG ACGACTCCCG CCTGTCCGCG CTGCGCGCGG TGGACCGTGA CTGGCCTCTG CTGGGCGAGG CGCTCGTAAG AGACGTCCCG GATGCCGACC CCGAGGTACA CACGCAGGGT CCGGAACCCG GTACGGCCTG GCTGGAAGGC GCTCTGCTGC AGGCCCTGGA GCTCGAGGTT GGCGATACGG TCGAGGTCGG CGAACTGGTG CTGGAAGTGG CCGGCGAGAT CGCCCGCGAA CCCGACCGCG GCAGCGCCCT GCTGGATATC GCACCCCGGC TGATGCTCGC CTATGACGAT CTCGAGGTCA GCGGGCTGAT CGGACCCGGC AGTCGCGTGC GCTACGAACT CTTTGCGACC GGCCCGGCGG CCCAGATCGA AAGCTACCGC GACTGGATCG AGGAGCGGCT GGAGCCGGGT CAGCGCCTGC GCGAGCTTGA TGACGCCAAC CCGGAGCTGC AGGTCGCCAT CGAGCGGGCC GAGCGCTTTC TCGGGCTGGC CTCGCTGATG GCAGTCCTGC TGGCCGGGGC GGCGATCGCG GTGGCCGCCC ATCATTTTGC CCAGCGCCGC GCGGATGCCG CCGCGGTCAT GCGGGCGCTG GGGGCCAGCG GGCGCCATGT ACTCAGGCTT TATTTTGGCC ATGTCCTGAT CATTGCCCTG CTCGCCAGTG GGATCGGACT GGCCATCGGA TTTGCCGCGC AGTTTGTGCT CTCCGCGCTG CTCGGGGAAT GGCTGGCGGG CGACCTGCCG CCGCCGGGGC TCCGCCCGGT GGCGGCGGGC CTCGGCGTGG GACTGATTAC CGCGGTCGGC TTCGCCCTGC CCGCGTTGCT GCGCATCGGC CAGGTGTCTC CCCTGCGCGT GCTGCGCCGC GATCTGGGGC TCCCGCCCGC GTCCGTCTGG CTGTCGGGAC TGCTGGCCCT GCTGGCCTTC GGTGCGCTGT TGTACTGGCA GGCGGCCGAC CCGCGTCTGG CCGCCTGGGT CCTGCTCGGG ACCGCGGGCG CGCTAATCGT GCTCGGGGCA CTGGGCCTGC TGCTGGTGCT GGCCCTGCGC CCGCTGCGCA ATCATGGAGG TATGGCCATG CGCTTTGGTC TGGCCAACCT TTCCCGCCGT GGCGGACTGA GCGCCGTGCA ACTGGTCGCC TTCAGCATCG GCATCCTTGC ACTATTGCTG CTGGCGATCG TGCGCGTGGA CCTGCTCTCC GCCTGGGACG CGAGCATCCC CGCCGATGCA CCGAACCAGT TCTTCGTCAA CATCCAGCCC GGGCAGGAAG AGGCCTTTGC CGACCGAATC GAGGCCTCCG GGCTCGAACG CCCGCAGTTG GACCCAATGA TTCGCGGGCG CCTGATCGCG ATCAACGACC AGCCGGTCCA GCCCGATGAT TTTGACGACG ACCAGACCCG GCGTCTGGTG GATCGCGAGT TCAACCTTTC CTACGCGGAA GAACCGCCAG AGCACAACCG CGTGGTGGAA GGCCACTGGT TCCACCACGG GCCGACCGGC 
GAGGATGCGG ACGGCGCATG GTCGGTCGAG GCCGGTTTGA TGGAACGTTT CGGTCTGGAG ATTGGCGACA GCCTGACCTT CCGAGTGGGC GGGCAACCCG TGCGCGGCAT GATCGCCAAC GTGCGCGAGG TCGACTGGGA CAGCTTCCGG GTCAATTTCT TTGTGATCGG CCCACCCGCT CTGTTGGGCG AACAGCCGCG CACCTACATC ACCGCACTGC ATGTGCCGGA GGGGATGGGC GCCGAGCAGA ACCGCTGGCT GCGGGAATTC CCGGCCGTCT CCGCGATCGA CGTCGGCGCG ATCCTCGAGC AGGTTCGTGA CGTGATGGAT CAGGCCACCC GCGCCGTGGA GTATGTGTTC CTGTTTACAC TGCTCGCGGG CCTGACCGTG CTGTTTGCGG CTGTGCAGGC CACACGCGAC GTGCGCCGCC GCGAGACCGC CCTGCTGCGC ACCCTGGGGG CCCGACGCCA CCACATCCGG CGGGCGTTGC TGGCCGAGTT TGGCAGCCTG GGCTTTCTGG CGGGTCTGCT GGCAGCCGTC ATCGCGACCG CCGTGGGAGG CCTGCTGGCC TGGCAGGTAT TCGAGTTCGA CTACCGCGTC AACCCGATGA CCTTCGTGTT CGGGATTGTC GGCGGGACCC TGGGCATTGC GCTGGCCGGC TGGCTGGGAA CGCGTAGCGT GCTGCGCCAG GCGCCACTCG GGGTCCTGCG CGGCCCCGAG TGA
|
Protein sequence | MTRLSGRGGP REWLRGTWRE WRSAELRVLA AALVVAVAAV AAVGFFTDRV DRGMAQQATL LLGGDMAVVG DQPIAEDWLA EAERRGLQTA RTAALPSVLF TPEDDSRLSA LRAVDRDWPL LGEALVRDVP DADPEVHTQG PEPGTAWLEG ALLQALELEV GDTVEVGELV LEVAGEIARE PDRGSALLDI APRLMLAYDD LEVSGLIGPG SRVRYELFAT GPAAQIESYR DWIEERLEPG QRLRELDDAN PELQVAIERA ERFLGLASLM AVLLAGAAIA VAAHHFAQRR ADAAAVMRAL GASGRHVLRL YFGHVLIIAL LASGIGLAIG FAAQFVLSAL LGEWLAGDLP PPGLRPVAAG LGVGLITAVG FALPALLRIG QVSPLRVLRR DLGLPPASVW LSGLLALLAF GALLYWQAAD PRLAAWVLLG TAGALIVLGA LGLLLVLALR PLRNHGGMAM RFGLANLSRR GGLSAVQLVA FSIGILALLL LAIVRVDLLS AWDASIPADA PNQFFVNIQP GQEEAFADRI EASGLERPQL DPMIRGRLIA INDQPVQPDD FDDDQTRRLV DREFNLSYAE EPPEHNRVVE GHWFHHGPTG EDADGAWSVE AGLMERFGLE IGDSLTFRVG GQPVRGMIAN VREVDWDSFR VNFFVIGPPA LLGEQPRTYI TALHVPEGMG AEQNRWLREF PAVSAIDVGA ILEQVRDVMD QATRAVEYVF LFTLLAGLTV LFAAVQATRD VRRRETALLR TLGARRHHIR RALLAEFGSL GFLAGLLAAV IATAVGGLLA WQVFEFDYRV NPMTFVFGIV GGTLGIALAG WLGTRSVLRQ APLGVLRGPE
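The sequences above are displayed in space-separated blocks of ten characters. As a minimal sketch, the hypothetical helpers below strip that display formatting and compute the GC fraction of a nucleotide string. They are illustrative only and are not the method the database used to derive its GC content field. The example input is just the first two blocks of the gene sequence, so its result is not the whole-gene value.

```python
def clean_sequence(display_text: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(display_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C in an already-cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First two display blocks of the gene sequence in this record.
first_blocks = "ATGACACGTC TTTCCGGGCG"
seq = clean_sequence(first_blocks)
print(seq)               # ATGACACGTCTTTCCGGGCG
print(gc_fraction(seq))  # 0.6
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` would give a value to compare against the GC content field of the record.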