Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TK90_0404 |
Symbol | |
ID | 8806139 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | - |
Start bp | 420208 |
End bp | 421338 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | protein of unknown function UPF0075 |
Protein accession | YP_003459655 |
Protein GI | 289207589 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000929624 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCAGC GGTTAATTGG ATTGATGTCC GGCACCAGCC GGGACGGCGT GGACGCCGTC CTGGTCGAGA TTGAAGGAGA CGGCTCGCTG CATACCGCGG GGCATTGCCA CCTGCCATAC CCGGATGCGC TGGAGGAGGA TCTCGCAGCC GCCGCGACCG CCGAGGCACT GCGCTTCGAG CACCTGGGCA CACTGGACGC CCGGGTTGGG CTGTTCCTGT CGCGCGCCGT CCAGCAACTG CTGGGATCCA CCGGCCTGAA GGCCCAGGAC ATCCTGGCCA TCGGCTCACA CGGGCAAACG GTCCACCATG CGCCCGGCGC CGACCCCGCG TTCACCTGGC AGATTGGCGA CCCATTTCGA ATCGCCGAAG CCACCGGCAT CGACGTAATC GCGCACTTTC GCCAACGCGA CCTCGCGGCG GGTGGCGAGG GGGCTCCGCT GGCCTGCGCG TTCCATGCCG CCTGGCTGGG CCACGCGTCC GAGACACGCG CGATCCTGAA CCTCGGGGGG ATCGCCAATC TGACCTGGCT GGAGCCGGGC CAGCCGGTAC GCGGATGCGA CAGCGGACCT GCCAACACCC TGCTGGACGG CTGGGCCCGG CGGCACCTGG GCCAGCCCTA TGATGCCGAT GGCTCCTGGG CGCGCACGGG ACACGTCGAC CGGTCACTGC TCGAACAACT CCTAGCGGAC CCGTACTTTC AGCGCCCCGC CCCCAAGAGC ACGGGGCCCG AGCATTTCTC GCCCCACTGG CTGCGCCAGG TTGGTGGCGA ACGGATCGAT CGCCTGAACA CAGAGGACGT TCAGGCCACC CTGGTCGAAC TTACGGTGGA GGGCGTGCGG CTGACGCTCG AATCCCTGCG CACAACCGCA CCCGATCGGG TCATCGTCTG TGGCGGAGGC GCCCACAACG GCTACCTGAT GGAACGGCTG CAAAGCCAGC TGGCCGGCAG CACCGTCGAG ACCTCCGAAC GCCACGGGAT ACCCCCTCAG CAGGTGGAGG GCGCCGCCTT TGCCTGGCTG GCGTATCGCC ACCTGCAACA AGAGGCCGGC AATCTGCCGG AGGTCACAGG TGCCCGTGGG CCACGCATCC TCGGCTGCCG GATCCCCGGA CGCGCACCTG AATCCACATA A
|
Protein sequence | MTQRLIGLMS GTSRDGVDAV LVEIEGDGSL HTAGHCHLPY PDALEEDLAA AATAEALRFE HLGTLDARVG LFLSRAVQQL LGSTGLKAQD ILAIGSHGQT VHHAPGADPA FTWQIGDPFR IAEATGIDVI AHFRQRDLAA GGEGAPLACA FHAAWLGHAS ETRAILNLGG IANLTWLEPG QPVRGCDSGP ANTLLDGWAR RHLGQPYDAD GSWARTGHVD RSLLEQLLAD PYFQRPAPKS TGPEHFSPHW LRQVGGERID RLNTEDVQAT LVELTVEGVR LTLESLRTTA PDRVIVCGGG AHNGYLMERL QSQLAGSTVE TSERHGIPPQ QVEGAAFAWL AYRHLQQEAG NLPEVTGARG PRILGCRIPG RAPEST
|
| |