Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TK90_1582 |
Symbol | |
ID | 8807351 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | - |
Start bp | 1681826 |
End bp | 1684051 |
Gene Length | 2226 bp |
Protein Length | 741 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | protein of unknown function UPF0126 |
Protein accession | YP_003460815 |
Protein GI | 289208749 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGGTTGG CGCTGGCCAT CCTCGCGCCG TTCACGGCCG GTGCGTCCTC GCCCCCGTCC GCGGCCGATA TCGGCACCCC GATCCCGATC GCGCAGCAAT CGGCGCCCGA AACGTTGCGC TCTGGCTGGT TCGAACGTCC CCCGTATCAG GCCGGTTCCG ACAGCGCATC CCGGGCATCC CCGGCCGGGC TCGACGTGCG GGTCGCGCAG GAGGCACTGG CCGGTGTCGG GCGGGGTGCC GAGTTCACGC CGCTTTCCTG GAACGACCAA CTCGCCGCGC TGCGCCACGG CGCGATCGAT TTTGTCATCG GCACCTATTT TGATTCCGGT CGACTCGACT ATGCGCATTA CTCGCAGCCC TACCGTCAGG AACGTAACGC CCTCTATGTG CGCGCCGGCG AGCACGGTGA CTATTCATTC CAGGACCGCG CGGCCCTGGT CCAGGCGGTT CGCGACGGAG AGTTCCGGCT GGGGGTGACA ACGGGCTATG CCTACGCCTC CAGCGAGCTC GCGGACCTGG TCGCGGATCC GGGGGAGGCC GCGCGCCTGT ACCCCGCCGC CGGTGACGAG GCCCATATCC AGGCGCTGAT CAAGGGCCGG ATCGACGGCT TTGTCGCGGA CCCCGTGATT GTGGACCTGC TGCTGGCGCG GACCGATCGT GCCCGTGCGA TCCAGACGCA TCCGCTGGAC CTGGGGACCG TTGGCGTGCG GGTGATGTTC TCCCGCGCCA CGGTCCCCGC GGCGTTCGTC GAGCAGTTCA ACGCGCAGTT GCAGGCGCTG GAGGACAGCC GGCGGATGGC CGCGCTGGAA GTCGAGCATG TGCTGCCCGC GTTTCTCTCG ATGGCCACTA GCGAGGCCTG GTTCAATACG CTCACCCTGC TGGGGATCCT CGCTTTTTCC GCCTCCGGGC TGATCCTCGC CCGGCGCGAA CGTTACAACC TGTTCGGCGC GCTGGTCCTG GCCTCGTTGC CGGCGATCGG CGGCGGCGTG CTGCGCGATC TCCTGCTGGG CCGCGACCCG ATCTTTATCT TCGAGACGCC GGAGTTTCTT CTGGTCCCGA TCGTGGTCGT GGGCGTCGGC TTCGTGCTGT ACAAGCTGCA CGATCATGTA CTGGTGCGCT GGCATGTCGT CCAGGAACTC CTGACGCGGC AGCAGGCGGG ATTGGGCGGG CGCGTGGTGG GGAATCTGCA GCGTGGACTG GATGCCTGGG CGGTGGCCTC GTTCACCGTG ATCGGCGTCG GGGTGGCCGT CGAGTCGCAG GCCAGCCCGT TGTGGCTCTG GGGCCCGGTA ATGGCCGTGG TGACCGCCTC GTTCGGCGTG ATCATGCGCG ATGTGGTGCG CTCGGACTTC AACATCGACA TGCTCAAGCG CGACTCCTTC GCCGAGATCT CCGTCCTGGG CGGCATTCTC TATTCGCTGA TCCTGCTTTG GCCCCCGGTC GAGCTGAGCC TCGGATTCAT CCTGATCACC ACCCATGCAG TGATCGTGCT GCTGTTCCTG GCGCGCCTCG GGGTTCTCTA CTGGGGCCGG CCCAACCCGT TGCAGATGGG CGACCCGCAT ACCCTGCCGG AACGGCGCCT CGAGGCACTG GCGCGGCGCG AACCGGAGGC CTGGTCCAGT CTGACCGGCT ACCTTACCGA GGATGACGAG GGGCGTGCCC GCCCGGTCGA TCCCGGTGAG TTGGAACGCC TGCACAAGGA CTTCGAATAC CTGCTGCAGC CGGTGTTTGC CGAGCTGGGC CGGCTGGCGG GCGAGCCCCT GCTGGAGGCC GCCGCGCGGC GGCACGAGGC CCTGCGTGAT CGCTTCCGGC TGGTCGCCCG CCTGGAGCAG CAGATGTATG ACCACCTGCG CGCGACTGTG ACGGATATGG AGACCGCGGC CGACGAGGTA GCGCGGGCCC TGGAGGGCCG CGTGCTCGAG GCCCTGCGCG GATTGCTGGA TGCCGCGACC ATGGCCGTGT ACTCCAGGGA TCGCGAAGAC CTGGCCTGGC TACGCGACAT GACGGCCAAT CAGCGCAGCC GCTTCGATGC CCTGCGCGCC CGCTACCTCG ACCCCGATCA TTTCCAGCCT GGAGGTGCGC TGGACCGGGT GCTGCAGCGC ACCCACCGCG TGGAGCGCAT GCTCTGGATG CTGGCCGAGT ACGCGGATCG GCGGCTGGAA CCGACGGTGA GCCCGGTTCA GGGTGATCGG CGTGCACTGC AGAGGCGATT CCTGCGCGCC GGCTGA
|
Protein sequence | MGLALAILAP FTAGASSPPS AADIGTPIPI AQQSAPETLR SGWFERPPYQ AGSDSASRAS PAGLDVRVAQ EALAGVGRGA EFTPLSWNDQ LAALRHGAID FVIGTYFDSG RLDYAHYSQP YRQERNALYV RAGEHGDYSF QDRAALVQAV RDGEFRLGVT TGYAYASSEL ADLVADPGEA ARLYPAAGDE AHIQALIKGR IDGFVADPVI VDLLLARTDR ARAIQTHPLD LGTVGVRVMF SRATVPAAFV EQFNAQLQAL EDSRRMAALE VEHVLPAFLS MATSEAWFNT LTLLGILAFS ASGLILARRE RYNLFGALVL ASLPAIGGGV LRDLLLGRDP IFIFETPEFL LVPIVVVGVG FVLYKLHDHV LVRWHVVQEL LTRQQAGLGG RVVGNLQRGL DAWAVASFTV IGVGVAVESQ ASPLWLWGPV MAVVTASFGV IMRDVVRSDF NIDMLKRDSF AEISVLGGIL YSLILLWPPV ELSLGFILIT THAVIVLLFL ARLGVLYWGR PNPLQMGDPH TLPERRLEAL ARREPEAWSS LTGYLTEDDE GRARPVDPGE LERLHKDFEY LLQPVFAELG RLAGEPLLEA AARRHEALRD RFRLVARLEQ QMYDHLRATV TDMETAADEV ARALEGRVLE ALRGLLDAAT MAVYSRDRED LAWLRDMTAN QRSRFDALRA RYLDPDHFQP GGALDRVLQR THRVERMLWM LAEYADRRLE PTVSPVQGDR RALQRRFLRA G
|
| |