Gene Information |
Locus tag | TK90_0956 |
Symbol | |
ID | 8806712 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | - |
Start bp | 1014689 |
End bp | 1016959 |
Gene Length | 2271 bp |
Protein Length | 756 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | |
Product | binding-protein-dependent transport systems inner membrane component |
Protein accession | YP_003460207 |
Protein GI | 289208141 |
COG category | |
COG ID | |
TIGRFAM ID | |
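The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but the numbers are consistent with it); the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1014689
end_bp = 1016959
gene_length_bp = 2271
protein_length_aa = 756

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS of n codons encodes n - 1 residues once the stop codon is excluded.
codons, remainder = divmod(span, 3)
residues = codons - 1

# Each comparison should hold if the record is internally consistent.
print(span == gene_length_bp, remainder == 0, residues == protein_length_aa)
```

Under that assumption the span is 2271 bp, which divides evenly into 757 codons, matching the listed 756 aa plus one stop codon.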
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.137711 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.159136 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAGCC TATCCGCCAC CTCGACGCTC GGCCGCTTCC AGCGCTGGCG TCGCACCAAG GACCGCACCG CCCGTTACAC CATCGGCTTT TTCGGTGTGG CCGTGATTGG TGCCCTGACC CTGATGTTCG TGTATCTCGC CAGCGAAACG CTGCCGATGT TCCAGGGGGC GACTCTGGAA CCACGCACCG AGTACGCGGC TCCCGGCGGT GCCGACACGC AGACCATCCA TCTCGCGGTG AACCGCCATC GCGAGATGGC GGTGCGCCTC ACCGAAGACC GACGCGCCCT CTTCTTCCGC CCCTACAGCG GCGACATCGT GAAAGAAGAG TCGCTACCGA TCCCGGAGGA TGTGCAGGTC ACGAGCTTCA GTGCCGCCGA GCCGCGCACG CGCCTGGTGG CCGCGGGCCT GGACAACGGG CAGGTACTGG CCATCGAGTA CGAATACAAC GAGCGTTTCA GTCCCGAAGG CCGTCTGTAC GACCCGGGCC TGGTGTACCC CCTGGGGGAC GAGGACTCCG CGCTGCTGGA CGTGGACGAC GACGGCCGCG CGATCTCGGT GGTTGGGATA CAGCGCGGCT CCAGCGGCAT CCGCGTCGCA GCCGCAACAC AAGACGGCGA TCTGAAGCTG GTGCGCTTCG AGAAACGCAC CTCGATGATG ACGGGCGAGA CCGAAGTGCG ACGATCGGCC TACGACCTGC CCTCGCTCCC GGATGGCGCG ACGCCGACCC GCATTCTGCT GGATATCACC GGCCGCCACA TGCTGGTCGG CGACGACCAG GGGCAGCTGC ACTTCTTCGA CATCAATAAC CCCTCACGCG CCAGCCTGGT CGACAGCAAG CGCGTGATCC GCGGCGATGA TGCCGAGGTA ACCTCGCTGG AGTATCTGCT GGGCACGGTC TCCATCATCG TCGGGGGCTC CGACGGCACG GTCTCCCAGT ACATGCTGGT ACGCGACGAC GACAACGTGA ACCGCATCAC CCGGGTGCGC GACTTTCCCT CGCACGCCGG CGCGATCCGC AACATCCAGC CGGAATACAT CCGCAAGGGC TTCCTGACCG CCGACGATCA GGGCGAGGTG AAGATTCACT ATTCCACCTC GCAGCGGACC CTGATCGAAC GCCAGATCAC CGACCAGCCA CTGCACCGGG TGTATGTCGA CCCGCGCAAC CGCCTGCTGA TCGGAATCGA CGATCACGAG ACCTGGCATC TGCAGGATCT GTCGAACCCG CACCCGGAGG TCTCCTTCCA CGTGCTGTGG CAGAAGGTCT GGTACGAGGG ACGCTCCGGG ACCGACTACG TCTGGCAGTC GTCCTCCGCC ACCGACGAGT TCGAGCCGAA ATTCTCGCTG GTGCCGCTGA CCATCGGCAC CATCAAGGCC GCCTTCTACG CGATGCTGTT CGCCACGCCA CTGGCGATCA TGGGGGCGAT CTACAGCGCC TATTTCATGT CACGGCGCAT GCGCAGCATC ACCAAGCCCT CGATCGAACT GATGGAGGCT CTGCCGACGG TCATCCTCGG CTTCCTCGCC GGGCTGTGGC TGGCCCCGTT CATCGAGGCC AACCTGCCGG CGATCGCCAG CATCCTGATC CTGATGCCGC TGGGCATGCT GGCCGCCGCG TTCCTGTGGA CACGGGTGCT TCCGGAGAAC CTGCGCAACC TCGTGCCCGC CGGCTGGGAG GCCGCGATCC TGATCCCCGT GATCCTGGCG ATCGGCGGGT TCTCCGTCTC CATGAGCCCG CTGATCGAGG TCTGGATGTT CGGTGGCGAT GCCCGCCAGT GGCTGACCGA TCACGGCATC 
ACCTACGACC AGCGCAATGC GCTCGTGATC GGTATCGCGA TGGGCTTCGC GGTCATCCCG ACGATCTACT CGATCTCCGA GGACGCGGTG TTCAACGTGC CCAAGCACCT GACGCAGGGC TCGCTGGCCC TGGGCGCGAC CCCGTGGCAA ACGGTGGTGC GCGTGGTGCT GCTGACCGCC AGCCCCGGCA TTTTCTCGGC GGTGATGATC GGCTTCGGCC GCGCGGTGGG CGAGACCATG ATCGTACTGA TGGCGACCGG CAACTCGCCG GTGGTCAACT TCAACATCTT CGAGGGCATG CGCACGCTCT CCGCGAACAT CGCGGTGGAG ATGCCGGAGG CCGCCGTCGG CAGCACCCAC TTCCGCATCC TGTTCCTGGC CGCCCTGGTC CTGTTCGCGC TGACCTTTGC GGTAAACACG GTCGCCGAGA TCGTGCGCCA GCGCCTGCGC CAGAAATACA GCTCCCTTTG A
|
Protein sequence | MSSLSATSTL GRFQRWRRTK DRTARYTIGF FGVAVIGALT LMFVYLASET LPMFQGATLE PRTEYAAPGG ADTQTIHLAV NRHREMAVRL TEDRRALFFR PYSGDIVKEE SLPIPEDVQV TSFSAAEPRT RLVAAGLDNG QVLAIEYEYN ERFSPEGRLY DPGLVYPLGD EDSALLDVDD DGRAISVVGI QRGSSGIRVA AATQDGDLKL VRFEKRTSMM TGETEVRRSA YDLPSLPDGA TPTRILLDIT GRHMLVGDDQ GQLHFFDINN PSRASLVDSK RVIRGDDAEV TSLEYLLGTV SIIVGGSDGT VSQYMLVRDD DNVNRITRVR DFPSHAGAIR NIQPEYIRKG FLTADDQGEV KIHYSTSQRT LIERQITDQP LHRVYVDPRN RLLIGIDDHE TWHLQDLSNP HPEVSFHVLW QKVWYEGRSG TDYVWQSSSA TDEFEPKFSL VPLTIGTIKA AFYAMLFATP LAIMGAIYSA YFMSRRMRSI TKPSIELMEA LPTVILGFLA GLWLAPFIEA NLPAIASILI LMPLGMLAAA FLWTRVLPEN LRNLVPAGWE AAILIPVILA IGGFSVSMSP LIEVWMFGGD ARQWLTDHGI TYDQRNALVI GIAMGFAVIP TIYSISEDAV FNVPKHLTQG SLALGATPWQ TVVRVVLLTA SPGIFSAVMI GFGRAVGETM IVLMATGNSP VVNFNIFEGM RTLSANIAVE MPEAAVGSTH FRILFLAALV LFALTFAVNT VAEIVRQRLR QKYSSL
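The relationship between the two sequence fields can be spot-checked by translating the ends of the gene sequence. The sketch below is an illustration, not the database's own method: it builds the codon table by hand (translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in permitted start codons) and translates the first three and last three 10-base blocks listed above. Although the gene lies on the minus strand, the gene sequence as listed begins with ATG and ends with TGA, so it appears to be given in coding orientation and is translated directly. The names `translate`, `first_block`, and `last_block` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last blocks of the gene sequence, copied from the record.
first_block = "ATGTCCAGCC TATCCGCCAC CTCGACGCTC"
last_block = "CAGAAATACA GCTCCCTTTG A"

print(translate(first_block))  # MSSLSATSTL
print(translate(last_block))   # QKYSSL
```

Both results agree with the start and end of the listed protein sequence.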