Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TK90_2142 |
Symbol | |
ID | 8807917 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | + |
Start bp | 2262973 |
End bp | 2265822 |
Gene Length | 2850 bp |
Protein Length | 949 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | excinuclease ABC, A subunit |
Protein accession | YP_003461368 |
Protein GI | 289209302 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.364139 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACAGCA TCAGCATCCG CGGGGCCCGC ACCCACAACC TGAAGAACAT TGATCTCGAC CTGCCGCGCG ACCGGCTGAT CGTGATCACG GGCGTGTCCG GCTCCGGCAA GTCGTCGCTG GCCTTCGACA CCCTGTTTGC CGAGGGACAA CGCCGCTACG TGGAGAGCCT GTCCACCTAC GCCCGCCAGT TCCTGTCGAT GATGGAAAAG CCGGACGTGG ATCAGATCGA GGGGCTCTCC CCGGCGATCT CCATCGAGCA GAAGAGCACC TCGCACAACC CGCGTTCCAC CGTCGGCACC ATCACCGAGA TCTACGACTA CCTGCGCCTG CTGTTCGCGC GTGCGGGCGA GCCGCGTTGC CCCGAACACG GTGAGACCCT GGCGGCCCAG ACCATCTCGC AGATGGTGGA TCAGGTGCTG GATCTTCCCG AGGGCGAGAA GGTGATGCTG CTGGCCCCGA TCGTGCGCGG ACGCAAGGGC GAGCACCACA AGCTGCTGGA GTCCATGCGC GCCCAGGGGT ACCTGCGCGC CCGCATCAAC GGCGAGATCC ACGACCTCGA CGCCTTGCCC GCGCTGGACC CGAAGCGCAA GCACAACATC GAGATCGTGA TCGACCGCTT CAAGGTCCGC GAGGACCTGC GGAGCCGCCT GGCCGAGTCC TTCGAGACGG CGACCGAACT GGCCGACGGG CTGGCGCTGA TCGCCTGGAT GGACGAGCCC GAACGCGAGC CAATGATCTT TTCCGCGCGC TATGCCTGCC CCGTCTGCGG CTATGCGCTG CGCGAGCTGG AGCCCCGGCT GTTCTCGTTC AACAATCCGG CCGGCGCCTG CCCCACCTGT GATGGCCTCG GCGTGCGCCA GTACTTCGAC CCGGAGCGGG TCGTCTCGCA ACCCGAACTG AGCCTGGCCG GCGGCGCAGT GCGTGGATGG GACCGGCGCA ATGCCTGGTA CTTCAGCCTC CTGAACAGCC TCGCCCGCCA TTACGGCTTT GACGTCGAGA CGCCGTGGCA GGAGCTGCCG GCCGAGGTCC GCGCCGTGGT GCTGCACGGC TCCGGCAAGG AGAAGGTCAA GCTGTCGACG CCGGCCGCGA ATGGCCGCCT GCGTACCGAC GAGCGCGTGT TTGAAGGAGT GATCCCGAAT CTGGAGCGGC GCTACCGCGA GACCGACTCC GCCGCCGTAC GCGAGGAGCT AGGGCGCTAT CTGGCCGAAC AGGCCTGCCC CGAGTGCCAC GGCACGCGGC TGAACTCCGC GGCGCGTCAT GTGTTCGTCA ATGACCTGAC ACTGCCCGAG GTCACGCACA TGAGCGTGGC CGACGCACGG GCCTTCTTCG CTGAGCTGCG CCTGCCGGGC CGCTTCGCCC AGATCGCGGA GAAGATCGTG CGCGAGATCG GCGCACGCCT GGGCTTCCTG AACGATGTCG GGCTCGACTA CCTGACCCTG GACCGCTCGG CCGACACCCT GTCCGGCGGC GAGGCCCAGC GCATCCGGCT GGCCTCCCAG ATCGGGTCGG CCCTGGTAGG GGTGATGTAC ATCCTCGACG AACCCTCGAT CGGCCTGCAC CAGCGCGACA ACGAACGCCT GTTGAAGACT TTGTGTCACC TGCGGGACCT CGGCAACACC GTGGTGGTCG TGGAGCACGA CGAGGATGCG ATTCGCGCGG CGGATCATGT CGTCGACATC GGCCCCGGTG CGGGCCGCCA TGGCGGCCAG GTCGTGGCCG CCGGCACCCC GGACGAGGTG GCCCGCACAC CGGACTCCCT GACTGGGGCC TATCTCACCG GTACGCGCGC GATCGCCGTC CCGGAGAAGC GCACCGAACC GGACCCCGAA CGCGAGATCC GCGTGATCGG CGCGCGCGGC AACAACCTGA AGGGCGGAGA CTTCGCGTTT CCCACGGGCC TGCTGACCTG CGTCACCGGG GTCTCGGGCT CCGGCAAGTC GACCCTGGTC AACAACACGC TCTACCCGGC CGTCGCCGTG GCCCTGCATG GCGGGCGCCA CACGATCGCG CCGCACGAGC GCATCGACGG CCTCGAACTG ATCGACAAGG TCGTCGACAT CGACCAGAGT CCGATCGGCC GCACGCCGCG CTCGAACCCG GCGACCTACA CCGGCCTGTT CACCCCGATC CGCGAGCTGT TTGCCGCCAC CGCCGAGGCC CGCTCGCGCG GCTACAAGCC AGGGCGCTTC TCGTTCAACG TGCGCGGCGG CCGCTGCGAG GCCTGCCAGG GAGATGGCGT GATCAAGGTG GAGATGCACT TTCTTCCGGA TGTCTTCGTC CCCTGCGACG TCTGCAAGGG CAAGCGCTAC AACCGCGAGA CCCTGGAGGT CCGGTACAAG GGCAAGAGCA TCCACGAAGT GCTGGAGATG ACCGTCGAGG AGGCCCTGGA GTTCTTCCAG CCGGTCCCCG TGATCCATCG CAAGCTCGAG ATGCTGATGG AGGTCGGGCT GTCCTACATC CAGCTCGGTC AGAACGCCAC CACGTTGTCC GGTGGCGAGG CCCAGCGCAT CAAGCTCGCA CGCGAGCTGT CCAAACGCGA TACCGGCCGC ACCCTGTACA TCCTCGACGA GCCCACGACC GGCCTGCACT TCGAGGACAT CGCCCAGCTC TTGCGGGTGC TCCACCGCCT GCGCGACCAC GGCAACACGA TCGTCGTGAT CGAGCACAAC CTCGACGTGA TCAAGACCGC CGATTGGCTG ATCGACATTG GACCGGAGGG CGGTAGCGGT GGTGGCGAGC TGCTCGTCGC GGGCACGCCC GAGGCGGTGG CTGGCACTCC AAGCAGCCAC ACGGGGCGCT TCCTCGCTCC TTTGCTGGGG ACGCAGCAAA GGAGCGCCGC TTCCGCCTGA
|
Protein sequence | MDSISIRGAR THNLKNIDLD LPRDRLIVIT GVSGSGKSSL AFDTLFAEGQ RRYVESLSTY ARQFLSMMEK PDVDQIEGLS PAISIEQKST SHNPRSTVGT ITEIYDYLRL LFARAGEPRC PEHGETLAAQ TISQMVDQVL DLPEGEKVML LAPIVRGRKG EHHKLLESMR AQGYLRARIN GEIHDLDALP ALDPKRKHNI EIVIDRFKVR EDLRSRLAES FETATELADG LALIAWMDEP EREPMIFSAR YACPVCGYAL RELEPRLFSF NNPAGACPTC DGLGVRQYFD PERVVSQPEL SLAGGAVRGW DRRNAWYFSL LNSLARHYGF DVETPWQELP AEVRAVVLHG SGKEKVKLST PAANGRLRTD ERVFEGVIPN LERRYRETDS AAVREELGRY LAEQACPECH GTRLNSAARH VFVNDLTLPE VTHMSVADAR AFFAELRLPG RFAQIAEKIV REIGARLGFL NDVGLDYLTL DRSADTLSGG EAQRIRLASQ IGSALVGVMY ILDEPSIGLH QRDNERLLKT LCHLRDLGNT VVVVEHDEDA IRAADHVVDI GPGAGRHGGQ VVAAGTPDEV ARTPDSLTGA YLTGTRAIAV PEKRTEPDPE REIRVIGARG NNLKGGDFAF PTGLLTCVTG VSGSGKSTLV NNTLYPAVAV ALHGGRHTIA PHERIDGLEL IDKVVDIDQS PIGRTPRSNP ATYTGLFTPI RELFAATAEA RSRGYKPGRF SFNVRGGRCE ACQGDGVIKV EMHFLPDVFV PCDVCKGKRY NRETLEVRYK GKSIHEVLEM TVEEALEFFQ PVPVIHRKLE MLMEVGLSYI QLGQNATTLS GGEAQRIKLA RELSKRDTGR TLYILDEPTT GLHFEDIAQL LRVLHRLRDH GNTIVVIEHN LDVIKTADWL IDIGPEGGSG GGELLVAGTP EAVAGTPSSH TGRFLAPLLG TQQRSAASA
|
| |