Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TK90_1868 |
Symbol | |
ID | 8807641 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | + |
Start bp | 1984003 |
End bp | 1985961 |
Gene Length | 1959 bp |
Protein Length | 652 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | acetate/CoA ligase |
Protein accession | YP_003461095 |
Protein GI | 289209029 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.477544 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGACA CCATCGAGTC CACGCTGCAC GAGACCCGTC ACTTCGCGCC GCCGGCGGAG TTCACGAAGA ACGCCCGTCT GAAGCCCGAA GATCTCGAGG CCCTGCACAA GAAGGCCGAC GCCGACTACG AGGGCTTCTG GTGTGACCTC GCGCGCGAGA AGATCGACTG GCAGACCGAG TTCACCGAGG GTCTGGACAG CTCGAACGCA CCACACTACC GCTGGTTCGC CGACGGGCGC ATGAACGTCT CCTACAACTG CATCGACCGT CATCTTGAAG TGCGCGGCAA CAAGACCGCG ATCATCTTCG AGGCGGAAAA TGGCGAGACC CGCAACCTGA CCTACCGCGA CCTCTACAAC GAGGTCGGGA AGCTCGCCAA CGCGCTCAAG ACCATGGGTG TTGAAAAGGG CGACCGGGTG ATCATCTACA TGCCGATGAA CGCCGAGGCC GTGATCGCCA TGCAGGCCTG CGCCCGCATC GGCGCGATCC ACTCGGTGGT CTTCGGGGGA TTCTCCGCCG ATGCCCTGCG TGACCGCATC GTCGACTCCG GCGCCAAGCT GGTCATCACG GCGGATGGTG GCGTGCGCGG CGGCAAGACC GTAGCCCTCA AGGCCAATGT CGACAAGGCG CTGGACTCCA ATCCGGCCGA CGTCGAGAAG GTCATCGTCT GCAAGCGCGC CGGCAACGAC GTGACCATGC AGGACGAACG CGACATCTGG TGGGACGACG CCGTGGATGG CGAGTCGGTC GACTGCGAAC CGGAATGGGT CGAGTCCGAG CACCCCCTTT TCCTGCTCTA TACCTCCGGG TCCACCGGCA AGCCGAAGGG CATCCAGCAC TCCAGCGCCG GATACCTGCT GGGCGCGATT ATCACCAACC AGTGGGTGTT CGACCTGCAG GGCGACGACG TCTACTGGTG CACCGCCGAT GTCGGCTGGA TCACCGGCCA TACCTACGTC GCCTACGGCC CGTTGGCGGT CGGCGCGACG CAGGTCGTCT ACGAGGGCGT GCCGACGGTG CCCGATGCCG GCCGCTGGTG GAAAATGTGC CAGGACCACG GCGTGACCGT GTTCTACACC GCCCCGACCG CGATCCGCGC GCTGATGAAG GCCGGTGATG ACTTCCCCGC CCAGTACGAC CTGTCGAAGT TGCGGCTGCT GGGCACGGTC GGCGAGCCGA TCAACCCCGA GGCCTGGATG TGGTACTACC GCGTGATCGG CGGTGAACGC TGCCCGGTGG TCGACACCTG GTGGCAGACC GAGACCGGCG CCAACATGAT CGCCCCGATC CCCGGTGCCA CCACCCTGGT GCCCGGGTCC TGCACCCAGG CCCTGCCCGG CATCGACGCC GACGTGGTGG ACGAGAACGG CAACAGCCTG CCCGCCGATC AGGGCGGATA CCTGGTAATC AAGAAGCCGT GGCCGTCGAT GCTGCGCACC GTGTGGGGCG ACGATCAGCG CTACAAGGAC ACCTACTGGC CCAAGTTCGA CGGCAAGTAC TACCTGGCCG GCGACTCCGC GCGCCGCGAT TCCGAAGGCA ACTTCTGGAT CATGGGCCGC ATCGACGACG TGCTGAACGT CTCCGGCCAT CGTCTGGGCA CCATGGAGGT CGAATCGGCC CTGGTCGCAC ATGCCGAGGT GGCCGAGGCC GCCGTGGTCG GGCGTCCGCA TGATGTGAAA GGCGAGGCCA TCGTGGCCTT CGTGATCCTG AAGGGCGACC GCCTGACCGG CGACGAGGCC GACGCGATGA TCAAGGACCT GCGCAACTGG GTGGCCGATC AGATCGGCCC GATCGCCAAG CCGGACGACA TCCGCTTTGC CGACGGCCTG CCCAAGACCC GCTCCGGCAA GATCATGCGC CGGCTGCTGC GCTCCATCGC CAAGGGCGAG GAGATCACCT CGGACACCTC GACGCTCGAG AACGAGGCCG TGATCCCGCA GCTCCAGGGC AAGGCCTGA
|
Protein sequence | MSDTIESTLH ETRHFAPPAE FTKNARLKPE DLEALHKKAD ADYEGFWCDL AREKIDWQTE FTEGLDSSNA PHYRWFADGR MNVSYNCIDR HLEVRGNKTA IIFEAENGET RNLTYRDLYN EVGKLANALK TMGVEKGDRV IIYMPMNAEA VIAMQACARI GAIHSVVFGG FSADALRDRI VDSGAKLVIT ADGGVRGGKT VALKANVDKA LDSNPADVEK VIVCKRAGND VTMQDERDIW WDDAVDGESV DCEPEWVESE HPLFLLYTSG STGKPKGIQH SSAGYLLGAI ITNQWVFDLQ GDDVYWCTAD VGWITGHTYV AYGPLAVGAT QVVYEGVPTV PDAGRWWKMC QDHGVTVFYT APTAIRALMK AGDDFPAQYD LSKLRLLGTV GEPINPEAWM WYYRVIGGER CPVVDTWWQT ETGANMIAPI PGATTLVPGS CTQALPGIDA DVVDENGNSL PADQGGYLVI KKPWPSMLRT VWGDDQRYKD TYWPKFDGKY YLAGDSARRD SEGNFWIMGR IDDVLNVSGH RLGTMEVESA LVAHAEVAEA AVVGRPHDVK GEAIVAFVIL KGDRLTGDEA DAMIKDLRNW VADQIGPIAK PDDIRFADGL PKTRSGKIMR RLLRSIAKGE EITSDTSTLE NEAVIPQLQG KA
|
| |