Gene Information |
Locus tag | TK90_1530 |
Symbol | |
ID | 8807299 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | + |
Start bp | 1630254 |
End bp | 1632464 |
Gene Length | 2211 bp |
Protein Length | 736 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | 1,4-alpha-glucan branching enzyme |
Protein accession | YP_003460770 |
Protein GI | 289208704 |
COG category | |
COG ID | |
TIGRFAM ID | |
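The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the stop codon (the listed gene sequence ends in TGA). The variable names are illustrative and are not part of the record.

```python
# Values copied from the gene record above.
start_bp = 1630254
end_bp = 1632464
gene_length_bp = 2211
protein_length_aa = 736

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
# Assumption: the final codon is the stop codon and is not counted in the protein length.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks print True for this record.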
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.473268 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0000810495 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGACAACTG CCTCCAGCAA ATGCATCACA CCCGTTGCCG ATACCCTGGC GCGTTTGCCG GAGGCGCGTC TCCATGATCC CCACAGCGTC CTCGGGTGCC ATGAGCTGGA TCAAAAGACT GCCTGCTACC GTGCCTGGCT GCCCCATGCC AGCGAGGTCT CGATCACACT TTCGGACGGA TCCGGCGAAT ACTTCACGCC GAGTACCAAA ATCCCCGGAC TCTTCACTTG GCAAGGCCGA GCCAGCGCCC TCCCCAGGCA CCCGGAACTG AGCTGGAAAC CCGAGGACCG CAGCGACACC CTGCGAACCG TCGATCCGTG GAGCTTCGCA CCGGACATCC CTTCGTTCGA CCGTCACCTG TTCGCCGAGG GCCGCCACTG GCACGCCCAC CGCATGCTCG GCGCCCATCC GCAAACGCGC GACGGGATCG ACGGCGTGCG TTTCGCCGTC TGGGCGCCCA ATGCCGAGCG CGTCTCCGTC GTCGGCGACT TCAACCGCTG GGACGGGCGC TGCCATCCCA TGACCGTCCA CCCGGGCAGC GGCATCTGGG AGCTGTTCGT CCCCGGCCTT GCCATGGAGA CGCTGTACAA GTTTGAAATC CGCAACCGTG ACCACGGCAC CATCCACCTC AAGACCGACC CCTACGCCCG CGCCTACCAG GTGCGCCCGG AGACGGCCGC CCGCATCCAG CCGCAAAGCG AGTACGCCTG GGGTGATGAC GACTGGCTGC AAAACCGCGA CCCCGAGGGC TGGAAACACC GCCCGATGAG CATCTACGAG GTGCATCTCG GCTCCTGGCA GCGCAGCCCC GAGGGCCACT GGCCCAACTA CCACGATCTG GCGGAACGCC TCGTGCCCTA CGTCCGTGAC ATGGGCTTTA CCCATATCGA GCTGCTGCCG GTCACCGAAC ACCCCTTCGA CGGATCCTGG GGCTACCAGA GCACCGGCTT CTTCGCGCCC ACCTCGCGTT TTGGCAACGC GGACGACTTT CGCTACTTCG TCGACCAGGC GCACCAGGCC GGGATCGGCG TACTGCTCGA CTGGGTGCCC GGCCACTTCC CGCGCGACGA ATTCGCCCTC GCTCGCTTCG ACGGCACCGC ACTGTACGAA CACGAAGACC CACGCATCGG CGAACACCGG GGCTGGGGCA CGCTGATCTT CAACTATTCA CGTCCCGAGG TACGCAACTT CCTGCTCTCC AGCGCCCTGT GCTGGGTGGA GGACTTCCAC ATCGACGGCC TGCGCGTGGA TGCCGTCGCC TCCATGCTCT ACCGCGACTA CGACCGCGAA CCGCACGACT GGCTGCCCAA CATCCACGGC GGCAACGAGA ATCTCGAGGC CGTCGACTTC CTGCGCCACC TCAACGAGAC CGTCCAAACC GAGCATCCCG GCGTCGTGAT GATCGCCGAA GAATCCACCG CCTGGCCCGG CGTCAGCCGC CCGCCGTCCA TGGGCGGACT CGGCTTCACC ATGAAATGGA ACATGGGCTG GATGCACGAC ACCCTGACCT ACTTCGGCAT GGACCCACTC TACCGCCACT ACCACCACAA CCAACTCACC TTCGGCCAGA TCTACGCCTA CAGCGAGAAC TTCGTCCTGC CCTTCTCGCA CGACGAAGTC GTGCACGGCA AACGCAGCCT GCGCGGCCGC ATGCCCGGCA ACGAATGGGA ACAGTTCGCC AACCTGCGCC TGCTCTACGC CTGGCAGTGG CTCTACCCCG GCAAAAAGCT CCTGTTCATG GGCCAGGAGT TCGGCCAGGG CCCAGAATGG TCCGAGGACC GCGAACTGGA CTGGTACGTC 
CTGCAGTACC CGCTGCACCA GGGCCTGCAA CAGCTCGTGC GCGACCTCAA CCGCGTCTAC CGCGACCACG CCGCCCTGCA CGGCCGCGAG TTCGAACCCG ACGGCTTCGC CTGGCTGGAC TGCGACGACG CCCAACACTC CACCCTCAGC TTCCTGCGCC GCGACACCCA GGGCCGCGAA AGCATCGTCG TCCTCAACCT CACCCCCGTC GAACGCGGCG CCCACCCCGT GCCGGCGCCT CATCCCGGCT CCTGGCGCGT CGTCCTCAAC ACCGACGCCC AGGCCTATGG CGGCGACTCC CGCGGCCCCG CCCACGCCCA CGCCGAACCC ATCGAGCGCA ACGGCCACCC CGCCACGCTC TATCTCCACC TGCCGCCCCT GACCGCCCTC CTCCTGGAAC CTGCCGATTG A
Protein sequence | MTTASSKCIT PVADTLARLP EARLHDPHSV LGCHELDQKT ACYRAWLPHA SEVSITLSDG SGEYFTPSTK IPGLFTWQGR ASALPRHPEL SWKPEDRSDT LRTVDPWSFA PDIPSFDRHL FAEGRHWHAH RMLGAHPQTR DGIDGVRFAV WAPNAERVSV VGDFNRWDGR CHPMTVHPGS GIWELFVPGL AMETLYKFEI RNRDHGTIHL KTDPYARAYQ VRPETAARIQ PQSEYAWGDD DWLQNRDPEG WKHRPMSIYE VHLGSWQRSP EGHWPNYHDL AERLVPYVRD MGFTHIELLP VTEHPFDGSW GYQSTGFFAP TSRFGNADDF RYFVDQAHQA GIGVLLDWVP GHFPRDEFAL ARFDGTALYE HEDPRIGEHR GWGTLIFNYS RPEVRNFLLS SALCWVEDFH IDGLRVDAVA SMLYRDYDRE PHDWLPNIHG GNENLEAVDF LRHLNETVQT EHPGVVMIAE ESTAWPGVSR PPSMGGLGFT MKWNMGWMHD TLTYFGMDPL YRHYHHNQLT FGQIYAYSEN FVLPFSHDEV VHGKRSLRGR MPGNEWEQFA NLRLLYAWQW LYPGKKLLFM GQEFGQGPEW SEDRELDWYV LQYPLHQGLQ QLVRDLNRVY RDHAALHGRE FEPDGFAWLD CDDAQHSTLS FLRRDTQGRE SIVVLNLTPV ERGAHPVPAP HPGSWRVVLN TDAQAYGGDS RGPAHAHAEP IERNGHPATL YLHLPPLTAL LLEPAD
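The relationship between the gene sequence and the protein sequence can be illustrated by translating the first three 10-base blocks of the gene. This is a minimal sketch, not the database's own pipeline. It relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code (the tables differ in which start codons are permitted, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for this example.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
# Translation table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base block separators in the record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record above.
first_blocks = "ATGACAACTG CCTCCAGCAA ATGCATCACA"
print(translate(first_blocks))  # MTTASSKCIT
```

The output matches the first block of the listed protein sequence. Passing the full gene sequence to the same function should yield the full protein, provided the sequence is copied without errors.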