Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TK90_0621 |
Symbol | |
ID | 8806370 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | + |
Start bp | 660243 |
End bp | 662423 |
Gene Length | 2181 bp |
Protein Length | 726 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | |
Product | capsular exopolysaccharide family |
Protein accession | YP_003459872 |
Protein GI | 289207806 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.242447 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.136345 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAAGA ACCATTCCCC CGGGGTGCCG GAAGGCGAAC TGGATATTGG CCGACTGCTT GGCCAGTTGC TGGACCACAA GTGGCTGATC ATTTTTATCA CGGCGCTGTT TGCTATTGCG GGCGCGGTAT ACGCGACGCT GTCCACCCCG ATCTACCGGG CGGATGCGCT GGTGCAGGTG GAGGACATGG GACGCTTCAG CAGCCCGTTG TCCAGTGTGC GCGAGATGCT GGGGCAGGAG CCGCGCGTCG AGGCCGAGCT GCAGATCCTG CGTTCACGCA TGGTCCTCGG CCGCACGGTC GATCAGGAGG CGCTGGATCT GGTGGTACGC CCGCACCGTA TGCCGGTGGT GGGCGATTAC CTCGTTCGCC GGGGCGTGGA GCGCCCCGCC TTTGCTCAGT CCAGTGTCTG GGCGGGCGAA TCGATCAACG TGGGCGAGTT CCAGGTGGCC CGAGCCTACG AGGGGCGCAC CTTTACGCTG GAGGTTCTGG ATGACGATCG TTATCGCCTG ATCAACGGGG ATCATGAACT GGGCGAGGGG CGTATTGGCG AGGACGTGGA GTTCTTTGAG GGTGATGTAC GCCTGCGTGT GGCCGAAATG GATGCGGGGC CGGGGGCGCG TTTTGATATC CAGCGGCGCT CGCGCCTGGC GGCGATCAAC AGCTTGCGTG GTCAGTTCGA GGCGGGGCAG CAGGGCCGCG AGAGTGGCGT GTTCAGCCTG ACCCTGACCG ATCCGGACCC GCAGCAAGCC CAGCGCATCC TGAACACCAT CAGCCAGATC TATCTGACCC AGAACGTCCA GCGCAAGGCC GCCGAAGCCG AGAAGAGCCT CGAATTCCTG GAGGAGAAGG TGCCGGATGT GCGCGATCAG CTGCAAACGG CCGAGGATGC GCTGAACGAG TACCGCACAG AGCGCGACAG CGTGGACCTC TCGCAGGAGA CCCGTGCCGT GCTGGACCGC CTGGTCAACC TGGAACGTCA GCTCAACGAG CTGGAGTTCG AGGAGGCCGA GATCTCCCGG CGGTACACGC CCAGCCATCC GACCTATGCG GCCCTGATCG ACAAGCGCCA GAAGCTGCAG GAGGAGCGCA ACCGGCTGAA CCAGGAGGTC AGCGCGCTGC CGGAGACCCA GCAGCAGATC CTGCGTCTCA ACCGGGATGT GGAGGTCAAC CAGGACATCT ACGTGCAGCT GCTCAATCGC ATGCAGGAGA TGGATATTGC CCGCGCCAGT ACGGTGGGCA ACGTCCGTAT CCTGGACGAT GCCCAGGCGG GGCTGTCCCC GGTCGAACCG CAGCGCAACA TGATCGTGAT GCTGTCCACG GTATTTGGCG GTGGCCTGGC GATTGCGCTG GTGCTGGTGC GCGGGATGTT CAACCGCGGG GTGGAATCAC AGGAGGAGAT CGAGGGGATC GACCTCCCGG TGTATGCCAG CGTGCCGCGT TCCAGGGCGC AGGAGAATCT GCTGCGCCGG GTGAAACGGC GCCACCAGTC GCGGGGGCAT GACGTTCCAG GGGATGTACT GGCCTCGATC GACCCGGCCG ACGACGCCAT CGAGGCCCTG CGTGGCCTGC GCAGCAGCCT GCATTTCGCG ATGCTGGAGG CCGAGGACAA CCGGCTGGTG ATTACCGGTC CGAGCCCGGA TGTGGGCAAG AGTTTTGTTT CGGTTAACCT GGCGACCGTT TGTGCGCAGG CGGGACAGCG CGTGCTGATC ATTGACGCCG ACCTGCGCAA GGGGCATGTC CACCACGCCT TCGGTCAGCG CAGCGACGGG GGACTCTCGG AGTTTCTGGC GGGCCAGGAA TCCCTGGAAG GGGTGCTGCG CCGCACGGAT ATCCAGGGGC TGGACTACAT CGCCCGTGGC AGCGCGCCGC CGAACCCGTC GGAGTTGCTG ATGAGCGACC GTTTCAGCCG CATGCTGGAG CAGCTCAGCC GCGACTACGA TCTGGTGCTG ATCGACACGC CGCCGATCCT GGCGGTGACC GATGCCGCCG TGGTCGCACG CCAGTGCTCG ACCACCTTGA TGGTGGTGCG CTTCCAGCTG AACCCGCTGC GCGAGATCGA GTCGGCACGG CGCCGTCTCG ATGCGGCGGG AGTCGACGTG CGTGGCTGTA TCCTGAACTC GATCGAGTAC AAGGCCTCGA CCAGCTACGG CTACGGTTAT TATCACTACT CCTACAAGTA G
|
Protein sequence | MTKNHSPGVP EGELDIGRLL GQLLDHKWLI IFITALFAIA GAVYATLSTP IYRADALVQV EDMGRFSSPL SSVREMLGQE PRVEAELQIL RSRMVLGRTV DQEALDLVVR PHRMPVVGDY LVRRGVERPA FAQSSVWAGE SINVGEFQVA RAYEGRTFTL EVLDDDRYRL INGDHELGEG RIGEDVEFFE GDVRLRVAEM DAGPGARFDI QRRSRLAAIN SLRGQFEAGQ QGRESGVFSL TLTDPDPQQA QRILNTISQI YLTQNVQRKA AEAEKSLEFL EEKVPDVRDQ LQTAEDALNE YRTERDSVDL SQETRAVLDR LVNLERQLNE LEFEEAEISR RYTPSHPTYA ALIDKRQKLQ EERNRLNQEV SALPETQQQI LRLNRDVEVN QDIYVQLLNR MQEMDIARAS TVGNVRILDD AQAGLSPVEP QRNMIVMLST VFGGGLAIAL VLVRGMFNRG VESQEEIEGI DLPVYASVPR SRAQENLLRR VKRRHQSRGH DVPGDVLASI DPADDAIEAL RGLRSSLHFA MLEAEDNRLV ITGPSPDVGK SFVSVNLATV CAQAGQRVLI IDADLRKGHV HHAFGQRSDG GLSEFLAGQE SLEGVLRRTD IQGLDYIARG SAPPNPSELL MSDRFSRMLE QLSRDYDLVL IDTPPILAVT DAAVVARQCS TTLMVVRFQL NPLREIESAR RRLDAAGVDV RGCILNSIEY KASTSYGYGY YHYSYK
|
| |