Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TK90_0605 |
Symbol | |
ID | 8806354 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | + |
Start bp | 641290 |
End bp | 643290 |
Gene Length | 2001 bp |
Protein Length | 666 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | polysaccharide biosynthesis protein CapD |
Protein accession | YP_003459856 |
Protein GI | 289207790 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0935812 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.274375 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGTAC TGGATCGTAG TCTGGAGAAC TTGCTGCGAC TGCCGCGCGG CAAAAAACAG GGATTGATGG TGCTGGCGGA CATCGTAATG CTGCCGCTGG CACTGTGGCT GGCCTTCGTG ATTCGTACTG GCCAGACGGT CCCGCCGATG ATGGTCGAGG CCTGGTGGCT GTTTGCGCTG GTCCCGGCGG TCGGCGCGCT GGTGTTCGCG CGGCTCGGTT TGTATCGAAC GGTCGTGCGG TTCATGGGGG CCCGCGCGGT GCGTGCGGTG ATCGTCGGCG TCTCCGCGCT GGCGCTGATG ATCTGGGCCT CGGCCGAACT GACACAGACA CCTGGATTCT TCGGGGCGGT GGCGGTGAAT TTTGCGCTGC TGGCCTTTGC CCTGGTCGCG GCGGGCCGAT TTGTGGTGCG CAGCTGGTAC CAGGCCGTGT CCGATGCGCG TACTGGCAGC GAGCGCGTGG TCATCTTTGG CGCCGGTGAG GCAGGGGTTC AGCTGGCCTC CGGCCTGCAG TCCAGCGGCC GCTTCCGGGT GATGGCCTTC ATCGACGAGG ACCCGGCGGT CCAGGGGCGT TCGATTGACG ATATCCCGGT CTGTGGCCGC GAGGCCCTGG GCACGCTGGT GGAGCAGTGT GGCGTGCGCC GGGTATTGCT GGCGGTGCCG TCGGCGAGCC GGGCCGATCG CCGGCGCATC CTCGAATGGC TGGAGCCCTA TCCGCTGCAT GTGCAGTCGG TGCCCTCGCT GGAGGCGGTC GTCTCGGGCC GGGCCCCCCT GGATCAGCTG GAGGACCTGG ATGTCGCCGA CCTGCTGGGC CGCGATCCGG TGCCGCCCAA CGAGGCCCTG CTGGCTCGGA GCATCCGCGG ACGCGTGGTG ATGGTCACCG GCGCCGGGGG CTCTATCGGT TCGGAGCTTT GCCGCCGCAT CCTCAGTCTG GAGCCGGCCG CCCTGGTGCT GTTCGAGCGT AACGAGTTCG CGCTCTACCG GATCGAACAG GAGATCATGC AGCGGCGGTT GTCCATGCAG CCGGGCGTCG AGGTGCACGC GGTGCTGGGC TCGGTCGCCA ACCAGGCGCG CGTCGAAGAG GTCCTCCACC AGTATGGCGT GCAGACGCTC TACCACGCGG CCGCCTACAA GCATGTGCCG CTGGTCGAGG GCAATCTGCT GGAAGGCCTG CGCAACAACA GCCTGGGCAC CGCCGTGGTG GCCGATGCGG CGATCGCCGC CGGCGTCGAG CGTTTTATCC TGGTGAGCAC GGACAAGGCC GTGCGGCCCA CGAACGTGAT GGGCGCGAGC AAGCGCCTGG CCGAGATGGT GCTGCAGGAC CGCGCACGCC GTCAGAATGC AACGGTGTTC AGCATGGTGC GGTTCGGCAA CGTCCTGGGT TCTTCCGGGT CCGTCGTACC CCTGTTCCAC CGCCAGATTC GCGAAGGTGG GCCGGTCACG GTGACTCACC CGGAGATTAC CCGCTACTTC ATGTCGATCC CGGAGGCCGC CGAGCTGGTG ATTCAGGCCG GGGCGATGGC GCAGGGGGGC GAGGTCTATG TGCTCGACAT GGGCGAACCG GTGCGCATCA TCGACCTGGC CCGGCGCATG ATCCGTCTGC ACGGCCATCA GGTCCGGGAG GACGAGGCCG AAGGGGAAGG TATTGCCATC ACCTGTACCG GCCTGCGTCC CGGCGAGAAG CTCTACGAGG AATTGCTGAT CGGGGATAAC GTGGAAGAGA CCCCGCACAG CAAGATCATG CGTGCACGCG AAACCTGTGC GGAGCCGGCG GTTCTCGCCC GCGCCCTGCG CACGCTGGAG AAGGCCGACC GCGAACGCAA TGCCCAGGCC GCGTTCGAGG TGCTGCGCGA GGTCGTCGAT GGTTACGAGC CGGGCCGCGA TGAAGAGGCC GCTCACAGTG CCTCTGCACC GGTCGGACGG CCCCGCCCGG CTCCGGCCCC GACCGTGCGT CCGGCGGCAG TCGCCTATGC ACGGGTGGCC GATCCGGTGG TTCCGCGCTA G
|
Protein sequence | MAVLDRSLEN LLRLPRGKKQ GLMVLADIVM LPLALWLAFV IRTGQTVPPM MVEAWWLFAL VPAVGALVFA RLGLYRTVVR FMGARAVRAV IVGVSALALM IWASAELTQT PGFFGAVAVN FALLAFALVA AGRFVVRSWY QAVSDARTGS ERVVIFGAGE AGVQLASGLQ SSGRFRVMAF IDEDPAVQGR SIDDIPVCGR EALGTLVEQC GVRRVLLAVP SASRADRRRI LEWLEPYPLH VQSVPSLEAV VSGRAPLDQL EDLDVADLLG RDPVPPNEAL LARSIRGRVV MVTGAGGSIG SELCRRILSL EPAALVLFER NEFALYRIEQ EIMQRRLSMQ PGVEVHAVLG SVANQARVEE VLHQYGVQTL YHAAAYKHVP LVEGNLLEGL RNNSLGTAVV ADAAIAAGVE RFILVSTDKA VRPTNVMGAS KRLAEMVLQD RARRQNATVF SMVRFGNVLG SSGSVVPLFH RQIREGGPVT VTHPEITRYF MSIPEAAELV IQAGAMAQGG EVYVLDMGEP VRIIDLARRM IRLHGHQVRE DEAEGEGIAI TCTGLRPGEK LYEELLIGDN VEETPHSKIM RARETCAEPA VLARALRTLE KADRERNAQA AFEVLREVVD GYEPGRDEEA AHSASAPVGR PRPAPAPTVR PAAVAYARVA DPVVPR
|
| |