Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TK90_0554 |
Symbol | |
ID | 8806295 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | + |
Start bp | 588198 |
End bp | 589295 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | Protein of unknown function DUF2066 |
Protein accession | YP_003459805 |
Protein GI | 289207739 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCGGT GTTTCCCTGG GGCGCTGTGG TATTTTGGGC GCCTGTCGAA AACGCCATTC AACCTTATGC TGCGCCTCTT TCGTCCGTCC CGCGCCCTGG CTGCGTGCCT GCTCCTGTGT CTGCTGGTGG GGGTCTCCCC GTTGCTGGCA GATGATGCCG TGGTGGTGAA TGTCCATGCG GAGGACGAAA CGGCCGCGCT GGCGCAAGGC CTGGAACAGG CGCTGGTGCG CGTGGCCGGG CAGCGTTCGC CGGAGCTGAG CGAACTGGTG CAGGTCCTGT TGCTGGAGCT GGACGAGGAG GGGCGTATTG AGGCCCTGCG TCGGCGCCAG GAGGTGGACG ACGGGGAATA CCGCCTGGAA TTCGATCGCG ACCGGCTACG CAGTGCCCTG CGCGCGGCCG ACGTGCCCGT AGTGCTGGGC GCGCGCCCGC AACTGCTGGT TTGGGCGGTA TACGAGGAGG GCGGGCGCCG CCACCTGCTG GGCTCCACCC AGGACGAGGC CGATGTGCTG GCCGGGATTG ACGCGCTGGG GCGCGATCGC GGCACGCCTT TCGTGCTGCC GCTGGGGGAT CTGGAAGATC GCCGCGCCGC GCAGGCGGGG GATGTGATCG GTGGCGTGAC CGAGCCGCTG CAGGAAGCGG CCAGGCGCTA CGGGTCGACG GGTATCGTGG CGCTGCATGT ACGCCCGGTC GGGGGTGGTG CCGAGGCGCG TGCCATCACC GTGCATGGTG GGCGCGAGTA CCGTAGCGAG GGCCGTGGCG AATCCCCGGC CGCCGCGGCC CGCGCGGCGG TTGAGGATGG TCTTGACCAG GTCGTGGCAC CGCTGGCGCG CGTTGCCGCC GAGCCCGACT GGGTGAAACT GGGCTTTGTC GGGGTCGAGG GCTTTCCCGC GTTCGAGGCA TTGCGGCGGG AGCTGGCGCA GATCGAGGCC ATCGAGAGTG CAAGGCTGGA TTCACTGGGT GGTAACGCGG TGACGCTCGA GGTGCGCACG GGACTGGCGC CGGATGACCT GGTAGACCTC TTGCACGCCA GCGGCTACGG CGTGGCGGAA GACCCGCACG GCGATTCCGA GGCGAAGGCT TGGCTGCGCC GCGAGTGA
|
Protein sequence | MDRCFPGALW YFGRLSKTPF NLMLRLFRPS RALAACLLLC LLVGVSPLLA DDAVVVNVHA EDETAALAQG LEQALVRVAG QRSPELSELV QVLLLELDEE GRIEALRRRQ EVDDGEYRLE FDRDRLRSAL RAADVPVVLG ARPQLLVWAV YEEGGRRHLL GSTQDEADVL AGIDALGRDR GTPFVLPLGD LEDRRAAQAG DVIGGVTEPL QEAARRYGST GIVALHVRPV GGGAEARAIT VHGGREYRSE GRGESPAAAA RAAVEDGLDQ VVAPLARVAA EPDWVKLGFV GVEGFPAFEA LRRELAQIEA IESARLDSLG GNAVTLEVRT GLAPDDLVDL LHASGYGVAE DPHGDSEAKA WLRRE
|
| |