Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TK90_2109 |
Symbol | |
ID | 8807884 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | + |
Start bp | 2227153 |
End bp | 2228397 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | molybdenum cofactor synthesis domain protein |
Protein accession | YP_003461335 |
Protein GI | 289209269 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.635133 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGTGC CCAGCTGCGA TGAGTTCGAC CCCAACGCCC TGACCCTGGA ACAGGCACGC GCGGCCATCC TGCAGGCCTG CGATGCACTG ACCGAATCCG AGACCGTACC GCTGGAATCC GCGCTGGGGC GTACCCTGGC CGAGCCGGTC GCCGCCCGCA TCGACGTCCC GCCGGCCGCC GTCTCCGCGA TGGATGGCTA TGCCCTGCGC GCCGACGATG CCGTCGCGGG CCGCGAGCTA ATGCTGGCCG GGACCTCGGC CGCCGGGCAT CCCTGGGATG GCCGCGTGGA ACCGGGCCAG TGTGTGCGCA TCCTGACCGG TGCCGTGGTC CCGGACGGCG CCGATGTCGT AATCATGCAA GAACAGGTCG AGCGCGTCGG CGCGGCCGGG ATCCGCCTGA ACAACGGCGG CCTGGCGCCC GGGCACAACA TCCGCGGCCC GGGCAGCGAC ACCGCCAGCG GTACCGCTCT GTTCGACAAG GGGCAGGTCA TTGGCGCCGC CGAGATCGGC GTACTGGCCA GCCAGGGGAT CGCCGAACTG CGAGTGCTGC GCAAGCCGCG GGTCGCATTC TTCTCCACCG GCGACGAGCT GGTACCGCTG GGCGAGCCGC TGGGCCCGGG CCAGATCCAC GACTCCAACC GCCACACGCT GCGCGCGCTG CTCGCCGCCT ATCCGGTCGA GGCACTCGAC TACGGCGTGA TCCGCGATAC CGAGGCCGCG GTGCGCGAGG CGCTGGAGCG CGCCGGACAG GAGGCGGACC TGATCATCAC CACCGGCGGG GTCTCGGTGG GCGATGCCGA CCACGTGACC CGCGTGCTGC AGGCCTCGGG CCAGGTCGGC TTCTGGAAGA TCGCGATCAA GCCCGGCCGT CCGCTGGCCT TTGGCCGTTT CGGCAATGCC CACTTCCTGG GTCTGCCGGG CAACCCGGTC GCGGTGATGA CCACCTTCGC CCTGCTGGTA CGCCCCGCCC TGCAGGCCCT GTCCGGACGC GACACCACAC CGCCGCAGAC CATCAGCGCG CGTCTGGTGG AGGACCTGCA CAAGACCCCC GGGCGCAAGG ACTTCCAGCG GGCAGTGCTC GCCACCGACG ATCGTGGATG GACCGTGCGC TCGGCCGGTG GCCAGAGTTC GCATCAGCTG CGCGCAATGA GCGAGGCCAA TGCCTACATC GTGCTGCCAC GCGAATCCGA CGGGGCGCGC GTGGGTGAAT GGGTGGAGGT GCTCCCGTTC CGGGAGATTT TCTGA
|
Protein sequence | MSVPSCDEFD PNALTLEQAR AAILQACDAL TESETVPLES ALGRTLAEPV AARIDVPPAA VSAMDGYALR ADDAVAGREL MLAGTSAAGH PWDGRVEPGQ CVRILTGAVV PDGADVVIMQ EQVERVGAAG IRLNNGGLAP GHNIRGPGSD TASGTALFDK GQVIGAAEIG VLASQGIAEL RVLRKPRVAF FSTGDELVPL GEPLGPGQIH DSNRHTLRAL LAAYPVEALD YGVIRDTEAA VREALERAGQ EADLIITTGG VSVGDADHVT RVLQASGQVG FWKIAIKPGR PLAFGRFGNA HFLGLPGNPV AVMTTFALLV RPALQALSGR DTTPPQTISA RLVEDLHKTP GRKDFQRAVL ATDDRGWTVR SAGGQSSHQL RAMSEANAYI VLPRESDGAR VGEWVEVLPF REIF
|
| |