Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TK90_1990 |
Symbol | |
ID | 8807764 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | - |
Start bp | 2116521 |
End bp | 2117639 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | |
Product | protein of unknown function UPF0118 |
Protein accession | YP_003461217 |
Protein GI | 289209151 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGACG TCCTGCGCGA GTGGTACCGA CGGCATTTCA CTGATCCCCA GGTGGTGATC CTGGCGTTCC TGCTGCTGGC GGGATTCCTG ATCATCGTAT TCGCCGGGCG CATTCTCGCC CCGCTGCTCG CCTCGCTGAT CATTGCCTAT CTGCTGGAGG GAGCAGTGCA GAAGCTGATA CGTCTGCATG TTCCGCGCCT GCTGGCGGTG ACGATCGTGC TGCTCGTCTT CCTGGCGCTT ACAGTCGCCG CGCTGTTTAG TGTCGTGCCG CTGATGTCCG CTCAGGTCAC ACAACTGGTC CGCGAACTGC CGGGCATGAT CCGCGAGGGG CAGGCCTTGC TCCTGCAGCT TCCCGAGCGC TACCCGCAGC TGATCACCGA CGAGCAGGTG CGCGAGTTGA TGGGCGCCAT CCAGGCCGAG GCCACCATGC TCGGACAGCG GGTGGTTTCG TTCTCGCTGG CTGGCGCCCG GCACCTGGTG GACGTCGTGA TCTACCTGAT CGTGGTGCCG CTGATGGTCT TCTTCATGCT GAAGGACCGC GACCTGATCC TCGACTGGGT GCGCAGCTTC ATGCCGCGCG ACTCGCACCT GGCCAGCGAG GTGTGGCGCG AGGTCAACGT GAAGATCGCC AGCTACGTGC GCGGCAAGTT CATCGAGATC CTGATTGTGT GGGCCGTCAG CTTCCTGACC TTCAACTGGT TCGGGCTGGA GTATGCCGTG CTGCTGTCGT TCATGGTCGG CATTTCGGTG ATCATCCCGT ATATCGGGGC GGCGGTGGTG ACGATCCCGG TAGCCGCCGT GGCCTACTTC CAGTTCGGGC TGTCCAGCGA GTTTGCGTGG CTGTTGATCG CCTACGGGGT GATCCAGTTC CTGGACGGCA ATGTGCTCGT TCCGCTGCTG TTCTCCGAGG TGGTCAATCT GCACCCGGTC GCGATCATCG CGGCGGTGTT CGTGTTTGGT GGTATCTGGG GACTGTGGGG CGTGTTCTTT GCCATCCCGC TGGCGACCCT GGTGCATGCG GTGATCAAGT CCTGGCCGCG CACCGACAAG CTCGAGGCCA GGCGCGAGGC GGAGCAGCAG GCTGCAACGC CCGAGCCCGC CGAAGCCGAC TCCAAGTAG
|
Protein sequence | MIDVLREWYR RHFTDPQVVI LAFLLLAGFL IIVFAGRILA PLLASLIIAY LLEGAVQKLI RLHVPRLLAV TIVLLVFLAL TVAALFSVVP LMSAQVTQLV RELPGMIREG QALLLQLPER YPQLITDEQV RELMGAIQAE ATMLGQRVVS FSLAGARHLV DVVIYLIVVP LMVFFMLKDR DLILDWVRSF MPRDSHLASE VWREVNVKIA SYVRGKFIEI LIVWAVSFLT FNWFGLEYAV LLSFMVGISV IIPYIGAAVV TIPVAAVAYF QFGLSSEFAW LLIAYGVIQF LDGNVLVPLL FSEVVNLHPV AIIAAVFVFG GIWGLWGVFF AIPLATLVHA VIKSWPRTDK LEARREAEQQ AATPEPAEAD SK
|
| |