Gene Information |
Locus tag | TK90_0571 |
Symbol | |
ID | 8806312 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | + |
Start bp | 600818 |
End bp | 602089 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | |
Product | protein of unknown function DUF395 YeeE/YedE |
Protein accession | YP_003459822 |
Protein GI | 289207756 |
COG category | |
COG ID | |
TIGRFAM ID | |
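The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; both assumptions are consistent with the values listed here but are not stated in the record itself. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(600818, 602089)
print(length, protein_length(length))  # 1272 423
```

With the coordinates above, 602089 - 600818 + 1 gives 1272 bp, and 1272 / 3 - 1 gives 423 aa, matching the Gene Length and Protein Length fields.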
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.484751 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.560137 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAATTCT CCAGCTTTTC AACCGCCGCA ATGATGTTCC TGGGCGGCAC GTTCGTCCTC GCCTTCATCA TGGGTGCGGT CGCGAACAAG ACCAACTTCT GCACCATGGG AGCCGTCTCC GACTGGGTGA ACATGGGTGA CACCGGGCGC ATGCGCTCCT GGCTGCTGGC CATTGCCGTG GCGATGCTGG GCGTGGCGGC GCTCGAGTTC TTCGGGGTAC TGAGCGCCGA GGGTTCCTTC CCTCCCTACC GCGGCGAGAA CTTCAACTGG CTCGGGCACG TGCTGGGTGG CTTCCTGTTC GGCATCGGCA TGACCTATGC GTCCGGCTGC GGTAACAAGA CGCTGGTGCG CGTCGGCGGC GGCAACATCA AGTCGGTGAT GGTGATGATC ATCATCGGTG TGATCGCCTT CTGGCTGACC AGTCGCGCCA CGGTCTTTGG CACCGGCCAG ACGCTGTTCC AGCTGCTGTT TGGCTGGATG GACGCGGTGT CCATCCAGAT GGAAGGTGGC CAGGATCTTG GCTCCATCAT TGCCGGCGAG AATGCTCTGG GCGCCCGCCT GGTGATGGGC CTGGTGATCG GCGTGCTGCT GCTGGCCCTG ATCTTCAAGG CCGCGGACTT CCGCAAGAGC TTCGACAACA TCCTGGGTGG TCTGGTGATC GGCCTGGTCG TGATCGGGGC CTGGTGGCTG ACCAGCAACA TCCAGATCGA CGACGGCATG GGTGGTGTGT ACAGCGCCCA GGAATACATC CAGGAATGGG ATTTCGTCGC CGATGACGAC GATGGCGAAC GTCCGGCGCT GTCCGGCCCC TGGGCCAACC AGTCGTTTAC CTTTATCAAC CCGATGGGCC AGAGCGTGGG CTACGTGTCC AGCGGCTTTG ACCGTACCAT GCTGTACTTC GGCGTGATGG CGCTGGCCGG CGTGATCCTG GGTTCGTTCT TCTGGGCCCT CATCTCGCGC AGCTTCCGTA TCGAGTGGTT CTCGTCCTTC CGTGACTTCG TCAACCACTT TATCGGCGGC ATCCTGATGG GTGTCGGCGG CATCCTGGCG CTGGGCTGCA CCATCGGTCA GGCGGTCACC GGGATTTCCA CGCTGGCGCT GGGTGGCTTC TTCACCTTTG CGTTCATCGT GTTCGGCTCC GCGCTGACCA TGAAGATCCA GTACTACAAG ATGGTCTATG AAGACGAGGC GACCTTCGGC AAGGCCCTGG TGACCTCGCT GGTGGACATG AAGCTGCTGC CGGGTGGCAT GCGCAAGCTC GAGGCCATCT GA
|
Protein sequence | MEFSSFSTAA MMFLGGTFVL AFIMGAVANK TNFCTMGAVS DWVNMGDTGR MRSWLLAIAV AMLGVAALEF FGVLSAEGSF PPYRGENFNW LGHVLGGFLF GIGMTYASGC GNKTLVRVGG GNIKSVMVMI IIGVIAFWLT SRATVFGTGQ TLFQLLFGWM DAVSIQMEGG QDLGSIIAGE NALGARLVMG LVIGVLLLAL IFKAADFRKS FDNILGGLVI GLVVIGAWWL TSNIQIDDGM GGVYSAQEYI QEWDFVADDD DGERPALSGP WANQSFTFIN PMGQSVGYVS SGFDRTMLYF GVMALAGVIL GSFFWALISR SFRIEWFSSF RDFVNHFIGG ILMGVGGILA LGCTIGQAVT GISTLALGGF FTFAFIVFGS ALTMKIQYYK MVYEDEATFG KALVTSLVDM KLLPGGMRKL EAI
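The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any analysis. The following is a minimal sketch of that step, together with a GC-fraction calculation and a basic sanity check for a complete coding sequence. The helper names are hypothetical, and the check accepts only ATG as a start codon, which is a simplification: translation table 11 also permits alternative start codons.

```python
# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3, starts with ATG and ends in a stop codon.

    Simplification: alternative start codons (for example GTG or TTG) are rejected.
    """
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in STOP_CODONS


# Usage with the first two blocks of the gene sequence shown above.
fragment = clean_sequence("ATGGAATTCT CCAGCTTTTC")
print(len(fragment), round(gc_fraction(fragment), 2))
```

Applying `clean_sequence` to the full Gene sequence field and passing the result to `gc_fraction` and `looks_like_complete_cds` would allow the GC content and Gene Length fields to be checked against the sequence itself.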