Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcr_1950 |
Symbol | |
ID | 3761150 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thiomicrospira crunogena XCL-2 |
Kingdom | Bacteria |
Replicon accession | NC_007520 |
Strand | - |
Start bp | 2143410 |
End bp | 2144303 |
Gene Length | 894 bp |
Protein Length | 297 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637786699 |
Product | dihydroorotate dehydrogenase family protein |
Protein accession | YP_392214 |
Protein GI | 78486289 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0167] Dihydroorotate dehydrogenase |
TIGRFAM ID | [TIGR01037] dihydroorotate dehydrogenase (subfamily 1) family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0000000449747 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGTCAG GGAATGCGGG GTATGGCATA GAATACCAAG CTGTTTCGGG GTTTTCCAAT CGAGATGTCG GTGCGGTGTT TTTAAAAGGC ACCACGCTAG AGCCAAAGCT TGGCAATAAG CCCGAAAGAG TCATGGAAAC CGCCAGTGGT TTGTTGAATT CCATCGGACT GCAAAACCCT GGCGCTCATG CTGTGATAAA AGATTATCTA CCAAAGTTGG ACTTAAGCCA ATCACAATTT ATTATTAATG TGTCCGGCTC TTCTATTGAA GAGTATGCCG AGGTGGTCCG TTTGTTTGAT CAAACCGACT TGCCGGCTAT TGAGGTGAAT ATTTCTTGCC CGAATGTCAA AAAAGGCGGC GCGGCGTTTG GAAACGATCC TGATATGGCC GCAAAAGTGG TTGAAGCCTG TCGTGCCAAT ACGTCCAAAC CTCTGATTGT GAAATTATCA CCAAATCAAA CCGATATTGC GGAAGGTGCG CGCCGTGTCA TTGATGCCGG CGCCGATATG CTGTCGGCCA TTAATACGTT AATGGGCATG CAAATTGATA TTCACTCGGC TCGGCCCACG CTTGGAAACA ATCAAGGCGG TTTGTCAGGC CCGGCCATTA AGCCGGTGGC GCTGTTAAAG GTGCACCAGG TGTATCAAGT CGCAAAGCAA CACAATGTGC CGATTATTGG TTTAGGTGGT ATTGCTTCGG CAGACGATGC AATTGAATTT CTTTTGGCGG GCGCGTCAAT GGTTGCAGTG GGCACTGCGA TGGCGAAAGA CCCGTTGTTA GTCAAAAAAA TTAACCAGGG CATTGAAAAA TACATGGCCC GTTACGGTTA TCAATCCGTT GCAGAAATGA CGGGTAAACT CATATTAAAT ACAGACACCG TTTTGTGCGG TTAA
|
Protein sequence | MASGNAGYGI EYQAVSGFSN RDVGAVFLKG TTLEPKLGNK PERVMETASG LLNSIGLQNP GAHAVIKDYL PKLDLSQSQF IINVSGSSIE EYAEVVRLFD QTDLPAIEVN ISCPNVKKGG AAFGNDPDMA AKVVEACRAN TSKPLIVKLS PNQTDIAEGA RRVIDAGADM LSAINTLMGM QIDIHSARPT LGNNQGGLSG PAIKPVALLK VHQVYQVAKQ HNVPIIGLGG IASADDAIEF LLAGASMVAV GTAMAKDPLL VKKINQGIEK YMARYGYQSV AEMTGKLILN TDTVLCG
|
| |