Gene Information |
Locus tag | RoseRS_4254 |
Symbol | |
ID | 5211239 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 5336930 |
End bp | 5338117 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640597843 |
Product | dihydroorotase |
Protein accession | YP_001278547 |
Protein GI | 148658342 |
COG category | [R] General function prediction only |
COG ID | [COG3964] Predicted amidohydrolase |
TIGRFAM ID | |
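The coordinate and length fields above are related by simple arithmetic, sketched below. Only the start, end, gene length, and protein length values come from the record; the variable names are illustrative, and the sketch assumes inclusive 1-based coordinates and a single terminal stop codon.

```python
# Values copied from the record above.
start_bp = 5336930
end_bp = 5338117
reported_gene_length = 1188      # bp
reported_protein_length = 395    # aa

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted
# in the protein length.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.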
| |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACGACC TTCTCATTCG CGGCGGGCAT GTGATCGACC CGGCAAACCA TCGCAACGCT CCCTTCGACG TTGCGATCAG CGGTGGACGC ATTGCGGCGG TGGCGGAGTC CATCGATCCG GCGCAGGCGC GCGCTGTGAT CGACGCGACC GGTCAGATTG TGACGCCTGG ACTGGTCGAT CTGCATACCC ATGTCTACTG GGGCGTTACC TACTGGGGAA TTGAAGCCGA CCCGGTTGCG GCGCGCAGCG GGGTGACCAC CTGGGTGGAT GCTGGCAGCG CTGGCGCGTA CAGTTTCCCC GGCTTTCGCC ACTTCATCTG CAATGCCAGC CGGGTGCGCA TATTCGCGTT TCTCAACCTG TCAGCGATCG GGCTGATTGC GCCAACCTGG GAGTTCGCCA ATCTCGATTA TTGCGACATC GATCTGGCAG CCAGGACGAT CGAAGAGAAT CGCGATATGC TCGTCGGGAT CAAGGCGCGC ATCGATCACA ACACCACGCG CGGCGTCGGC ATCCGCCCGC TGGAACTGGC GCGCACCCTC GCCGACCGGG TGGCGCTGCC GCTAATGGTG CATATCGGCA ACGCTCCGCC GGCGCTTGAC GAGATTGTTG CGCTCCTGCG CCCCGGTGAC ATTCTGACCC ATTGCTTCAC CGGCGGCACG CATCGCCTGC TCACCGACGA CGGACGCCTC TCCCCGGTTG CACGCACGCT GCAGGAGCGT GGCGTGCTGC TCGACATCGG GCACGGAACC GGGTCGTTCA GTTATCGGGT GGCTGAAGCA GCACTGGCGG AAGGGCTGCT GCCCGACATT ATCAGCAGCG ACATTCATCA ACTCAGTGTG CAGGGACCAA TGTTCGATCT GCCAACGACA CTCTCAAAGT TTCTCAATCT GGGCATGTCG CTCCCCGATA TCATCGACCG GGCAACCCGC CGACCGGCGC TGGCGATCGG GAAACCGGAA CTCGGAACAT TGCAGCCTGG GGTGCCAGCG GATGTGGCGC TCTTTCGCCT TGAAGAAGCG GACGTGACAT TCTACGATGT CGAGATGAAT CCGCGCGCTG GCAACCGGCG CCTGGTCTGC ACGATGACGA TCGTTGATGG GCAAGTGCTG CCGCAGCGGG CGGAACGCGC ACCGGCGATC TGGGCGATTT TGCCGGAGCA TCAGCGGCGC ATCCTCGATC AGGGTTGA
|
Protein sequence | MYDLLIRGGH VIDPANHRNA PFDVAISGGR IAAVAESIDP AQARAVIDAT GQIVTPGLVD LHTHVYWGVT YWGIEADPVA ARSGVTTWVD AGSAGAYSFP GFRHFICNAS RVRIFAFLNL SAIGLIAPTW EFANLDYCDI DLAARTIEEN RDMLVGIKAR IDHNTTRGVG IRPLELARTL ADRVALPLMV HIGNAPPALD EIVALLRPGD ILTHCFTGGT HRLLTDDGRL SPVARTLQER GVLLDIGHGT GSFSYRVAEA ALAEGLLPDI ISSDIHQLSV QGPMFDLPTT LSKFLNLGMS LPDIIDRATR RPALAIGKPE LGTLQPGVPA DVALFRLEEA DVTFYDVEMN PRAGNRRLVC TMTIVDGQVL PQRAERAPAI WAILPEHQRR ILDQG
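The relationship between the gene sequence and the protein sequence can be sketched with a small translator. This is a minimal illustration, not the database's own method: the `translate` function and `CODON_TABLE` name are hypothetical, and the codon assignments used are the standard ones, which translation table 11 shares (table 11 differs in its permitted start codons). The listed gene sequence appears to be given in coding orientation, since it begins with ATG and ends with TGA, so no reverse complement is applied even though the gene lies on the minus strand.

```python
# Build the codon table from the conventional TCAG ordering.
# Assumption: amino-acid assignments of translation table 11 match the
# standard code; only the set of allowed start codons differs.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES),
        AMINO_ACIDS,
    )
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence in the record.
first_block = "ATGTACGACC TTCTCATTCG CGGCGGGCAT"
last_block = "ATCCTCGATC AGGGTTGA"
print(translate(first_block))
print(translate(last_block))
```

The two fragments translate to the first ten and last five residues of the listed protein sequence; passing the full gene sequence to `translate` would allow a complete comparison.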