Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4100 |
Symbol | |
ID | 5735961 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5232239 |
End bp | 5233660 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641281254 |
Product | dihydroorotase, multifunctional complex type |
Protein accession | YP_001546860 |
Protein GI | 159900613 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0044] Dihydroorotase and related cyclic amidohydrolases |
TIGRFAM ID | [TIGR00857] dihydroorotase, multifunctional complex type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.595035 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCACGGA TTGTGATCAA AAACGGCTGT ATCATCGATC CAGCCCGCAA AACTGCGACC GTCGGGGATT TGGTGCTGGA AAATAATCGG GTACGCGAGG TTATCGAGTT GGCCATGGCT CCCGATTCCT ATGGCGATGA TGTTGAGTTT ATTGATGCTT CGGGCTGTTT TGTGACCCCT GGCTTCATTG ATATTCACAC CCACTTGCGC GAACCAGGCT TTGAAGGCAA GGAAACGATT GCCACCGGTA TGGATGCAGC GGTGCGCGGT GGCTATACCA CGATTTGCCC AATGCCCAAC ACCAATCCTT CGCTCGATTC GGCTCCGCTG ATTCGCCAAC AATTTGATAT TGCCGCCCGC CATGGGCCAA TTCATGTGCT GCCAATCGGT GCAGTGACGC TTGGGCGTGA AGGCAAGGTA TTGGCTCCCT TGGTCGAGTT GGCCGAAGCT GGCGCACATG GGTTTAGTGA TGATGGCTCG CCCGTGTGGG ATGCCCACAT CATGCGTCAA GCCCTGTTGT ATAGCAAAAT GACTGGCCGC CCCGTGATGA ATCACTGCGA AGATTTGAGC ATCGTGCGCG GCGCTCCAAT GAACGAAGGT GCGGTTGCCA CCCGTTTGGG TTTGAGCGGT TGGCCGGCTG CTGGCGAAGA GGTGATGATT GCCCGTGATA TTGCTTTGGC TGAAGAAACA GGCGGGCGTT TACACATCTG CCATGTCAGC ACCGCTGGCG GGGTGGAATT GATTCGGGCA GCTAAGGCTC GTGGCGTGCG CGTGACTGGC GAAGTTACGC CGCACCACTT GACTATGACC GATCGCTGGG TGCTGGGCAG TATGGAGCCA TGGGATGGCA AAGGCCCATA CGATCCTTCG CAATTAGCAC CTTACGATAC CCGTACTCGT GTCAGCCCAC CCTTGCGCAC CAGCGAAGAT GTGGCGGCGC TGATCGCAGG CTTACGCGAT GGCACGATCG ACGCAGTAGC CACTGACCAT GCGCCGCATA CCCATGTCGA TAAAGAATGT GAGTATGGCT TTGCCTCCCC AGGTTTCACT GGCTTGGAAC TAGCCTTGCC CATGGTGTTG ACCTTAGTTC AGACCATGCA GATCGATATT GTCGAATTAA TTTCGCGGAT GACAATTGGC CCAGCGGCAA TTATCGATGT GCTGCCAATT TCGTTGGGGC CAAACGACCC CGCTACCTTG ACGATTTTTG ATCCAGCAGC AATTTGGAAG GTAACTCCAG AAGCCTTGGC ATCGAAAGGC AAAAACAGCC CCTTGATGGG TCAAGAGATG CGCGGCCAAG TGATGTTGAC CATGATGGAA GGCCAAATCG TCTATCGCCG TGAGCAGTTT GGTGAATCCA GTCGCCGCGA GCATGGTGGG GCCGTCGGCA AACTCAGCGG CGTATTCGAC GAAGAAGAGT AA
|
Protein sequence | MSRIVIKNGC IIDPARKTAT VGDLVLENNR VREVIELAMA PDSYGDDVEF IDASGCFVTP GFIDIHTHLR EPGFEGKETI ATGMDAAVRG GYTTICPMPN TNPSLDSAPL IRQQFDIAAR HGPIHVLPIG AVTLGREGKV LAPLVELAEA GAHGFSDDGS PVWDAHIMRQ ALLYSKMTGR PVMNHCEDLS IVRGAPMNEG AVATRLGLSG WPAAGEEVMI ARDIALAEET GGRLHICHVS TAGGVELIRA AKARGVRVTG EVTPHHLTMT DRWVLGSMEP WDGKGPYDPS QLAPYDTRTR VSPPLRTSED VAALIAGLRD GTIDAVATDH APHTHVDKEC EYGFASPGFT GLELALPMVL TLVQTMQIDI VELISRMTIG PAAIIDVLPI SLGPNDPATL TIFDPAAIWK VTPEALASKG KNSPLMGQEM RGQVMLTMME GQIVYRREQF GESSRREHGG AVGKLSGVFD EEE
|
| |