Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_3928 |
Symbol | |
ID | 6134900 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 4377065 |
End bp | 4378204 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641644086 |
Product | dihydroorotase |
Protein accession | YP_001770728 |
Protein GI | 170742073 |
COG category | [R] General function prediction only |
COG ID | [COG3964] Predicted amidohydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0785792 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0338469 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCATG AGCTGATCCT GGATGGCGCG CGCGTCATCG ACCCCTCCGC CGGGATCGAC CGGGTCACCC GCGTCGCCTT CGCGGGCGGC CGCGTGGCCG CCCTGGGCGA GGGCATCGAC ACGACCGGCT GCCCGGACGT CCGCGACCTG CGCGGCCTCA TCGTCGTCCC GGGCCTGATC GACCTGCACA CGCACGTCTA CTGGGGCGGC ACCTCGCTCG GCATCGACGC GGCGCAGTTC GCCCGCGACA GCGGCGTCAC CACGGCGGTG GACACCGGCA GCGCCGGCCC GGGCAACTTC CTCGGCTTCC GCAAGCACGT GATCGAGCCC TCGCCGGTCC GCATCCTCGC CTACCTGCAC GTGTCCTTCG CGGGCATCTT CGCCTTCTCG AAGAGTATCA TGGTCGGGGA GAGCGAGGAG ATCCGCCTGA TGGCCCCGGC CGAGGCGGCG GCGGTCGCGG AGGCGAACCG CGACGTCGTC GTCGGCATCA AGGTCCGGGT CGGGCTGCAC GCCTCCGGCC GCTCGGGCCT GCAGCCCTTC GAGGCGGCGC TCCAGGTCGC CGAGGAGGTC GGCATGCCCA TGATGGTGCA TATCGATCAC CCGCCCCCGA GCTACGAGGA GGTGGTCGAG CGCCTGCGCC CCGGCGACGT GCTGACCCAC GCCTTCCGGC CCTTCCCGAA CGCGCCCCTC TCCGGCCAGG GCCGGGTGCG CGAGGCCGTG GTGGCGGCGC GCCGCCGCGG CGTGCTCTTC GACATCGGTC ACGGCAAGGG CTCCTTCGCC TTCAAGACCG CCCGGGCCAT GCTGGCGAAC GGATTCCCGC CCGACACGAT CTCGTCGGAC ATCCACACCC TCTGCATCGA CGGCCCGGTC TTCGACCAGA CCACGACCCT GTCGAAGTTC CTCTGCCTCG GCATGAGCCT GCCGGACGTG ATCGCCGCCA CGACCGTGAA CGCCGCGACG GCCCTGCGGC GGCCCGAACT CGGCTCCCTG CGGCCGGGAT CGGTCGGCGA CGCCACGATC CTGCGGCTCG ACGAGGGCCG CTTCGACTAC GTCGACACCA CGGGCGAGCA CCTCGCCGGC GACCGGCGCC TGACCTCGTC GGGCGTGGTG GTCGGGGGCC GCTGGTGGCA TCCCGCTTGA
|
Protein sequence | MAHELILDGA RVIDPSAGID RVTRVAFAGG RVAALGEGID TTGCPDVRDL RGLIVVPGLI DLHTHVYWGG TSLGIDAAQF ARDSGVTTAV DTGSAGPGNF LGFRKHVIEP SPVRILAYLH VSFAGIFAFS KSIMVGESEE IRLMAPAEAA AVAEANRDVV VGIKVRVGLH ASGRSGLQPF EAALQVAEEV GMPMMVHIDH PPPSYEEVVE RLRPGDVLTH AFRPFPNAPL SGQGRVREAV VAARRRGVLF DIGHGKGSFA FKTARAMLAN GFPPDTISSD IHTLCIDGPV FDQTTTLSKF LCLGMSLPDV IAATTVNAAT ALRRPELGSL RPGSVGDATI LRLDEGRFDY VDTTGEHLAG DRRLTSSGVV VGGRWWHPA
|
| |