Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmwyl1_3903 |
Symbol | |
ID | 5368045 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Marinomonas sp. MWYL1 |
Kingdom | Bacteria |
Replicon accession | NC_009654 |
Strand | + |
Start bp | 4402051 |
End bp | 4403451 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640806291 |
Product | N-formimino-L-glutamate deiminase |
Protein accession | YP_001342735 |
Protein GI | 152997900 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | [TIGR02022] formiminoglutamate deiminase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.456692 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGGGCC ATAACTCAAC TCATTGCGAT AACACTCAAT ACTATTTTTC TGAACGCGCT TTGCTTTCTT CTGGCTGGTC GAATAATGTA CTTTTCTCAG TAAAAGACGG ACAATTTCAT TCTTTTAAAG CAGACAGCAC ACCGACAACA GACTGCCATG TATTATCTGG CCCAGTCTTA CCAACGCTTG CCAACGTTCA TTCTCATGCC TTTCAGCGAG TGATGGCTGG TGCAGCAGAA GTCAGCTTAA ATCCAAACGA TAGTTTTTGG AGTTGGCGCG ATTTGATGTA CAAGATCGTG CAAAAACTTA CGCCAGACGA TGCTCGTATT ATTGCCAAAC AACTTTATAT CGACATGCTA AAAGCTGGTT ATACACAAGT CGGAGAATTC CACTATCTGC ACCACGATAT TGGCGGAAAG CATTATGGCC AGTTTGGTGA GATGTCGAAT CAAATCATCG CTGCAGCGGA CGACTCTGGA ATGGGCTTAA CTTTGCTACC AGTGCTGTAT TCGCACTCTG GTTTTGGAGG GCAAGCACCT AACGCCGGAC AAGCGCGTTT TATCAATTCC ACAGATTCTT ATTTGGCATT ACATCAAGCC TGCGATAAAG CGTTAGCGAA TCATGCTCGC CATAAATTGG GGATTTGCTT TCACTCATTG CGCGCGGTGA CCAAACCGCA AATAGACACC GTCTTACAAT CTCTCGCTAA AGATTGCCCG GTTCACATTC ATATCGCGGA GCAACAAAAA GAAGTTCAAG ACAGCCTGGC CTTCAGCGGA CAACGTCCAG TGGAATGGCT TAACAACGAA ATAGGCTTAA GTGAGCGCTG GTGTTTGGTT CACGCCACTC ATCTAACCGA TGCAGAACGC CAAGCCATTA CGAAGAGCAA AACCGTTGCT GGATTATGCC CAACTACCGA AGCGAATTTA GGCGATGGCA TTTTCCCTGC TGTGGAATTC GAGAAAGAAA ATGGCCGTTG GGGAATTGGC TCTGACAGCC ATGTTAGTTT GTCGATTGTA GAAGAACTCA GAACACTGGA ATATGGACAA CGTTTGCGCG ATCAACAGCG CAACCGTTTA TATCGTGCGG ATCAAACCAG CGTCGGAGAC AACTTATACC AACAAGCCTT ATTAGGCGGT AATCAAGCTT GCGACGTGTC ATTGGGATTA AGCCAAGGTA ACCGAGCCGA CTTTATGGTG CTCGACGAGT CTCACCCTTT TATCGCCGCA AGCGAATCAA AGGACTTACT CAACCGTTGG CTATTTGCCA CCAATGAAAA TCTCGTAAAA GACGTTTTTG TCGCGGGCAA ACACACAATA AAAAACTTCC ACCACCAGCA AGAAGAAAGC AGCCGTCAGG CTTTTATTCA AGTGATTAAA AAGGTGATGT ATGACGTTTA A
|
Protein sequence | MTGHNSTHCD NTQYYFSERA LLSSGWSNNV LFSVKDGQFH SFKADSTPTT DCHVLSGPVL PTLANVHSHA FQRVMAGAAE VSLNPNDSFW SWRDLMYKIV QKLTPDDARI IAKQLYIDML KAGYTQVGEF HYLHHDIGGK HYGQFGEMSN QIIAAADDSG MGLTLLPVLY SHSGFGGQAP NAGQARFINS TDSYLALHQA CDKALANHAR HKLGICFHSL RAVTKPQIDT VLQSLAKDCP VHIHIAEQQK EVQDSLAFSG QRPVEWLNNE IGLSERWCLV HATHLTDAER QAITKSKTVA GLCPTTEANL GDGIFPAVEF EKENGRWGIG SDSHVSLSIV EELRTLEYGQ RLRDQQRNRL YRADQTSVGD NLYQQALLGG NQACDVSLGL SQGNRADFMV LDESHPFIAA SESKDLLNRW LFATNENLVK DVFVAGKHTI KNFHHQQEES SRQAFIQVIK KVMYDV
|
| |