Gene Information |
Locus tag | Mmwyl1_4101 |
Symbol | |
ID | 5366662 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Marinomonas sp. MWYL1 |
Kingdom | Bacteria |
Replicon accession | NC_009654 |
Strand | - |
Start bp | 4633743 |
End bp | 4635101 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640806494 |
Product | amidohydrolase |
Protein accession | YP_001342932 |
Protein GI | 152998097 |
COG category | [F] Nucleotide transport and metabolism; [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
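The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. Neither convention is stated in the record. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4633743
end_bp = 4635101
reported_gene_length = 1359    # bp
reported_protein_length = 452  # aa

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length == reported_gene_length)        # True
print(gene_length % 3 == 0)                       # True: whole number of codons
print(protein_length == reported_protein_length)  # True
```

Under these assumptions the record is internally consistent: 1359 bp corresponds to 453 codons, of which 452 encode amino acids.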
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.00104689 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCCCAGT ATTTGATCCG CCAAGCCAAA GCCGTTGCAG GTTATGAGTT TCCTCAAGAC ATCCGTATCA AAGATGGCGT GATTTCAGCC ATTGCTCCCA CATTAAATCG CGAGCTTGAC GACGAGTTAA TAGATGCGTC TCATTGTGTG GTTTATCCCG GTTTTGTGAA TACTCACCAT CATCTTGCTC AGTCGATACT AAAAGGCGTA CCAGCGGGTT TGAATCAAGG TTTAGGGGAA TGGCTGGCAA GCGTTCCCTA TCGTTTTTGG GCACAAATCA CGCCAGATTT AATGTATGTC GCGGCGAAAT TAGGTCTTTA TGAACAGCTA CGTTCTGGGG TGACAACCTG CGCGGACCAC CATTATTTAT ACCATGCAAC GACATCACCT GAGTTGGAAG ACGCTGTTTG GCGAGCTGCC GAAGATTTAG GTATTCGCTT AGTTTTATGC CGTGGTGGCG CAACGGTGCA AGGCAGTCAT AAAGGTATGA AAGACAGCGG CATTGTTCCA GAGTCATTGG ATTTGTCACT GAATCGCCTT GAAGCTTCCT ACAAACGTTA TCATGATGCC ACCCCATTCA GTATGCGACG CTTGGTTGTT GCACCGACTA GCTTGGTTCA TACCTCGCCA GCAGAGGATT TACGCAGTTA TGCTCAGTGG GCAAAAGAAC GAAAATTACT GCGTCATTCG CATTTATTAG AAGTATCTTT TGATGAGCAA ATGGCACAAC AAAACTTCGG TATGAGCGCC ATTGATTACG CCGCCCATTG TGATTGGTTG GGTGACGATG TGTGGTTTGC TCACTTGGTT CAAGCGGATG CTCATGCGAT TGAATTACTG GCGCAAACGA AAACGCGAAT TGCCCATTGC CCTACGTCAA ATTGTCGCCT TGGTAGTGGT ATTGCGCCAG TTCTTGCAAT GGAAAAAGCG GGTATTCCGA TTACCCTCGG TGTTGATGGC TCCGCGTCTA GTGAAAGTGC TTCTATGTTG CAGGAGTTGA ATTTAGCTTG GTTATTACAT CGAACCCATA GCGGGCCCTC GGCAACGAAT GTCTCGCAAG TGCTTAAATG GGGGACCCAA AACGGAGCCG AGCTTTTGGG CCTAAAAACG GGAAAAATAG CAGAAGGCTT TGCTGCTGAT TTAGTATTGT ATTCCCTTGA TGCACCACGT TTTTCTGGGG TACACAGTCC ATTAGAAGCG CCAATTTTAT GTGGTGAACC GGTGCTGATT AAACACAGCT TTGTAAACGG CAAGCGAGTA GTTGAAAACG GCCAAGTATT GGGCGTGGAT GAAGCTGAGT TAGTACATGA TGTCAAAGCC GCTGTGCTGG AATTATTAGC AAGAGCGCCA TCTAATTAA
|
Protein sequence | MPQYLIRQAK AVAGYEFPQD IRIKDGVISA IAPTLNRELD DELIDASHCV VYPGFVNTHH HLAQSILKGV PAGLNQGLGE WLASVPYRFW AQITPDLMYV AAKLGLYEQL RSGVTTCADH HYLYHATTSP ELEDAVWRAA EDLGIRLVLC RGGATVQGSH KGMKDSGIVP ESLDLSLNRL EASYKRYHDA TPFSMRRLVV APTSLVHTSP AEDLRSYAQW AKERKLLRHS HLLEVSFDEQ MAQQNFGMSA IDYAAHCDWL GDDVWFAHLV QADAHAIELL AQTKTRIAHC PTSNCRLGSG IAPVLAMEKA GIPITLGVDG SASSESASML QELNLAWLLH RTHSGPSATN VSQVLKWGTQ NGAELLGLKT GKIAEGFAAD LVLYSLDAPR FSGVHSPLEA PILCGEPVLI KHSFVNGKRV VENGQVLGVD EAELVHDVKA AVLELLARAP SN
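The gene sequence is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand), so it can be translated directly. The following is a minimal sketch, not the database's own pipeline: it builds the codon table used by NCBI translation table 11, whose amino acid assignments match the standard code, and strips the spaces that separate the 10-base display blocks. The helper names `clean`, `gc_percent`, and `translate` are hypothetical. Table 11 also permits alternative start codons, which this sketch does not handle because this gene starts with ATG.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... paired with the amino acid string
# for NCBI translation table 11 (same assignments as the standard code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between display blocks and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; expects a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence above.
first_blocks = "ATGCCCCAGT ATTTGATCCG CCAAGCCAAA"
print(translate(first_blocks))  # MPQYLIRQAK
```

Passing the full Gene sequence field to `translate` and `gc_percent` gives values that can be compared against the Protein sequence and GC content fields of the record.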
|
| |