Gene Mmwyl1_2987 details

Gene Information

Locus tag: Mmwyl1_2987
Symbol:
ID: 5365088
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Marinomonas sp. MWYL1
Kingdom: Bacteria
Replicon accession: NC_009654
Strand:
Start bp: 3374596
End bp: 3376176
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 46%
IMG OID: 640805360
Product: phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_001341833
Protein GI: 152996998
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
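
The start, end, gene length, and protein length fields can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3374596, 3376176)
print(length, protein_length(length))  # 1581 526
```

Under these assumptions the computed values agree with the 1581 bp and 526 aa reported in the record.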


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.709991
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.888371
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGGCAA ATCAAAATAC AGTAACTCCA ATCAAACGAG CGTTAATCAG TGTTTCCGAC 
AAAACAGGCA TTGTCGAATT CGCACGCGAA CTGACCGCTC AAGGCGTTGA AATTCTTTCA
ACAGGTGGTA CTTACCGTCT TTTATTGGAC AGCAAAGTTA AAGCAACTGA AGTTTCTGAC
TACACAGGCT TCCCAGAAAT GATGGACGGC CGCGTAAAAA CGCTACACCC AAAAGTGCAC
GGTGGCATTC TTGGTCGTCG TGACATCGAT GGCGCCATCA TGAAAGAGCA CGGTATCGAA
GAAATCGATA TGGTTGTTGT CAACCTTTAT CCATTCGAAG CGACCATCGA GCGTCCTGAT
TGCGATTTAC CAATGGCGAT CGAAAACATC GATATCGGTG GTCCAACTAT GGTTCGCTCT
GCAGCGAAAA ACCATAAAGA CGTTGCTATT GTGGTATCTC CTTCTAGCTA CGAAGAAATC
TTAGCTTCTC TTAAAGCAGA TGACGGTTTG ACTTATGAGC AGCGCTTCGA TCTTGCCGTA
AAAGCATTCG AACATACATC TCACTATGAT GGCGCTATTG CAAACTTCCT AGGCAAAAAA
GTAGAAGGCG GCAGCGAAGA TTTTGCGCGC ACCTTCAACT TACAATTCAA CAAGCAAGAA
GAAATGCGCT ACGGTGAAAA CCCCCACCAG AAAGCCGCTT TCTATGTTGA AGCGAATCCT
AAAGAAGCAT CCATCAGCAC TGCTAAACAA ATCCAAGGTA AGGCACTGTC TTACAACAAC
ATCGCAGACA CCGATGCGGC TCTTGAGTGC GTAAAAAGCT TTGATAAACC TGCTTGTGTT
ATCGTTAAGC ACGCGAACCC TTGTGGTGTA GCAACGGCAG CGACACAACT TGAAGCATAC
GACCTAGCTT TCCAAACCGA TCCAACGTCG GCATTTGGTG GCATCATTGC ATTCAACCAA
GAGTTAGATG CAAAAACAGC ACAAGCCATT GTTGACCGCC AATTTGTTGA AGTTATCATT
GCGCCAAGCG TCAGCAAAGA AGCAGCGGAA GTCGTAGCGG CTAAGCAAAA TGTTCGCTTG
TTAGAGTGCG GCCAATGGTC AAAAGACAAA CCTGCGGCAC TAGACTACAA GCGCGTAAAC
GGCGGTCTAT TGGTTCAAGA CCGTGATGAC GGCGTTATTA CTTTAGACGA TCTTAAAATC
GTCTCTAAAC GTCAGCCTAG CGAAGAAGAA CTAAAAGACT TATTGTTTGC TTGGAAAGTA
GCGAAATTCG TTAAATCCAA TGCTATTGTT TATGCAAAAG CTGAACAAAC TATTGGTGTA
GGCGCAGGCC AAATGAGCCG TGTCTACAGC GCTAAAATTG CCGGTATCAA AGCAGCTGAC
GAAAATCTAG AGGTAGCGGG TTCAGTCATG GCTTCGGATG CTTTCTTCCC ATTCCGTGAC
GGTATTGATG CTGCGGCTCA AGCTGGTATT ACAGCAGTTA TCCAACCTGG CGGCTCTATG
CGCGACGAAG AAGTCATTGC CGCAGCCGAT GAAGCTGGCA TGGCTATGGT CTTTACAGGA
ATGCGCCACT TCCGTCATTA A
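
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any length or composition check. The sketch below is one possible way to do that in Python and to compute a GC percentage comparable to the GC content field. The helper names are hypothetical, and only the first printed line is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First printed line of the gene sequence, used here only as sample input.
first_line = "ATGATGGCAA ATCAAAATAC AGTAACTCCA ATCAAACGAG CGTTAATCAG TGTTTCCGAC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Applied to the full sequence rather than a single line, the same two functions give values to compare against the Gene Length and GC content fields.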
 
Protein sequence
MMANQNTVTP IKRALISVSD KTGIVEFARE LTAQGVEILS TGGTYRLLLD SKVKATEVSD 
YTGFPEMMDG RVKTLHPKVH GGILGRRDID GAIMKEHGIE EIDMVVVNLY PFEATIERPD
CDLPMAIENI DIGGPTMVRS AAKNHKDVAI VVSPSSYEEI LASLKADDGL TYEQRFDLAV
KAFEHTSHYD GAIANFLGKK VEGGSEDFAR TFNLQFNKQE EMRYGENPHQ KAAFYVEANP
KEASISTAKQ IQGKALSYNN IADTDAALEC VKSFDKPACV IVKHANPCGV ATAATQLEAY
DLAFQTDPTS AFGGIIAFNQ ELDAKTAQAI VDRQFVEVII APSVSKEAAE VVAAKQNVRL
LECGQWSKDK PAALDYKRVN GGLLVQDRDD GVITLDDLKI VSKRQPSEEE LKDLLFAWKV
AKFVKSNAIV YAKAEQTIGV GAGQMSRVYS AKIAGIKAAD ENLEVAGSVM ASDAFFPFRD
GIDAAAQAGI TAVIQPGGSM RDEEVIAAAD EAGMAMVFTG MRHFRH
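
The protein sequence follows from the gene sequence by translation with table 11. The sketch below is a minimal, dependency-free Python illustration and not the pipeline that produced this record. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. It assumes the sequence starts in frame with ATG, as this gene does, so no special handling of alternative start codons is included.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; table 11 shares these
# assignments with the standard code (it differs in allowed start codons).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS up to, but not including, the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final printed line of the gene sequence.
print(translate("ATGATGGCAA ATCAAAATAC AGTAACTCCA"))  # MMANQNTVTP
print(translate("ATGCGCCACT TCCGTCATTA A"))           # MRHFRH
```

The two sample outputs match the first ten and last six residues of the protein sequence listed above.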