Gene Mmar10_2928 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_2928 
Symbol 
ID4285004 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp3209832 
End bp3211427 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content63% 
IMG OID638142423 
Productphosphoribosylaminoimidazolecarboxamide formyltransferase / IMP cyclohydrolase 
Protein accessionYP_758147 
Protein GI114571467 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGACC TTGATCTCGC CCCGGTACGG CGCGCCCTGA TTTCCGTTTC TGACAAGTCC 
GGCCTGGTTG AGCGCGGACG GCAGCTGGCC GATGCCGGTG TCGAAATCCT GTCGACCGGC
GGCACGCTGC GGAGCCTCAA GGAGGCCGGC ATCCCGGCCC GGGACGTGTC CGAAGTCACC
GAATTCCCGG AAATGATGGA CGGCCGCCTG AAAACACTTC ATCCGCGCGT GCATGGCGGC
CTGCTGGCGC GACGCGACAA TGGCGATGAT CGCGCTGCCA TGGGTGCACA CGGCATCCCG
CACATCGATC TGCTTTACGT CAATCTCTAC CCGTTTGAGG AAACCGTCGA AAAAGGCGGC
GCCTATGCCG ATTGTGTCGA GAATATCGAC ATTGGCGGAC CTGCGATGAT CCGCGCCGCG
GCCAAGAACC ACGCCTGGGT GAATGTCTGC GTGGATGGCG CCGATGTGGA CCGGGTCCTG
GCCGACATGG CCGAACATGA CGGGCATTCG CGGCTCGATC TTCGCAAGTC CCTGGCCGCC
AAGGCCTATG CCCGCACGGC CGCCTATGAC GCCGCCATCT CAAACTGGTT TTCCGACACA
CTCGAAGATC CGGCCCCGGA ATACCGCGCC TTCGGTGGCG CGCTGACCCA GGCCCTGCGT
TATGGCGAAA ACCCGCATCA GAACGCCGCC TTTTACAGAA GCGCCGAAAA CCGGGCTGGC
ATCGCGACGG GCCGACAGGT CCAGGGCAAG GCCCTGAGCT ACAACAATCT GGCCGACGCT
GATGCGGCAT TCGAGCTGGC CGGCGAGTTT GCTTCAGACG ACGGCGCCGC CGTTGTTATC
GTCAAGCACG CCAATCCGTG CGGCGTCGCT GTGCACGACA CGCTGTCTTC CGCTTATGAA
AAGGCCTTCG CGGCCGATCC GATTTCCGCT TTTGGCGGTA TCGTGGCGAT GAACCGCACG
CTCGACGCGG TCACTGCCGG GGAGCTTGTG AAGATTTTCA CCGAAGTCGT CATTGCACCC
GACGCGGACG AGGACGCGCT CGCTGTTCTC TCCGCCAAGT CTAATCTGCG TGTCTTGCTG
ACAGGCGGGA TGCCGGACCC GAAACAGGGT GGCTGGCTGA CCAAAACAGT CGCCGGCGGC
CTGCTGGTCC AGGAACGTGA CATTGGCATG ATCGCGCCGT CGGAGTTGAA GGTCGTCACC
AAGCGCGCGC CGACCGACGC CGAACTCACC GATCTTATGT TTGGCTGGAA AGTAGTCAAA
CATGTGAAGT CGAACGCCAT TGTTTATGCC CGCAACGGCT CGACTGCGGG GATCGGAATG
GGCCAGACCA GCCGCCTGGA AGCCGCCCGC CTGGCCGTCC GCAAGGCAGA GGCGACCGCC
GCCGAAAACG GTTGGGATGA GCCGCGGACC AAGGGTTCGG TGTGCGCATC GGACGCCTTC
TTCCCGTTTG CCGACGGGCT GCACGCAGCT GTCGACGCCG GCGCCACGGC GATCATCCAG
CCGGGTGGTT CCATCCGCGA TGAAGAAGTC ATTGCTGCCG CTGACGAGGC GGGTATCGCC
ATGGTCTTCA CCGGGATGCG CCATTTCCGG CACTAG
 
Protein sequence
MSDLDLAPVR RALISVSDKS GLVERGRQLA DAGVEILSTG GTLRSLKEAG IPARDVSEVT 
EFPEMMDGRL KTLHPRVHGG LLARRDNGDD RAAMGAHGIP HIDLLYVNLY PFEETVEKGG
AYADCVENID IGGPAMIRAA AKNHAWVNVC VDGADVDRVL ADMAEHDGHS RLDLRKSLAA
KAYARTAAYD AAISNWFSDT LEDPAPEYRA FGGALTQALR YGENPHQNAA FYRSAENRAG
IATGRQVQGK ALSYNNLADA DAAFELAGEF ASDDGAAVVI VKHANPCGVA VHDTLSSAYE
KAFAADPISA FGGIVAMNRT LDAVTAGELV KIFTEVVIAP DADEDALAVL SAKSNLRVLL
TGGMPDPKQG GWLTKTVAGG LLVQERDIGM IAPSELKVVT KRAPTDAELT DLMFGWKVVK
HVKSNAIVYA RNGSTAGIGM GQTSRLEAAR LAVRKAEATA AENGWDEPRT KGSVCASDAF
FPFADGLHAA VDAGATAIIQ PGGSIRDEEV IAAADEAGIA MVFTGMRHFR H