Gene Mmar10_1412 details

Gene Information

Locus tag: Mmar10_1412
Symbol:
ID: 4286148
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Maricaulis maris MCS10
Kingdom: Bacteria
Replicon accession: NC_008347
Strand:
Start bp: 1548591
End bp: 1550261
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 65%
IMG OID: 638140894
Product: formate-tetrahydrofolate ligase
Protein accession: YP_756642
Protein GI: 114569962
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG2759] Formyltetrahydrofolate synthetase
TIGRFAM ID:
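
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1548591
end_bp = 1550261

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1671 0 556
```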


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.749382
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.00000422387
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACCAGCG ATATCGAAAT CGCCCGCGCG GCGACGCTCA AGCCCATGGC CGCCATCGCG 
GCGCGGCTGG GGATACCGGA TGAGGCCATC ATTCCCTTCG GCCGTTCCAA GGCCAAGCTT
TCGGGCGACT TCATTGCCAC GCTCAAGGAT CGCCCGCGCG GCAAGCTGAT CCTCGTCACC
GCAATCAGTC CGACCCCGGC GGGTGAGGGC AAGACCACGA CCACGGTCGG CCTCGGCGAT
GGCCTGTCGC GGATCGGCAA AAAGGTCGCC ATCTGTCTGC GTGAACCATC CCTGGGTCCC
TGCTTCGGCA TGAAGGGCGG GGCGGCCGGC GGCGGCATGG CCCAGGTCGT GCCGATGGAG
GATATCAATC TCCATTTCAC CGGCGATTTC CACGCCATCA CCTCGGCCCA CAACCTGCTC
GCCGCGCTGA TCGACAATCA TGTCCATTGG GGCAATGAGC AGCAGATCGA CAGTCGCCGC
ATCGCCTTGC GCCGTGTGCT CGACATGAAT GACCGCTCGC TGCGCAATCT GGTCACCGGG
CTGGGCGGTC CGGCGCACGG CACGCCGCGC GAGGGCGGTT TCGACATTAC CGTGGCTTCC
GAAGTCATGG CGATCCTGTG CCTGGCCCGT GACCTGGCGG ATCTGGAAGA GCGTCTCGGC
GACATCGTGA TTGCCGAGCG GGCCGATCGC AGCCGGGTCA CAGCCCGTGA TATCGGTGCC
GCCGGAGCGA TGACGGTTCT CCTGAAGGAC GCCTTCCAGC CCAATCTGGT TCAGACCCTG
GAACACACGC CGACCTTCAT CCATGGCGGT CCCTTCGCCA ATATCGCCCA TGGCTGCAAC
ACGCTGGTCG CCACCGACAC GGCGCTGCGC CTGGCCGACT ATGTGGTCAC CGAGGCCGGT
TTCGGGGCGG ATCTGGGGGC AGAGAAATTC TTCGACATCA AATGCCGAAA GGGAGGGCTC
GAACCCTCCG CCGCTGTCCT GGTCGCCACG ATCCGGGCGT TGAAAATGAA TGGCGGGGTG
CCGAAGGATC AGCTGGGCGC AGAGAATGTC GCCGCTGTCG AGGCCGGCTG CGCCAATCTC
GGTCGTCATA TCGAGAACCT GGCCAAATTC GGCGTGCCGG TGGTCGTCGC GATCAATCAT
TTCACCGCCG ACAGCGAGGC GGAGGTCGCC GCGGTTGAGG CCTTTTGCGA AGCGCGCGGC
GTGAAGGCCG TCCTGGCGAC TCATTGGGCC GAGGGCGGGC AGGGCACGCA AAAGCTGGCC
GAGGCCGTCA GCGAGCTTGT GGAGGGCGGA TCGAGCCGGT TTGCGCCGCT CTATCCCGAC
GACATGCCCC TGGTCGACAA GATCGAGACC GTGGCCCAAT CCATCTACCG CGCCGGATCG
GTGGTGTTCG AACGTTCGGC CCGCCTGCAG CTGGAGCGCT GGCAGGAGGC GGGTTATGGG
CATCTGCCCG TGTGCATGGC CAAGACGCAA TATTCCTTCT CGGCCGATCC GGCCCTGACC
GGGGCGCCTG AAGGCCATGA ACTGCCCGTG CGCGAAGTCC GTCTCTCGGC AGGCGCCGGT
TTCGTGGTGG CGGTCTGCGG CGCGATCATG ACCATGCCCG GACTGCCGCG TAAGCCGGCA
GCGCTGGATA TTCACCTCAA TGCTGAGGGT GAGGTTGAAG GGTTGTTCTA G
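
A GC content figure such as the one listed under Gene Information can be recomputed from a sequence laid out in 10-base blocks like the one above. This is a hedged sketch only: `gc_percent` is a hypothetical helper, not a function of the source database, and it assumes the sequence contains only A, C, G and T plus whitespace.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)

# First 20 bases of the gene sequence, in the same 10-base block layout.
print(gc_percent("ATGACCAGCG ATATCGAAAT"))  # 40.0
```

Passing the full gene sequence as one string (line breaks included) should work the same way, since all whitespace is stripped before counting.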
 
Protein sequence
MTSDIEIARA ATLKPMAAIA ARLGIPDEAI IPFGRSKAKL SGDFIATLKD RPRGKLILVT 
AISPTPAGEG KTTTTVGLGD GLSRIGKKVA ICLREPSLGP CFGMKGGAAG GGMAQVVPME
DINLHFTGDF HAITSAHNLL AALIDNHVHW GNEQQIDSRR IALRRVLDMN DRSLRNLVTG
LGGPAHGTPR EGGFDITVAS EVMAILCLAR DLADLEERLG DIVIAERADR SRVTARDIGA
AGAMTVLLKD AFQPNLVQTL EHTPTFIHGG PFANIAHGCN TLVATDTALR LADYVVTEAG
FGADLGAEKF FDIKCRKGGL EPSAAVLVAT IRALKMNGGV PKDQLGAENV AAVEAGCANL
GRHIENLAKF GVPVVVAINH FTADSEAEVA AVEAFCEARG VKAVLATHWA EGGQGTQKLA
EAVSELVEGG SSRFAPLYPD DMPLVDKIET VAQSIYRAGS VVFERSARLQ LERWQEAGYG
HLPVCMAKTQ YSFSADPALT GAPEGHELPV REVRLSAGAG FVVAVCGAIM TMPGLPRKPA
ALDIHLNAEG EVEGLF
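
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is illustrative, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids as the standard code and differs only in its permitted start codons. `translate` is a hypothetical helper that does not handle alternative start codons, which is sufficient here because this gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First line of the gene sequence (60 bases, 20 codons).
first_line = "ATGACCAGCG ATATCGAAAT CGCCCGCGCG GCGACGCTCA AGCCCATGGC CGCCATCGCG"
print(translate(first_line))  # MTSDIEIARAATLKPMAAIA
```

The output matches the first 20 residues of the protein sequence listed above.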