Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmar10_1412 |
Symbol | |
ID | 4286148 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | + |
Start bp | 1548591 |
End bp | 1550261 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 638140894 |
Product | formate-tetrahydrofolate ligase |
Protein accession | YP_756642 |
Protein GI | 114569962 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG2759] Formyltetrahydrofolate synthetase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.749382 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.00000422387 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCAGCG ATATCGAAAT CGCCCGCGCG GCGACGCTCA AGCCCATGGC CGCCATCGCG GCGCGGCTGG GGATACCGGA TGAGGCCATC ATTCCCTTCG GCCGTTCCAA GGCCAAGCTT TCGGGCGACT TCATTGCCAC GCTCAAGGAT CGCCCGCGCG GCAAGCTGAT CCTCGTCACC GCAATCAGTC CGACCCCGGC GGGTGAGGGC AAGACCACGA CCACGGTCGG CCTCGGCGAT GGCCTGTCGC GGATCGGCAA AAAGGTCGCC ATCTGTCTGC GTGAACCATC CCTGGGTCCC TGCTTCGGCA TGAAGGGCGG GGCGGCCGGC GGCGGCATGG CCCAGGTCGT GCCGATGGAG GATATCAATC TCCATTTCAC CGGCGATTTC CACGCCATCA CCTCGGCCCA CAACCTGCTC GCCGCGCTGA TCGACAATCA TGTCCATTGG GGCAATGAGC AGCAGATCGA CAGTCGCCGC ATCGCCTTGC GCCGTGTGCT CGACATGAAT GACCGCTCGC TGCGCAATCT GGTCACCGGG CTGGGCGGTC CGGCGCACGG CACGCCGCGC GAGGGCGGTT TCGACATTAC CGTGGCTTCC GAAGTCATGG CGATCCTGTG CCTGGCCCGT GACCTGGCGG ATCTGGAAGA GCGTCTCGGC GACATCGTGA TTGCCGAGCG GGCCGATCGC AGCCGGGTCA CAGCCCGTGA TATCGGTGCC GCCGGAGCGA TGACGGTTCT CCTGAAGGAC GCCTTCCAGC CCAATCTGGT TCAGACCCTG GAACACACGC CGACCTTCAT CCATGGCGGT CCCTTCGCCA ATATCGCCCA TGGCTGCAAC ACGCTGGTCG CCACCGACAC GGCGCTGCGC CTGGCCGACT ATGTGGTCAC CGAGGCCGGT TTCGGGGCGG ATCTGGGGGC AGAGAAATTC TTCGACATCA AATGCCGAAA GGGAGGGCTC GAACCCTCCG CCGCTGTCCT GGTCGCCACG ATCCGGGCGT TGAAAATGAA TGGCGGGGTG CCGAAGGATC AGCTGGGCGC AGAGAATGTC GCCGCTGTCG AGGCCGGCTG CGCCAATCTC GGTCGTCATA TCGAGAACCT GGCCAAATTC GGCGTGCCGG TGGTCGTCGC GATCAATCAT TTCACCGCCG ACAGCGAGGC GGAGGTCGCC GCGGTTGAGG CCTTTTGCGA AGCGCGCGGC GTGAAGGCCG TCCTGGCGAC TCATTGGGCC GAGGGCGGGC AGGGCACGCA AAAGCTGGCC GAGGCCGTCA GCGAGCTTGT GGAGGGCGGA TCGAGCCGGT TTGCGCCGCT CTATCCCGAC GACATGCCCC TGGTCGACAA GATCGAGACC GTGGCCCAAT CCATCTACCG CGCCGGATCG GTGGTGTTCG AACGTTCGGC CCGCCTGCAG CTGGAGCGCT GGCAGGAGGC GGGTTATGGG CATCTGCCCG TGTGCATGGC CAAGACGCAA TATTCCTTCT CGGCCGATCC GGCCCTGACC GGGGCGCCTG AAGGCCATGA ACTGCCCGTG CGCGAAGTCC GTCTCTCGGC AGGCGCCGGT TTCGTGGTGG CGGTCTGCGG CGCGATCATG ACCATGCCCG GACTGCCGCG TAAGCCGGCA GCGCTGGATA TTCACCTCAA TGCTGAGGGT GAGGTTGAAG GGTTGTTCTA G
|
Protein sequence | MTSDIEIARA ATLKPMAAIA ARLGIPDEAI IPFGRSKAKL SGDFIATLKD RPRGKLILVT AISPTPAGEG KTTTTVGLGD GLSRIGKKVA ICLREPSLGP CFGMKGGAAG GGMAQVVPME DINLHFTGDF HAITSAHNLL AALIDNHVHW GNEQQIDSRR IALRRVLDMN DRSLRNLVTG LGGPAHGTPR EGGFDITVAS EVMAILCLAR DLADLEERLG DIVIAERADR SRVTARDIGA AGAMTVLLKD AFQPNLVQTL EHTPTFIHGG PFANIAHGCN TLVATDTALR LADYVVTEAG FGADLGAEKF FDIKCRKGGL EPSAAVLVAT IRALKMNGGV PKDQLGAENV AAVEAGCANL GRHIENLAKF GVPVVVAINH FTADSEAEVA AVEAFCEARG VKAVLATHWA EGGQGTQKLA EAVSELVEGG SSRFAPLYPD DMPLVDKIET VAQSIYRAGS VVFERSARLQ LERWQEAGYG HLPVCMAKTQ YSFSADPALT GAPEGHELPV REVRLSAGAG FVVAVCGAIM TMPGLPRKPA ALDIHLNAEG EVEGLF
|
| |