Gene MCA2219 details

Gene Information

Locus tag: MCA2219
Symbol: fhs
ID: 3102111
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 2396781
End bp: 2398454
Gene Length: 1674 bp
Protein Length: 557 aa
Translation table: 11
GC content: 65%
IMG OID: 637171365
Product: formate--tetrahydrofolate ligase
Protein accession: YP_114639
Protein GI: 53803762
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG2759] Formyltetrahydrofolate synthetase
TIGRFAM ID:
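
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration and assumes the start and end positions are 1-based and inclusive, which the record does not state explicitly. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2396781
end_bp = 2398454
gene_length_bp = 1674
protein_length_aa = 557

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is the stop codon and adds no residue.
codons, remainder = divmod(span, 3)

print(span, codons, remainder)  # 1674 558 0
```

Under that assumption the span matches the listed gene length, and 558 codons minus one stop codon gives the listed 557 residues.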


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.83616
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCGATA TCGAAATCGC GCAGCGCGCC AAGATGCTGC CCATCATTGA TCTGGCCCGC 
GAGAAACTAG GAATTCCTGC CGCCAGCCTC GACCCTTACG GGCATTACAA AGCCAAGGTC
GCGCTCGACT ACATCGACGG CCTCAAGGAC CGGCCCGACG GCAAGCTCAT CCTGGTCACC
GCCATCAGCC CGACCCCGGC CGGCGAAGGC AAGACCACGA CCACGGTCGG CCTGGGCGAC
GCGCTGAACC GGATCGGCAA GAAGACCGTG ATGTGCCTGC GCGAACCCTC GCTCGGCCCC
TGCTTCGGCG TCAAAGGCGG AGCCGCGGGT GGCGGCCATG CGCAGGTGGT GCCGATGGAG
GACATCAACC TGCATTTCAC CGGCGACTTC CACGCCGTCG GCGTCGCCCA CAACCTGCTC
TCGGCCCTGA TCGACAACCA CATCAACCAC GGCAATGCGC TCGACATCGA CCCGCGCCGC
ATCCAGTGGA AGCGCGTGGT CGACATGAAC GACCGCGCCC TGCGCAAGAT CGTGGTGGGC
ATGGGCGGCA CCGCCAACGG TTATCTGCGC GAAGACGGCT TCGACATCGT GGTGGCATCG
GAAGTGATGG CCATCCTCTG CCTGGCCACC AGCATGGCGG ACCTGAAGGA GCGGCTGGGC
CGCATCATCG TCGGCTACAA GAGCGACGGC AAGACCCCGG TCTACGCCCG CGACCTCAAG
GCCCACGGCG CCATGGCCGC CCTGCTGAAG GACGCCATCA AGCCGAATCT GGTGCAGACC
CTGGAGAACA ACCTCGCCAT CATCCACGGC GGGCCGTTCG CCAACATCGC CCACGGCTGC
AACACCGTGA CCGCCACCCA GACTGCGCTG AAGCTGGCCG ATTACGTGGT GACCGAAGCC
GGCTTCGGCG CCGACCTGGG CGCCGAGAAG TTCATCGACA TCAAATGCCG CATGGCCGGG
CTGAACCCCG CCGCGGTGGT GCTGGTCGCC ACGGTACGCG CCCTGAAATT CCACGGCGGC
GTGAAAAAGG AAGACCTGAA TCAGGAAAAC CTCGCCGCGC TGGAAGCCGG TTTCGCCAAC
CTGGAAAGGC ACGTCCACAA CATCCGCGAG CACTATGGCC TGCCCTGCGT GGTTTCGATC
AACCATTTCA GTTTCGACAC CGAAGCCGAA ATCGCGTGGC TGATGAAGAA ATGCGAGGCG
TTGGGCGTGA AGGCGGTCCT CGCCCGCCAC TGGGCCGAGG GCGGCAAGGG CGCGGAAGCG
CTGGCCCGGA CGGTCGCCGA CATCGTCGAC CACCAGCCGG GCCGGCATAC TTTCGTCTAC
GGCGACGAAG CGACGCTGTG GAACAAGATC GAGACCATCG CCACGAAAAT CTATGGCGCC
GCCGGCATCA GCGCCGACGC CAAGGTCAAG GCCCAGCTCG AAGCGTGGAA TGCCGACTAC
GGGCATTACC CGGTGTGCAT GGCCAAGACC CAGATGTCCT TCTCCACCGA CCCCAACGCC
AAGGGCGCGC CGAGTGGCCA CACCGTCGCC ATCCGCGAAG TCCGCCTGGC CAACGGCGCC
GGCTTCGTCG TCGCCATCGC CGGCGACATG ATGACCATGC CCGGCCTGCC CAAAGTCCCG
GCGGCCGAGC ACATCGACGT CGACGACGAC GGCCGGATCA GCGGCTTGTT CTGA
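
The GC content field in this record can in principle be recomputed from the gene sequence. The following is a minimal sketch, not the method used by the database: `gc_content` is a hypothetical helper that ignores the spaces and line breaks used to group the sequence into blocks of ten, and how the record's percentage was rounded is not known.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# Example on the first row of the gene sequence; pass the full sequence
# to compare against the GC content field of the record.
first_row = "ATGTCCGATA TCGAAATCGC GCAGCGCGCC AAGATGCTGC CCATCATTGA TCTGGCCCGC"
print(f"{gc_content(first_row):.1%}")
```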
 
Protein sequence
MSDIEIAQRA KMLPIIDLAR EKLGIPAASL DPYGHYKAKV ALDYIDGLKD RPDGKLILVT 
AISPTPAGEG KTTTTVGLGD ALNRIGKKTV MCLREPSLGP CFGVKGGAAG GGHAQVVPME
DINLHFTGDF HAVGVAHNLL SALIDNHINH GNALDIDPRR IQWKRVVDMN DRALRKIVVG
MGGTANGYLR EDGFDIVVAS EVMAILCLAT SMADLKERLG RIIVGYKSDG KTPVYARDLK
AHGAMAALLK DAIKPNLVQT LENNLAIIHG GPFANIAHGC NTVTATQTAL KLADYVVTEA
GFGADLGAEK FIDIKCRMAG LNPAAVVLVA TVRALKFHGG VKKEDLNQEN LAALEAGFAN
LERHVHNIRE HYGLPCVVSI NHFSFDTEAE IAWLMKKCEA LGVKAVLARH WAEGGKGAEA
LARTVADIVD HQPGRHTFVY GDEATLWNKI ETIATKIYGA AGISADAKVK AQLEAWNADY
GHYPVCMAKT QMSFSTDPNA KGAPSGHTVA IREVRLANGA GFVVAIAGDM MTMPGLPKVP
AAEHIDVDDD GRISGLF
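
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration under stated assumptions: it builds the codon table from the standard NCBI amino-acid string, whose codon assignments are shared by translation table 11 (the two tables differ only in which start codons are permitted), and `translate` is a hypothetical helper rather than any tool used to produce this record.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGTCCGATA TCGAAATCGC GCAGCGCGCC"))  # MSDIEIAQRA
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (with spaces removed) checks the whole record the same way.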