Gene Mlab_1009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlab_1009 
Symbol 
ID4794627 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanocorpusculum labreanum Z 
KingdomArchaea 
Replicon accessionNC_008942 
Strand
Start bp1009057 
End bp1010739 
Gene Length1683 bp 
Protein Length560 aa 
Translation table11 
GC content51% 
IMG OID640099673 
Productformate-tetrahydrofolate ligase 
Protein accessionYP_001030445 
Protein GI124485829 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG2759] Formyltetrahydrofolate synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.749229 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGTGGA AATACTTGTT ATCTGACATC GAAATTGCAC AGCAGTGCAA AATGAAAAAA 
ATCACTGAGA TTGCCGCATC TCTCGACATT ACGCCGGATG AACTCGAACT CTACGGATCG
TACAAAGCAA AACTTGCTGA TTCACTGGAA AAACGTCTGG CGGATAAACC AAACGGCAAA
CTGATCCTGG TGACGGCCAT CAACCCGACG CCTGCCGGCG AGGGTAAGAC CACGACAACA
GTCGGTCTTG GTCAGGCGAT GCCCAAGATC GGTAAAAAGG CAGTCATTGC ACTTCGCGAA
CCATCTCTAG GTCCCGTATT CGGTGTTAAA GGCGGAGCTG CCGGCGGGGG ATACTCGCAA
GTCGTCCCGA TGGAAGACAT CAATCTCCAC TTCACCGGAG ATTTCCATGC GATTACCAGC
GCAAACAACC TCTTGTGTGC TATGATCGAC AATCATATCC AGCAGGGGAA TGCGCTCGAC
ATTGATACCC GCCGTATCAT CTTCAAGCGC TGTCTGGATA TGAACGATCG TGCTCTTCGA
AACATCATCA TCGGGCTTGG CGGTCAGACC AACGGCGTTC CGCGTGAGGA TCACTTTATG
ATCACGGTCG CAAGCGAGGT AATGGCGATT CTCTGTCTTG CAAATGATAT CGACGACCTT
AAAGAGCGTT TGGGGAATGT CATTTTTGGC TACAGCAGAA AAGGAACTCC TCTCTATGCC
CGTGATCTGA AGGCCGTCGG GGCTATGGCC GCCCTTCTCA AGGATGCCAT CAAACCGAAT
CTTGTCCAGA CGCTGGAAAA CACCCCCTGT TTTATCCACG GCGGACCGTT TGCCAACATC
GCACACGGCT GCAATTCGGT TCGTGCAACA AAACTTTCGC TGAAGATGGC AGATTATGTC
ATCACGGAGG CGGGTTTCGG TTCTGATCTT GGTGCTGAAA AGTTCTTCGA CATCAAATGC
CGGTATGCGG GCCTGACGCC GAATACTGTT GTTCTGGTGG CCACCGTTCG GGCACTGAAA
TACAATGGCG GTGTAAAGAA AGAGGACACC ACGATTCCAA ACGTTGCTGC GCTCAAAGCA
GGAATGGTCA ATCTTGAGGC GCACATACAG AACCTCCAGA CGTTTGGGGT TCCGGTTGTT
GTGGCGATCA ACCGGTTCTC TACCGATACG GACGAAGAAC TTGCCGTGCT TAAAGAATTC
TGCACAGCGC AGGGAGCCGA GTTTGCAATC TCGGAAGTCT TTGCCAAAGG TGGGGAAGGT
GGCGTTGAAC TGGCGAAAAA AGTCGTTGCC TCCTGCGAGA AGCCGCAGAA GTTCCAGTGT
CTCTACGAAC TGAATACACC GATAAAAGAG AAGATCAATG CTCTCGCGAC CCGGATATAC
GGAGCTGACG GGGTCGTTTA CTCTCCTGCC GCCGATGCTG CAATCAAAGA TATCGATGCT
CTTGGCAGAA GCAATCTGCC GATCTGTATG GCAAAGACCC AGTACTCGCT CTCCGATGAT
CCAAATAAAC TCGGCAGACC GAAGAACTTT GTTATTAACG CAGCAACGGT CAGGCTCTGT
AATGGTGCCG GATTTATCGT TGTCGAGACC GGAGACATCA TGACGCTCCC GGGACTGCCG
GCTGTACCTG CAGCATGTTC GATCGACGTA AACAATGATG GATATATCTC CGGTCTCTTC
TGA
 
Protein sequence
MGWKYLLSDI EIAQQCKMKK ITEIAASLDI TPDELELYGS YKAKLADSLE KRLADKPNGK 
LILVTAINPT PAGEGKTTTT VGLGQAMPKI GKKAVIALRE PSLGPVFGVK GGAAGGGYSQ
VVPMEDINLH FTGDFHAITS ANNLLCAMID NHIQQGNALD IDTRRIIFKR CLDMNDRALR
NIIIGLGGQT NGVPREDHFM ITVASEVMAI LCLANDIDDL KERLGNVIFG YSRKGTPLYA
RDLKAVGAMA ALLKDAIKPN LVQTLENTPC FIHGGPFANI AHGCNSVRAT KLSLKMADYV
ITEAGFGSDL GAEKFFDIKC RYAGLTPNTV VLVATVRALK YNGGVKKEDT TIPNVAALKA
GMVNLEAHIQ NLQTFGVPVV VAINRFSTDT DEELAVLKEF CTAQGAEFAI SEVFAKGGEG
GVELAKKVVA SCEKPQKFQC LYELNTPIKE KINALATRIY GADGVVYSPA ADAAIKDIDA
LGRSNLPICM AKTQYSLSDD PNKLGRPKNF VINAATVRLC NGAGFIVVET GDIMTLPGLP
AVPAACSIDV NNDGYISGLF