Gene PHATRDRAFT_49229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49229 
SymbolFTHFS 
ID7195695 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011689 
Strand
Start bp296045 
End bp298201 
Gene Length2157 bp 
Protein Length666 aa 
Translation table 
GC content58% 
IMG OID 
Productfomate-tetrahydrofolate ligase 
Protein accessionXP_002183964 
Protein GI219127483 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.167975 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGTCAT CACCGGCGTC GTCCACGGAC GTTGCCTCGA CCAACGCGTT GGGGTATCCC 
AAGCTGCAGC CTCAACATCC CGTACCGACC GACATTCAAG TCAGTCAACA AATTGTCCAG
CAAGTTGGAC TCTTGCCTCT CTCCGACCTC GCCCAACAGT AAGTTGTCGG TTGTTATTTA
TGGACGGTAT CTACCGATGG ACGGAGAAGT AGACGTACTT TTACTTGTAC TCATAGTTGC
ACGCGTGTGA AACCTGCACT TCCTATGCAA TGCTCACTCT CACACTGACA CTCACACAAT
CACTCCCACA CTAGACTCGG CTTGACTCCG GACGAGATCA TTCCGTGGGG CATTGCCAAG
GCCAAGATCC CCTTGTCGGT CCGGGACACG CGTCGGGCTG TCCCCAACGG CAACTACGTT
GTCGTTACGG GGATTAACCC GACACCGCTT GGCGAAGGCA AGTCCACCAC CACCATTGGA
CTCGCGCAGG CCTTGGGTGC CGTGCGGGGA CGTCCCACCG TGGCCTGTAT CCGACAGCCC
TCGCAGGGAC CCACCTTTGG TATCAAGGGC GGCGCCGCCG GAGGCGGGTA CGCACAAGTC
GTACCCATGG AAGAATTCAA TCTACACTTG ACGGGAGATA TACACGCCGT CACGGCCGCC
AATAATCTAC TCGCCGCCGC GATTGATACA CGAATCTTTC ACGAAGACGC ACAGTCCGAT
CAAGCTCTCT TTCGCAGACT CTGTCCACCC AACAAACCAT TCTCGCCCGT TATGCAACGC
CGACTCCGCA AGCTCGGTAT CGACCCTCAC AAGGCACCGG CCGATTTGAC CCCCGCGGAG
CAGTCCAAAT TTGCCCGACT CGATATCGAT CCCGATACCA TTACCTGGCA ACGCGTACTC
GATACCTGTG ATCGACATTT GCGTGTCGTA CAAGTCGGAA TCGGTCCCAA CGAGAAAGTC
ACCCCGCGTA GCGACGATCC CGGACAACCC GCCAAACCGC GAGTCCAACA CGATCGCGTC
ACGGGCTTTG ATATTACCGT CGCCTCCGAA GTCATGGCCG TCCTCGCCTT GGCCCGTAGT
TTGCCGGATT TGCGCGACAA GCTCGGTGCC ATGGTGGTCG CCTACAGTCG GGCCGGCGAG
CCCGTCACGG CGGACGATTT GGGTTGTGGT GGCGCCCTCG CGGTCCTCAT GAAGGACGCC
ATTTTACCGA CGCTCATGCA AACTGTCGAA CGTACACCCG TCTTGGTCCA CGCCGGTCCT
TTCGCGAACA TTGCGACGGG AAACTCCTCC GTCGTGGCCG ACGAAATGGC GCTCAAAATG
GTCGGACCCG ACGGATACTG CGTCACGGAA GCCGGATTCG GCGCCGACAT TGGCATGGAA
AAGTTTTTCA ACATCAAATG CCGAGCCAGC GGACTGAAAC CCAAATGCGC CGTGATTGTC
GCCACGGTGC GGGCGCTAAA AATGCATGGC GGTGGCCCAC CCGTGTCGGC CGGCAAACCT
TTGCAGCCAG AGTACGTACA GGAAAATGTT GAACTCGTCC GCCGCGGCGC GGCTAATCTG
GCCCGGCACG TGGAAAACGC CAAAAAATTC GGTGTTAACG TGGTGGTGGC CGTGAACCAG
TTCCAAACCG ATACTCCCGC GGAGATTGAA GCCGTCCGGC AAGCCGCGCT GGAAGCGGGC
GCCTACGATG CCGTTCTGGC CAATCATTGG GCCGAAGGAG GACAAGGCGC CGCGGACCTC
GCAATCGCCG TCGAAAAAGC CTGTGCGGAC AATGACGAAG CCAATTTTCG ATTCCTCTAC
GACGTCAACT TGTCCATTGA AGAAAAGGTC AACGTTATTG CCAAGGAAAT CTACCGCGCC
GACGGTGTCG ACTTTTCCAA CACGGCCCGG GCGCAGATGG AAAAGTACGA GGCATCGGGC
TTTGGCAATT TGCCCATTTG TATCGCCAAG ACGCAGTATA GTTTTAGTTG CGATCCGTCC
GCCAAGGGCG CTCCTACCGG ATTTCGCGTT CCGGTCCGCG AAATCCGTAG CTGCGTCGGC
GCCGGATTTC TCTACCCCAT TTGCGGCGAC ATTATGACTA TTCCAGGACT CCCAACCCGG
CCCGGATTTT ACGACGTGGA CATTGATGAA AATGGTGGCG TTGTGGGACT GTTCTAA
 
Protein sequence
MSSSPASSTD VASTNALGYP KLQPQHPVPT DIQVSQQIVQ QVGLLPLSDL AQQLGLTPDE 
IIPWGIAKAK IPLSVRDTRR AVPNGNYVVV TGINPTPLGE GKSTTTIGLA QALGAVRGRP
TVACIRQPSQ GPTFGIKGGA AGGGYAQVVP MEEFNLHLTG DIHAVTAANN LLAAAIDTRI
FHEDAQSDQA LFRRLCPPNK PFSPVMQRRL RKLGIDPHKA PADLTPAEQS KFARLDIDPD
TITWQRVLDT CDRHLRVVQV GIGPNEKVTP RSDDPGQPAK PRVQHDRVTG FDITVASEVM
AVLALARSLP DLRDKLGAMV VAYSRAGEPV TADDLGCGGA LAVLMKDAIL PTLMQTVERT
PVLVHAGPFA NIATGNSSVV ADEMALKMVG PDGYCVTEAG FGADIGMEKF FNIKCRASGL
KPKCAVIVAT VRALKMHGGG PPVSAGKPLQ PEYVQENVEL VRRGAANLAR HVENAKKFGV
NVVVAVNQFQ TDTPAEIEAV RQAALEAGAY DAVLANHWAE GGQGAADLAI AVEKACADND
EANFRFLYDV NLSIEEKVNV IAKEIYRADG VDFSNTARAQ MEKYEASGFG NLPICIAKTQ
YSFSCDPSAK GAPTGFRVPV REIRSCVGAG FLYPICGDIM TIPGLPTRPG FYDVDIDENG
GVVGLF