Gene PHATRDRAFT_53980 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_53980 
SymbolGEL1 
ID7196403 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp1101597 
End bp1103003 
Gene Length1407 bp 
Protein Length403 aa 
Translation table 
GC content48% 
IMG OID 
Productgelsolin 
Protein accessionXP_002177218 
Protein GI219110933 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGTTC GTGAAAAATT GAACTGGAAA GATACCAATC TTGCTTTGTT CGGTTCAGAC 
CTTGAAAAGA AAATCAAGGC GGCGGCTGCA GATAGTGAAC CACAATGGAG CAACATTGGT
ACATCGGTGG CGTTGCATAT TTGGCGTATC GAACAGTTTA TGGTCAAGCC TTGGCCAAGC
AACAAACACG GAAAGGTATG AAAAATGCAG ACGTGACATT AGGAACACCT ATCAAGTTGC
GAGACCTTAC AGTGTATAAC TTGGTCGTTT CCTTCCAAGT TTCACAAAGG AGATTCCTAC
GTCGTCCTGA ACACATACAA GCCAGAGCCG AGCAAGCCAA AGCTTGCTCA TGATATTCAT
ATCTGGATCG GAGACAACAG GTAAGAAGCT GTGTTTCCAA AGAAAACGCT CATTATGAGC
TTTCTAAATT GTAGGAGAGT GAATACTGTC TATATGTTCG CTTACTGCGC GGTATACCAA
TGCGTCGATA GCTCCCAAGA TGAATATGGA ACAGCAGCGT ACAAAATGGT GGAACTCGAT
GACAAGCTTG GCGGTACTGC TGTCCAACAC CGCGAAGTTC AGGGCAAAGA ATCTACCCTG
TTTCAAAAAT ATTTTGGGAA TCACTTAACC TATTTGGAAG GTGGCGTTGA GTCTGGCTTT
CACCATGTGG AGTGCAGTGC AGCGGAACCT CATTTGTACA AGATCAAAGG AACTCGCAAG
TCCGACACAC TGCGCCTAAC CCAAGAGCCT GTACGCCGCA ATTCTCTTAA CACTGGAGAT
GTCTTCGTTT TGACGGCGGG GGAAGAGGCC GTCTGGATTT GGGTGGGCAA AGAATCGAAT
CAGGACGAAC AAGCGAAAGG TGTGGAAGTG GCACAGGCCT TCTGCAAAAA AGGAAACGTG
ATTGTCTTGA ATCAGGGCGT CAACGACAAT GAAAAAGAGG CCACCGAGTT CTGGGCCTTC
TTACCGGGCA AAGTTGCAGT TTTAGGACCA ATCAAAAAGT CGGTTCGGGT ACAGGCTGCC
GACGAGAAAG ACAATAAAAG CCGAGCTTTT GTACCGGTCC TTTTTCAGAT ACCGGAGCAA
ACCGGTGGCA AGCTTCGCAA AGTGGCCACC GCCAAGAAGC AACCGGTCGG GCCAACCCGG
GACATGCAAT ATTTGTTGCC GCGTTCAACG TTGCAGAGCA AGCACGGCTA TTTGCTAGAT
ACAGGCTTTC ACATTTTCGT TTGGTTGGGC AGCCAGGCAC CTACTATCTG TAAGGCAAAT
GCTATGCCTC AAGCGCACAT GTACTTTTCG TCTTTTCGAC GCCCCTTGTT GCCTTTGACG
GTTGTCAAAG AACGACAGGA GACGGATTTG TTCCAGGAAC GATTTCACGA AGCTGGTAGC
GCGGGTTGCG CTTGTGTTCT CATGTAG
 
Protein sequence
MNVREKLNWK DTNLALFGSD LEKKIKAAAA DSEPQWSNIG TSVALHIWRI EQFMVKPWPS 
NKHGKFHKGD SYVVLNTYKP EPSKPKLAHD IHIWIGDNSS QDEYGTAAYK MVELDDKLGG
TAVQHREVQG KESTLFQKYF GNHLTYLEGG VESGFHHVEC SAAEPHLYKI KGTRKSDTLR
LTQEPVRRNS LNTGDVFVLT AGEEAVWIWV GKESNQDEQA KGVEVAQAFC KKGNVIVLNQ
GVNDNEKEAT EFWAFLPGKV AVLGPIKKSV RVQAADEKDN KSRAFVPVLF QIPEQTGGKL
RKVATAKKQP VGPTRDMQYL LPRSTLQSKH GYLLDTGFHI FVWLGSQAPT ICKANAMPQA
HMYFSSFRRP LLPLTVVKER QETDLFQERF HEAGSAGCAC VLM