Gene PHATRDRAFT_42457 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42457 
Symbol 
ID7196660 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp120388 
End bp121877 
Gene Length1490 bp 
Protein Length450 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177020 
Protein GI219110537 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0895277 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AAAAGGACTT CCATCGCCCA CAGTTGGAAG ACGTAATCGA CGCCAGTTGA TGCTGTAGTC 
ACCATGAGAG TCAATATGGC ATCGGGAAAA GCTCGTTTCC ACGAAATCAG CGTGGCGCTG
ATGATATTAG TTTTGTCTAC GACCGAGATT TCCAGCGCTT TTGTTCCGCT GCCAATTCTC
TGTAGAGCCA AAGATTCCGT CTTGGGTTCG TCGGTAGGCG GCGATGGCCC ACCACCCTCT
TCCGGAAATA ATGGTGACAA GAACGACTGG GATGACTTCT TAGATCCCAA CTTTAAAGAA
TCGGAAGGTT TGCAAAAAGC AAGAGAGTAC ATGAGTGAAA ATAGTCTACC CATATCCTTC
GATGAGGAAG CAGACGATGG CTTACTTGTC AATGATGACA GCAATGGGCA AGCGCAAGAT
ATAGTGGTGG ATGAGAAGAA ATCGAATGTG TCATCCTCGG CACTTACTCG ACCCGACGGA
GACGGGGGAT TATTCACCTC GGGGCTAAGC GCAGAGCAGC TTGCCAAAAA TCCATATGTA
GCTGCCGTAT CCAGACTTAC GCCATCCGAG CTCATTAGCA AGTTCACTTC GACGGCACAT
CCCCGGGTAC AAAATGCTGT GCGGCAGACC GTGCTCGGCC TAATCGGAGG CCTACCCAAA
ATGGCGTTCG AAACTACCAC TATCACCACC GGGCAGCGGT TGGCGTCTCT CATGTTTCAG
CTTCAAATGA CAGGTTACAT GTTTAAGAAT GCAGAGTACA GGTTGAGTCT TCAACAAAGC
TTGGGCCTCG ATGGGCACTC CGTGAATCCG TCCACAGAAC GCTTGCTATC GGCAGTCGAC
GACGAAGGCA GTGATGATGA TAATGATGAT ACACAAATGG ATACGCTCAA GGGGAAAATT
CGAGGAAAGT TGCGCATCCG ATATCCCGGT TCAATGAAGA ACACATTAGA CGACCCAGAA
AACCAAAACG ACGTGGACAA TTCGAACGGT TTGCAAATGG AGGTTGATGC GGCTGCGTAC
ATGTCCGAGC TGCGATCGGA AGTCTCGCAA CTGAGAGATG AACTCAAAAT TACGCGCAGC
GCGAAGGAAG ATGCTCTTCG CAAAGATCTC TTACTCTACA TTCGAACACT CCCGGAAAAG
GAGCTTCGAT CACTGACCAA CACTATGGGT CCAGACGTAC TAGTGGCTAT GAAGGGCCTC
GTCAAAGCCG TCATGACCGG AATTGGGGAG GATGAAATAG GACCCGAGAC GGTTACAGAG
CAATCTAGCG AAGCCATGGC TCAACTATGT ATGTGGCAGC TCGCGATTGG CTACAATCTG
AGGACGTTGG AAGTACGGGA AGAGATGAAG AAGTCGTTAA AAGGTAGCAC TGTGGGTGGG
CAGGATGGCG ATTTGGCCAG TGGAGCGTTT GAGTAGTTTA CGTTAACAAG GCTCTTTTGG
CAAAGCTACA CCTTGCCTTT AATTATGTAT TCATAGCCAG TATTGCAAGT
 
Protein sequence
MRVNMASGKA RFHEISVALM ILVLSTTEIS SAFVPLPILC RAKDSVLGSS VGGDGPPPSS 
GNNGDKNDWD DFLDPNFKES EGLQKAREYM SENSLPISFD EEADDGLLVN DDSNGQAQDI
VVDEKKSNVS SSALTRPDGD GGLFTSGLSA EQLAKNPYVA AVSRLTPSEL ISKFTSTAHP
RVQNAVRQTV LGLIGGLPKM AFETTTITTG QRLASLMFQL QMTGYMFKNA EYRLSLQQSL
GLDGHSVNPS TERLLSAVDD EGSDDDNDDT QMDTLKGKIR GKLRIRYPGS MKNTLDDPEN
QNDVDNSNGL QMEVDAAAYM SELRSEVSQL RDELKITRSA KEDALRKDLL LYIRTLPEKE
LRSLTNTMGP DVLVAMKGLV KAVMTGIGED EIGPETVTEQ SSEAMAQLCM WQLAIGYNLR
TLEVREEMKK SLKGSTVGGQ DGDLASGAFE