Gene PHATRDRAFT_43101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_43101 
Symbol 
ID7196878 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp2033437 
End bp2035838 
Gene Length2402 bp 
Protein Length330 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177429 
Protein GI219111355 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTGCCATATG CGATATCGGT GTTCCGGGCG AAGCCAAAGA ACAGAAGTTT TGTTGGATGA 
GATCGCAAAA CTCTAAATGC GCGAAAGAAA ACAATGGGAA ACCATGCCTA TGGAAGTCGT
CAGCGGGAGA CGGTATCAAC GATAGTCCGT CGTTGGACGC AGTCCATCGA CGCTTCGCAA
GCAGACCACA AAAGAAAGAC TTCTTCTCTC TTTGTTCCAC TGCGCTTGTG GTGGCCCAGG
AATTTGCTCT AAGTTGCTTT TTGCTAGCTC GGCATCGTGT CGCACTCTAT TTCGAACAAT
CCCGATCAAC CCCAGAAAAT GCGTTTACAA TCAGTCAGAC CAACTTAGAC CAAGCGACCG
TTGCCATGTC TGCGGCCCTC ATGTTGGTGC TGTTTTGCAG TAGTCGCTCT AATACAATTC
CGCGACAGAA ACGAAGAGAG AAAGCTCAAC AGCGCTTGTC GGACGCAATT CTGCTTGGTA
TTGTACTCCG TCTCCTTGCC AGCGTTTTGC GAACCTTGAC AGCTTCTTAC TCCTCCGATA
CTGTCGAGGC CTTGGCCACT ACAGGTATGA CTTTGCACGT TGTAGCGTGT GACTATTCGT
ACGCCAACGG CCGACGTCCC CATGGAGAGA TAATTCGTCC TTTGATATCG TCGCAACGTC
CGGTCTTTCG AGGAGGCACC TTTTCTCTGA ATGCGGCTCT GTTTGCGACA ACACTATTGG
TGAGCCGGGT GGAATCCAAT AGCATGGCAT ACTTTTTAAT ATCACTTGCA ATCGTCATGT
TCGCCTTTTA CCCCGACGCA AGGCACGCTA TCGCCAATAG CTATCCCCCA TCGAGGAGCG
GTACGTCCGT GAGTTGCTAC ACATTTTCAA CTTCTTTGAT ATATTCTTGT GATAGCACTT
TTGATGGCAA TCCCTTTCTG TATTTACTAA ATCTGCTGCT TCATTCCGTC TCAACTCCGA
AACTCAGGAT TCCCGTGGCT AATTACAGCA GCAATCTCGG GCTCAACTTT GGTGTTACTG
AACAACCACG AAAAAATACT TTTCTTAGTA GGCATGACAT CGCTAGTTTT TGTGGTACCA
TTGTGGAATT ATATCTCTCA GTGCAACAAG GTGTGGTTTC GTGGACCCTG GGACATACCG
ACGCGTTCTT CAATGAAGTT GGACAATAAA TTGTGAGGGT GATGTCACCA GCTCATGCTT
GTTCGATTCC TCTTCTAGCC GTCATCAAGA TAAAGAATCA AAGACTCTCC TTTCTCTGCG
ACCTTTCCTC AAGATGGGAA AGAAGGAAGG GAGCCTTCTC TCCGTTGATA TTTTCAACAA
CAGGATTTGA AGTAATTTTC ACTTAGTGGA GATTTGCCCC CTTTGCAAGG AGGATCTGTT
TCATCTTCCA GTCAAGATGG CACGAAATTG GGTAGCATTT CTCGTCTTTT TGCTGTCAAA
AATATTACAT GTGATTCACC TTATCAATCC GGTGACAGGC CAAATGGTAC AAATGTCCTC
CAAAGCATAC TGGAGTGAAC CATTTTTGTC CCTTGATTAA GGCCGCCTGT TCTCGCATGA
TGTGTTACAT TGTGCTGGGA AAGGAAGTGG TATTTTTAAG AACAATCGTG TTACGGCGAA
TGACAAATCT CAAACAGGGT TCCAAATGAG CTGGTGTCGC GTTGGCATGT GAAGACAATA
TGGGTGTCAA CAATCATCAG TACAAGGAAC AATCGCACAT GAGCTATCTA ATGAAAGCAG
GTGATGTTTG CGTTGGCTAC AATTTGAAGG AAACACAATT ATCAGCGACG AAGCCGATTC
GCTTCGATCT GAATTGAAAC TCCTTTGGCG CAACTTTTAC GGAGCCGTCG CTAATGAAGA
ATCTGACGCT GTCAAAAGCG GCAAATGTGG CGCCTTCAGC GCTTGGATGT GGCCGTTGCC
GAGACAGCAG CAGCAACTGG TAAGGTTGTC AAACACGCTG TCGAAGCCGA TGAAATGGAC
AAAGAAGATT TCTGCAGGAA GTTGAGGCGG ATAATGGCAT GCGCTTGAAA ATGAATATTT
ACTAGAGCGA AAAAAGATGC ACCAGCCACT ATGGGAGTGG AAGGAGACAC TACTGAAGAT
GAAGACGGCA AGGACGATGA AGATGACCCA CAGATCACAC TCGACGAGTT GCTAGACGGG
CTGGTTTGCA CCAAGGCCCG GAAATTGACA TGGTTGATCC GCAAGTCCTC GATCCACTTG
TAAAGGGCGA AAAAGCAGCA AAAGACGGCA TTCGAGCTAG GAACATTTGT TACAAGGATG
CGGCTGTTCC AGTCTCGAGT GGATGGGGGA ATCAATTCCC AGGCGGGTTT GTGCAATAAA
GTCGGCAAAA ACCTTTCAAT ATTGATGTCG TGTCCCCAAC TTTCACAGTG AGGAGTATGC
AG
 
Protein sequence
MRSQNSKCAK ENNGKPCLWK SSAGDGINDS PSLDAVHRRF ASRPQKKDFF SLCSTALVVA 
QEFALSCFLL ARHRVALYFE QSRSTPENAF TISQTNLDQA TVAMSAALML VLFCSSRSNT
IPRQKRREKA QQRLSDAILL GIVLRLLASV LRTLTASYSS DTVEALATTG MTLHVVACDY
SYANGRRPHG EIIRPLISSQ RPVFRGGTFS LNAALFATTL LVSRVESNSM AYFLISLAIV
MFAFYPDARH AIANSYPPSR SGFPWLITAA ISGSTLVLLN NHEKILFLVG MTSLVFVVPL
WNYISQCNKV WFRGPWDIPT RSSMKLDNKL