Gene PHATRDRAFT_14981 details

Gene Information

Locus tag: PHATRDRAFT_14981
Symbol:
ID: 7203732
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011685
Strand:
Start bp: 74957
End bp: 76138
Gene Length: 1182 bp
Protein Length: 351 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002182765
Protein GI: 219124973
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGTGGTCTC GTTTTGACCA AGGTTGTACG ATTCGGTTAC TTTGTCCACA CAAACATTCG 
GATTCAACAC ATGGGTTGCT CAGTTTGTTG GAATCAGAAT GGACGTGTAT GATCGGTGCA
AACGCCTACT TGACACCACC ATCGGGGTCG CAAGGATTTG CTCCACATTA CGACGACATT
GAAGCCTTTT GTTTGCAGTT GGAAGGAAAG AAGCGGTGGA AAGTCTACGC CCCGTTGCAA
AAGTCGGAAC GCTTGCCCCG CACTAGCAGC GAAGATTATG TAGAAGCCGA CTTGAGGGAT
GTGGAACCCG CGCTGGACGT TGTGCTCAAA CCGGGAGATG TTTTGTACAT GCCCCGAGGC
TGGATTCATC AGGCATGCAC GATCGATGGT ACGGATGGTT ATTCGTTGCA CTTGACGGTT
TCTGCCATGC AACAGTGGGC TTGGGCGGAT TTAATGGAAT TACTCCTACC GGAAGCCTTG
CAATCGGCAG CATCTGGCGA TTCCACCATG TTGCGCCAAG GACTACCACG TGGTTTTCTA
AATTATATGG GTGCTATGTA CGACCAAAAG GATACGGCGG AAATTCTTGA ACAGAAGGCT
GAGCAGGACA GGACCGCAGC GATGGACGAA ACTGGAGGTG ATGGCGAGGA CGAATCGGGC
GAGACTATAG ACCACGACCA TATAAAACGT AAAGAAATTC TAGTGCAGAA TGAAAAGTTT
CGACAAGAAG CCAAAAAGAA GATAATGAAG GTTGCGAAAG AAGCGATAGA TATGCTTGAC
GCAGCTTGTG ATCAGATTGG CAAACGATTC TTGTCTGACC GAGTACCACC CGTGCTAACA
CACCTGGAAC GTTCAATGAC GGTGCACGAA TCCGACGCGA AGGTATTACC GCAAACCTTG
TGTCGTATGG CCCGCCCTGG TTCCGGAAGA CTCGTTCTCG AAGCAGGTAA GGCTGTGCTG
TATCACTGCG CAGACAATTC TCGAGTGTAT CACGAATTAC CACTCAGTCC AATGGAATTC
GAAATGGACG ACGCACCCGC GATGGAACAG CTATTGACCA CCACGGAACA CGATTGGGTT
CGCGTAGCTG ATTTGATTCA CGACAGTATC GAAGACAAGG TAGGCGTAGC GCAGGCCTTG
TACGACGAAG GCATTTTATG CATCCAAACA AGCGACATGT AG
 
Protein sequence
LWSRFDQGCT IRLLCPHKHS DSTHGLLSLL ESEWTCMIGA NAYLTPPSGS QGFAPHYDDI 
EAFCLQLEGK KRWKVYAPLQ KSERLPRTSS EDYVEADLRD VEPALDVVLK PGDVLYMPRG
WIHQACTIDG TDGYSLHLTV SAMQQWAWAD LMELLLPEAL QSAASGDSTM LRQGLPRGFL
NYMGAMYDQK DTAEILEQKA EQDRTAAMDE TGAIDMLDAA CDQIGKRFLS DRVPPVLTHL
ERSMTVHESD AKVLPQTLCR MARPGSGRLV LEAGKAVLYH CADNSRVYHE LPLSPMEFEM
DDAPAMEQLL TTTEHDWVRV ADLIHDSIED KVGVAQALYD EGILCIQTSD M
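
Cross-checking the record

The length and GC fields in this record can be recomputed from the coordinates and the sequence text. The following is a minimal Python sketch, not part of the original record. The helper names (clean_sequence, span_length, gc_percent) are hypothetical, and the code assumes that Start bp and End bp are 1-based inclusive coordinates, which is consistent with the listed gene length of 1182 bp.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks that group residues in blocks of ten."""
    return "".join(raw.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in seq)
    return 100.0 * gc_count / len(seq)


# Coordinates taken from the record: Start bp 74957, End bp 76138.
print(span_length(74957, 76138))  # 1182
```

Passing the text under "Gene sequence" to gc_percent gives a value to compare with the GC content field. Note that the 351 aa protein is shorter than 1182 / 3 = 394 codons, which fits the record marking the gene as spliced.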