Gene PHATRDRAFT_32550 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_32550 
Symbol 
ID7197103 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011670 
Strand
Start bp216441 
End bp217622 
Gene Length1182 bp 
Protein Length393 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177886 
Protein GI219112269 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCATGG GATCAACCTT AACAAAATCG AACAAAGGCA TTGCGGATGA AAAGGCTAGA 
AAGGCTTCAC CGGCCAAGGC TTACGCTGAT CTCGCCAAGG CCAAGCTTTC CGGTCTGGTG
GTCGCCACTA CTGCAGCCGG ATTTGTCGCG ACAGGCGGTC CACTTTCAAC CCAGCTAGAC
GTTTTCACAG CATGCGTTGT TGGCACAGCC CTATGCTCCT CATCAGCTGC AGCTTGGAAT
CAAATTTTGG AGATTCCTCG GGATGAAAAA ATGAAGCGAA CCCAACAACG ACCACTGATT
ACTGGTGCGC TCACACTGTC GAAAGCGAAA TCGGCTGCCG TGGTCTGGGG TGCTTCGGGT
GCAGCCTTGC TGGCAGCGGG GACTGATCCC GTTACTACCA CGTTAGGCGT TGGCAATATT
GCGCTCTACG CCGGGTTGTA CACGTACATG AAGCCTCGGT CCATCTACAA TACGTGGGTG
GGTGCTGTTG TAGGAGCAAT ACCTCCGGTA ATGGGCTGGA CCGCGGCGAC AGGAGGATCC
ATTATGGATA TGGAAGCTTT GATGCTCGGA GGCATATTGT ATCTGTGGCA AATGCCACAC
TTTTTTGCGT TGTCCTACAT GTACCGGGAA GATTACAAAC GTGGTGGTTT CCAAATGGTA
CCGTGTTTGG AAGCGGATGG TGTCCAAACA GCGAACATAG TTGTCCGATA CGCCTGGTAT
TTGAGTGCTG TCCCGTTTGT ATGCGCTTTG ACGAGCGTGA CAAGCAGTAT GTTTGCTTTG
GAAGGCGTTG CGTTGAACGC TTACGCCTTA ACGGTGGCGC ATAAGTTCAA ACGGGAGCGC
ACGAACGCTA ACGCACGCAA AATATTTTTG ACATCCCTCT GGTATCTACC ATCCTTACTA
ATGCTGTTTT TGCTACACTC CAAAACCTGG GATGATGAGG AAGAAAAGAC CAAGGATCCA
ATCGCTAATT TCTTGTTTAC GCAGATTCAT TCTATTCGCG ACAAAGGAAG GGACTTGTGC
GTTCACGAAC AAGTAGTGGC AACTCATTCC GATGGCAAAG AAGCATGCCC AGTCACCGTT
GCGGCTAAAC AAACCAGAAA GGGAGTGCAA AAAGTAAAGT CGACTGCGGA TTCAGCGACC
GATGCTATTC AGGAGAAGTC CACAAAGAGT AGAGAAACGT AA
 
Protein sequence
MGMGSTLTKS NKGIADEKAR KASPAKAYAD LAKAKLSGLV VATTAAGFVA TGGPLSTQLD 
VFTACVVGTA LCSSSAAAWN QILEIPRDEK MKRTQQRPLI TGALTLSKAK SAAVVWGASG
AALLAAGTDP VTTTLGVGNI ALYAGLYTYM KPRSIYNTWV GAVVGAIPPV MGWTAATGGS
IMDMEALMLG GILYLWQMPH FFALSYMYRE DYKRGGFQMV PCLEADGVQT ANIVVRYAWY
LSAVPFVCAL TSVTSSMFAL EGVALNAYAL TVAHKFKRER TNANARKIFL TSLWYLPSLL
MLFLLHSKTW DDEEEKTKDP IANFLFTQIH SIRDKGRDLC VHEQVVATHS DGKEACPVTV
AAKQTRKGVQ KVKSTADSAT DAIQEKSTKS RET