Gene PHATRDRAFT_33730 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_33730 
Symbol 
ID7198020 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011672 
Strand
Start bp227877 
End bp229279 
Gene Length1403 bp 
Protein Length402 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178455 
Protein GI219115319 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00194424 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACACAA CTTTCCAAAT TTCAAGCCTT GGAATGGGTT TTCGTGTTGA TAATTCCATT 
GGGCCTACTA AATCTTTCCC TATCCGTTGT GACAAGACTG ATATTCTCCC AATCCGTTGT
GACAAGATTG ATAGTACCAA AGAACCTATC ACGGTTACAT ACAGAGCTTT AACGTCCTAC
GAACGTTCTA GCCTTCCCGT TTACATTTCA AACAAGTCAT CGCCATATGA AACAGTCATT
GGCGATTCAG CTGCATCATA CAAACTCAAT CACGATTCTC AAACTAGTCA TAAATCTTAT
TAGTTCATCG ATTTCCATCA TTCGCGTTCT TGCGTTTGTA GAGCTCATCT ATATTTCGGT
CGCATCGAAA AAACATTTTG TGAGACAGTA AACCTTGAAA CGACACCTTT ACTCTAGCTA
ACTGACAGCG AAACTCATCG CAATCGTCGC TCGCTGTCAG CTTCGACGAG AAAATCGAAA
GAGTCGCGTA TGATATTTCG ACTCTACCAG GAAAAAGAAA CAGCCCGAAC ACAGACACAG
CTATGGGCGT CAGTCACGAC CAACAGATTC AACCGGAAGA AACCGAGGCT ATTCTATTGG
AGCGCATGCA AGAGATGCAG GTGGAACTGA GGCTACTGTC TCCCTATGAT CGAGATTGCC
TTGACTTGGC CATGCGCAAA TGCCCCGCTC TCGCCAGCGA CCGATCGTTT CAAGTTTCGT
TTTTGCGGAC GGAAGTGCTG GACGCCAAAC GTGCCGCGAA ACGGTACGCT ACATACTGGA
AGCATCGAGT GAACCTGTTT GGTCCGGTCC ATGCCTTTTT GCCATTGGTA ATCAAAGATG
AAGCAGACGT GATGGAGGCG ACGGAACAGG CACCGAACAG CGCCTTGACA ACCGAAGACA
TGCACGTTTT GAAGTACGGT TTTACTCGTG TCGTGGCTGG CCATGGACGT GTGCTGCTCA
TTGACCCTTC TCGGACGGGA CCAAAAAGTG ACTACAAAGT CGATAGTATT GTGCGTTGTC
TCTTCTATAC CGCCACTAAG GCCCTCCTAG CAGATGAAGA AATGCAACGC AAAGGCGGGA
TATTCATTCT CGATATGAAA GGCAGTATCC GAGGCTTCGA TCGAGCGTTG ATTAAACGCT
TGACGGAAAC AACCAACGAC GGCTTCCCTT TGCGCTGCTC CGCCTGTTGC ATTTTACGTC
CACCGCTACT GGTGGACACA TTTGTCAAAA TAGCCAAGGT CTTCTTGCGA TCCCGTGTCC
GCAATCGCAT TCACGTGGTC ACATCGGAGT CCAAATTGGA AAAGCACGTC GGTGTGTACT
CTATGGAAGC GCTGTTTGAA GCAGCAGACC ACAAAGCTTG GCTGAATCAA ATGCGTACTG
AGGATTTTAA GCAATATAGG TAG
 
Protein sequence
MNTTFQISSL GMGFRVDNSI GPTKSFPIRC DKTDILPIRC DKIDSTKEPI TVTYRALTSY 
ERSSLPVYIS NKSSPYETRN SSQSSLAVSF DEKIERVAYD ISTLPGKRNS PNTDTAMGVS
HDQQIQPEET EAILLERMQE MQVELRLLSP YDRDCLDLAM RKCPALASDR SFQVSFLRTE
VLDAKRAAKR YATYWKHRVN LFGPVHAFLP LVIKDEADVM EATEQAPNSA LTTEDMHVLK
YGFTRVVAGH GRVLLIDPSR TGPKSDYKVD SIVRCLFYTA TKALLADEEM QRKGGIFILD
MKGSIRGFDR ALIKRLTETT NDGFPLRCSA CCILRPPLLV DTFVKIAKVF LRSRVRNRIH
VVTSESKLEK HVGVYSMEAL FEAADHKAWL NQMRTEDFKQ YR