Gene PHATRDRAFT_35949 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_35949 
Symbol 
ID7201317 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011677 
Strand
Start bp158939 
End bp160236 
Gene Length1298 bp 
Protein Length301 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180584 
Protein GI219119658 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAAGC AGCAGCAACA ACAACGACCC CTCGCGACTT CGGCCGTGCC TCCTCCGAGT 
CTATTGTCTC AAGTTGGCAT GGCTGGTTCC GCAGCCGTCA TTACAGTCTC TTTCATTCAT
CCCATTGACG TTGTCAAGGT AAGCCTACCG CACATTCTCG TTGTCGGTGA ATCACGACGT
GCGAACCACG GGGTCGAATC TCGACCGTAT TGGAACCCAC CGCACCAAGT TTGTGAACGC
CACACTCTTC AGTATTCGAT CCACTGTAGC GCACGACAGT GAGATTCGAC GGGAAGATCC
GGAACACTGC CGGCACGGAC ACGAAACTGT CTTCCGAACG AACTCTTTTG TCGTGCACTA
TTTGAATGAA AATGGTATCC TAGTTTGTTA GTGCGCGACG TGACGTCGAT GGAGAGTGTA
GGTTGACTGT GCGCGCATTC ATTCTGACTG TCCCTTTGTT ACCACTGCCA TTGCTGGGAC
TGCAACTCAC CTTGACTCTT TCCATTACAC TCTTTCTGCT GCCTTTCTAG ACGCGCATTC
AAATTTCTGC CGAATACGGA AACATGGGTA TGTTTGGTAC GATCAAGAGT GTGGTCGGCG
AAGAAGGTGT TCTCGGTTTG TGGAAGGGAG TCAACGCGGC CTGGCTGCGG GAAGCATCCT
ATACCTCGCT CCGCCTCGGT CTTTACGAAC CCATCAAGGT GGTCTTTGGA GCCGCCGACC
CGGAGACGGC TACCTTTATG AAAAAATTCT TGGCCGGTAG TGCCGCGGGT GCGATTGGTT
CAATAGCGGG CAATCCCTTT GATGTCCTCA AAACAAAAAT GATGGCATCC AAGGGCAAGC
AAGTTCCTTC CATGGTCAAG ACGGCCAAGG ATCTCTACGC CAACCAGGGA GTTGGTGGAT
TTTACCGTGG TATCGACTCG AACATTGTGC GTGCCATGGT TCTGAACGGA ACCAAGATGG
GGGTTTACGA TCAATCCAAA GGCTACGTCG TTGCCGCCAC CGGTCTCGCC AAGACCTCGC
TCACCACACA GTTCCTGTCC GCCGTCACGG CCGGCTTCTT CATGACCTGC ACCGTCTCTC
CTTTTGATAT GATCCGAACC CGACTGATGA ACCAGCCATC CGATGCCAAG ATCTACAACA
ACGCCTTGGA CTGTATGATC AAGATTGCCA AGAACGAAGG ACCCTTGACC TTCTGGCGAG
GATTCATGCC CATCTGGTCG CGATTCGCCC CCACCACAAC CCTGCAGCTC GTCATTTTCG
AACAGCTACG TGGCATGATG GGCATGAAGG CTCTCTAA
 
Protein sequence
MMKQQQQQRP LATSAVPPPS LLSQVGMAGS AAVITVSFIH PIDVVKTRIQ ISAEYGNMGM 
FGTIKSVVGE EGVLGLWKGV NAAWLREASY TSLRLGLYEP IKVVFGAADP ETATFMKKFL
AGSAAGAIGS IAGNPFDVLK TKMMASKGKQ VPSMVKTAKD LYANQGVGGF YRGIDSNIVR
AMVLNGTKMG VYDQSKGYVV AATGLAKTSL TTQFLSAVTA GFFMTCTVSP FDMIRTRLMN
QPSDAKIYNN ALDCMIKIAK NEGPLTFWRG FMPIWSRFAP TTTLQLVIFE QLRGMMGMKA
L