Gene PHATRDRAFT_44404 details


Gene Information

Locus tag: PHATRDRAFT_44404
Symbol:
ID: 7197867
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011672
Strand:
Start bp: 446938
End bp: 449113
Gene Length: 2176 bp
Protein Length: 303 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002178227
Protein GI: 219114863
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.262575
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGAAGG AAAGGACCCT GCAATTTTGG GATGATTACT ACAAAACCCA TCAAGATGAA 
CAATCAAAGG AATGGATTCT AAAACCGTCA ACTCTTCTTC TCCAAAAACT AGCTCACCAT
GTACCACGCA AACAAAACTG TCGCATTTTG GAAATCGGTT GTGGGACGTC TAGCTTAGCA
AGGGATTTAT TTCGATACCT TCTTGACACA GAGCCTCGTG CACCAAACAA TTCTACCAAT
GATCAACTCG TTTGGCAAGA AACGCTACAA GTGATCGCGA CCGACGTCAG CTCAACTTGC
ATTAAAAAAT GTCAAGAGAG AGACGCCAAT ATTTCTGATG AGCGTTTGAG ATACCAGACT
CTAAATGTTG TTGAAACTTG CCCAGAATTG AGGGGTCAAT TTGATGTGAT TTTGGACAAG
GGTTGTTTGG ATACGTTTTT GTTTCGTTCA AGACAGCGTG GTGGGGGGCG ACAGCCCTAT
GGGGTTCTAC TATCCACTGT TTTGAATAAT TTGCAGTCAT GGTTATGCCA GCCGTCTGGC
CAGTATCTTG TTTTGACGCC ACGTTCGAAG CTTAAGTCTG TTCGCGACTA TGTTGGGTTT
GCCTCCGTGG AGCGACAACC TCTAGATGTT TCTAGACTTG ATTTGGGTGA CTTGGAAGGA
ACTGACCCAC AGGATGAACC CTCTCTGCAG CCCGAGCCGC TCTTTTTGTA CATTTGTAAG
CCATGCCTCG AATATGATCC ATCTCGTGGA GAGTCGTTTC CGCGGGATCA TCCTGTGCCA
CATGACGTCG ATACCTGCTC AGTTTGTAGC ATGTCTTTTG CCGCGTACCG AAGAGGCGAA
CGTTTGTCTG GACGGGGGGA TGCATATTGG TCCCGGCAGT GGAGAGGGCA TTGTCTGCAT
TGCAAAGGAT GACTAGATGA GATTCGCAAA TCCAGAGGCG CTCGCATACA GACCATCTCG
ATCGAATCAC ATCGCACGAG GGCACAAGAG GTAGTTCCTT AATATTTTGC TCTCTCCTTG
AAAACATGGG GCTTGCCCGC CCGATGCATT CGCTATTCAT GTACAGGACT CAAACGGTAC
AATACGCGAA ACTGGATCTT TAGTGTGCAT TACCTTGAAG CACATAATTC CATGTTCAAC
CGAGTCTCAG ATTGAACGGG ATGACTCATC CGAAGATTCT ATCAAAATGC CGCTACTATC
CTCAAGCCCG CTGAGTTCAT CAGGGATGAC CTTAGTCGCA TAAGTAGCTT TACAGGGTCC
AAAGTCTTCC TCGAAGTAGT TGCGACTAGT CCAAATACCT CCTTTAGAGA TCCATGCGAG
CTGAATCACG CGACTAAAAA GGAGAACAGG AGTACTTTCT CCTTTCAAGC CGCGACGGTA
GACATACGAT TCTCCGACAA CCTCGCCAAT CCGATGTGTT TGGCTGCCAG GTAATACCAG
GCGGTCTCCC ATTTCAGGTC CACCATGGGT AAAACTTGGT CGAATTGTTT TTAGATTTGG
CAGCACATCA TCTGCCACTG AACGTTTGCT TGATCGTGAG CAAAACGCAG TTCTCTTGTT
TCTTTTCTGT GAAGGCTTCT GCTGTATCGT TGCTTTCCTC CGTCTTTGAG GACGGCGGAT
AAGCTTGCAT TCGGGTATCT TGGCGACCGT TTTATCATAT TTCTTATCGA GATTGGATAG
GGACGTGTCG GAAGACCTCG TTTTTTGTCG TTCCGTAGCC TTCTCGAAGG CCAATTGCTG
TCCGTTTGTC TTGGCTGACG AAAGCGAGGA TTCATACTGT TTGCCTTGCA CTTCGGATTT
GATTCGCTGC TCCAGTGCTT CGGTTGCTTC TGCAGGAGTA CGACGGTTGC GGTGTGTTCG
CAGTGCCTTC ATTTCAATAT CTGCGTCCTT CACCCAATCA GATTCTTCCG TTGAAAGCCA
TATTACACGC ACTAGAGTGT TCCCGTCTTC GTCGTGAACA TCCCGACTAT CGCTAACGAG
GAAAGCGTGT TCTGACGATT TTTTGCCGCA TTTTACCATG ACGAGATTGT CGGCATTCAT
CTTTCCGCGA CTACCCATTT TGTCCGGACG TGTGTTATCG GGTGTACATT GGACCAAAGC
TTTCTACCTT CCGGTTGTTT TGAAATTTTT TAATTGGATC TCTTCAACAT GAATATACTC
GCGACCGTAC CATGTG
 
Protein sequence
MQKERTLQFW DDYYKTHQDE QSKEWILKPS TLLLQKLAHH VPRKQNCRIL EIGCGTSSLA 
RDLFRYLLDT EPRAPNNSTN DQLVWQETLQ VIATDVSSTC IKKCQERDAN ISDERLRYQT
LNVVETCPEL RGQFDVILDK GCLDTFLFRS RQRGGGRQPY GVLLSTVLNN LQSWLCQPSG
QYLVLTPRSK LKSVRDYVGF ASVERQPLDV SRLDLGDLEG TDPQDEPSLQ PEPLFLYICK
PCLEYDPSRG ESFPRDHPVP HDVDTCSVCS MSFAAYRRGE RLSGRGDAYW SRQWRGHCLH
CKG