Gene PHATRDRAFT_45819 details

Gene Information

Locus tag: PHATRDRAFT_45819
Symbol:
ID: 7200824
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011676
Strand:
Start bp: 372945
End bp: 374145
Gene Length: 1201 bp
Protein Length: 258 aa
Translation table:
GC content: 45%
IMG OID:
Product: predicted protein
Protein accession: XP_002180028
Protein GI: 219118515
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.178302
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCCATT CCTTCATGCA GGCTTCACTG CTTTTCCTCC TTATTTTGCG TGTCCTAGTT 
GCTCCGGTAT GCAGTTTTTC TCGAGCCACA CTATAATTCT AATTCACCCA GTGCCGCACG
TTCACAGTAA AAAGGATGGA TACGCCCTTC TTGATACAAA ATACAGTCAG TCAAAACAAG
CGCATGTTTG AGCATGCACG CTCATACTAT GCATAGTCGA ATGCTTTCGT CGTGCGATGA
ACCAACAATG AGTACCTACC GAGATGCGAC GCATTCCTTC CCTCGACAGA TACATGCAGA
AGAAGGGATC CATAAGGAGG ACAATCGCAT GTTTCTACTA GATTCCTTCG ATAGTTCCGC
AGTTCACCAC CGTTCTGTTA GCTCGCATTG TCACGATATC TCGCGGCTGG ACAGAGAATT
GCCACTAGGC TTGGAATTTA AACCTGCGGA ATGGGACGTT GTAAGTGCAT GGTAGTGGGC
GGCATTTTGG ATTGTGTGGT GGTATTGTTT GATCTCACAA ATATATTACT CCAATGGAGC
AGATTTGTGG GAGGGGGAAA GCTAGTTTCG ACCATGTCGG AAACAGGCGA CTGCGGCTCC
TTGTTGCCAA TAGCATGCAT TCATATGTAG AGGCAAAGTC TAGAGTTGAT AAAACTGCCA
TAGTCCAAAC TATTGTTGAG CAGATACGCG AAGCCAGTCC TAATGGAGGT TTTGTGAGAA
AAGACGACTT CGGCGAGTGG TACGAGATCG GAACAAAAGC TGCAAGGGAG AAGGTTGGAC
ACGCAATCCG CGACTGCTTG ACAGAACCTT TGAGGGGCAG ATCATTGAGT ACCTACCAAG
AACGACTGGA AAGCCTACAG GAAGTACAGG ACGAAGTGTT TCGTTCTCTG AAGATTGCTG
GTATTCAAGA AAGAGAAGGG CAAGCAAACA GTCCATACAA ACAAAATGGG TCCGCCTAAT
GTAAATCGAC GAAGAAAAGT TACTCTATCG AAGATTTTCA AGGAGAAGGT CGAGGTTTGT
AACTGCTCAT GATCCATGAA TTCCCTCCGA CGCCACCAGA TTCAGTAAAT GCCTGTCCTT
GTCCTTGAGT TTAAAGAAAG GAAAGGAGCA TAAATGCGTC CTCTCTTTAA CTCTTCTCCT
ACTAGCATGA TCTAGATAGA CCGTGCTCTT TAAAAGAGAT AGTAGTTAAT ATTTTGAGTT
G
 
Protein sequence
MPHSFMQASL LFLLILRVLV APSVKTSACL SMHAHTMHSR MLSSCDEPTM STYRDATHSF 
PRQIHAEEGI HKEDNRMFLL DSFDSSAVHH RSVSSHCHDI SRLDRELPLG LEFKPAEWDV
ICGRGKASFD HVGNRRLRLL VANSMHSYVE AKSRVDKTAI VQTIVEQIRE ASPNGGFVRK
DDFGEWYEIG TKAAREKVGH AIRDCLTEPL RGRSLSTYQE RLESLQEVQD EVFRSLKIAG
IQEREGQANS PYKQNGSA
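
Consistency check (illustrative)

The reported lengths can be checked against the coordinates and the blocked sequences with a short Python sketch. It assumes that Start bp and End bp are 1-based and inclusive, which agrees with the reported gene length of 1201 bp. The function names are hypothetical helpers written for this example and are not part of any database tool.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for empty input)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# Coordinates as listed under Gene Information.
print(span_length(372945, 374145))  # 1201

# Last line of the protein sequence: two blocks, 10 + 8 residues.
print(len(clean_sequence("IQEREGQANS PYKQNGSA")))  # 18
```

Passing the full gene sequence through clean_sequence and then gc_fraction gives a value that can be compared with the listed GC content of 45%.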