Gene PHATRDRAFT_41016 details

Gene Information

Locus tag: PHATRDRAFT_41016
Symbol:
ID: 7198930
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011695
Strand:
Start bp: 143131
End bp: 144533
Gene Length: 1403 bp
Protein Length: 460 aa
Translation table:
GC content: 53%
IMG OID:
Product: predicted protein
Protein accession: XP_002184975
Protein GI: 219129606
COG category:
COG ID:
TIGRFAM ID:
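The Start bp, End bp, and Gene Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the annotated span includes the stop codon; neither convention is stated in the record, and the function name `span_length` is purely illustrative.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
gene_length = span_length(143131, 144533)

# Bases needed to encode 460 residues plus one stop codon (assumption:
# the stop codon is counted inside the annotated span).
coding_bases = 3 * (460 + 1)

# Bases in the span that are not accounted for by the coding sequence.
non_coding_bases = gene_length - coding_bases

print(gene_length, coding_bases, non_coding_bases)
```

Under these assumptions the span is longer than the coding sequence alone requires, which is what one would expect for a record flagged as spliced.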


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTCCCCG TCGCCGAACT CGGTCGCCAG TACCAATCCG GAGACCGCGT CTGGCTGCGG 
GGACGACTAC AGTCCATCCG GGGCAAGGGA AAATCGTGGT TTCTCGTACT GCGACAAAAC
TCATTCGATA CCGTACAGGC ATGTTACTTC AAAAATGTCG ACGATGCGGA AGCCTCGCAA
AAAATGATAC GTTACTTGAA AACGCTCACG GCCGAAAGTG TCGTTGATCT CGAAGGGACC
TTGGTCGACG CGGACGTCAA ATCGTGTTCC GTCAAAAACG TCGAACTCAA CATTCACCGC
ATCCACACCG TTTCCAAAGC CGACGCCATC TTGCCATTTG AGGTAGAGGA TGCCGCCCGT
AGTGAGCAAG AAGTCGAGGC CTCGCAGAAC ACCGAACGTC CCTTTCCCCG TTTGGGGCAG
GAACTCCGTC TCGATCACCG TTGGATGGAT TTGCGCGCGC CGGCCAACAA CGCCATTATG
CGCATACAGT CCGCCGTGTG TCAACTTTTC GTGAAAGTCT CTACAGTCAG GGCTTTTGCG
AAATACACAC ACCCAAGCTA ATTGCCGGCG AAAGTGAAAG CGGCGCCGGC GTCTTTACCA
CGGACTATTT CGGAACCACG GCCTGTTTGG CCCAGTCACC ACAGCTCTAC AAACAAATGG
CCATTGCGTC CGATCTACCA CGCGTCTTTG AAATTGGACC CGTCTTTCGC GCCGAAAATT
CCAATACCCG CCGTCATCTC TGCGAGTTTA CCGGACTCGA TCTGGAAATG GCCATTGACG
ACCACTACTT GGAAACCTTG GAGGTTGTTC ACGAACTCTT TAAACATATT TTTACCGGCC
TCGAATCGCG TTGGGCGAAG GAATTGAACA TTATTCGGGA ACAGTACGAT TCCGAACCCG
TCGCTTTTAC GCCAGATCCG TGCGTGTTAC ACTGGCCCGA AGCCCTGGAA ATCCTTCAAA
ACGAAGGATT CGATATTGCT GACGGTATGC AGGATATGAA CGGTGCCATG GAACTCGCGT
TAGGTAGGGT GGTCAAGGAA AAGTACGGCA CTGACTTTTT CATGCTGGAT AAGTACCCGT
CCTCCATTCG GCCTTTCTAT ACCATGCCCG ACCCTGAAGA TTCCAGATAC TCGAATTCGT
ACGATATTTT TATTCGGGGA CAAGAAATAT GCTCCGGAGC CCAGCGGTGT CACGATCCGG
ATCTGGTCGA GAAAATTTTG CAAGAGAAAG GCATTGAAGT CGGTGACGGT CTCAAATCCT
ACATTGAGTC CTTTCGTCAC GGGGTCAGTC CCCACGCGGG TGCTGGGATC GGTCTGGAGC
GCGTCGTCTT TTTGTACCTC GGCCTCGACA ATGTTCGTAA AGCCTCCATG TTTCCGCGCG
ATCCCAACCG ATGCACACCC TAA
 
Protein sequence
MVPVAELGRQ YQSGDRVWLR GRLQSIRGKG KSWFLVLRQN SFDTVQACYF KNVDDAEASQ 
KMIRYLKTLT AESVVDLEGT LVDADVKSCS VKNVELNIHR IHTVSKADAI LPFEVEDAAR
SEQEVEASQN TERPFPRLGQ ELRLDHRWMD LRAPANNAIM RIQSAVCQLF GFCEIHTPKL
IAGESESGAG VFTTDYFGTT ACLAQSPQLY KQMAIASDLP RVFEIGPVFR AENSNTRRHL
CEFTGLDLEM AIDDHYLETL EVVHELFKHI FTGLESRWAK ELNIIREQYD SEPVAFTPDP
CVLHWPEALE ILQNEGFDIA DGMQDMNGAM ELALGRVVKE KYGTDFFMLD KYPSSIRPFY
TMPDPEDSRY SNSYDIFIRG QEICSGAQRC HDPDLVEKIL QEKGIEVGDG LKSYIESFRH
GVSPHAGAGI GLERVVFLYL GLDNVRKASM FPRDPNRCTP
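
The sequences above are laid out in space-separated blocks of 10 characters, 60 per line. A minimal sketch of how such a block could be collapsed into one contiguous string and its GC percentage computed is shown here. The helper names `join_blocks` and `gc_percent` are hypothetical, and the input is a toy string, not the gene sequence from this record.

```python
def join_blocks(blocked: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy input in the same blocked layout (illustrative only).
toy = "ATGCGCGCAT ATGC\nGGCC"
seq = join_blocks(toy)
print(len(seq), round(gc_percent(seq), 1))
```

Applied to the full gene sequence, the resulting length can be compared with the Gene Length field and the percentage with the GC content field.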