Gene PHATRDRAFT_30810 details


Gene Information

Locus tag: PHATRDRAFT_30810
Symbol:
ID: 7198795
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011694
Strand:
Start bp: 206157
End bp: 207880
Gene Length: 1724 bp
Protein Length: 523 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002184914
Protein GI: 219129476
COG category:
COG ID:
TIGRFAM ID:
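
The reported gene length can be cross-checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which the record does not state explicitly but which matches the listed 1724 bp. The function names are hypothetical and exist only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Span of a feature, assuming 1-based inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_bases(protein_length_aa: int) -> int:
    """Number of bases needed to encode the protein, excluding any stop codon."""
    return 3 * protein_length_aa


# Values taken from the Gene Information fields above.
span = gene_length(206157, 207880)
coding = coding_bases(523)

print(span)           # 1724
print(coding)         # 1569
print(span - coding)  # 155
```

Under that assumption, the 523 aa protein accounts for 1569 of the 1724 bases in the span. The remaining 155 bases are not translated into the listed protein, which is consistent with the gene being marked as spliced.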


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGGTCACGTG AAAAACCATG GTGGATGTCT TTTTAATTGT CGCGACCGTG GTCGCCATTG 
TGATTCTTTT GATCATCGCT TCGTATCTCT TGGTTCACTA CCAACATCCC GACGACCACA
ATGATGCTTA CGTACCAAAG CTGATCGTTT TACTCGGCTT TGTCTTGGCT GGAGCGACTG
TCCTCATGTT GCCGCTGGAT GTCGCTAACA ACGAGGGCTA CGCCGGTAAG CCACGGAATC
CGCTGTTTCT AGTAGAAATT CTACGTAGAA CGGAAGCACG ACTGACAAGT CCCGTTTTAC
CTCCATCTAT TTGTTGACTG CATCTCAACG TTTTCTTATT GATGTTGTCT ACAGGTTGCG
AAGGCTACGA TACGGGATTA TGTGGTGGGC TCAATATGGA ACTCATGTGG GATATTGTGT
TTTGGATGAT TCCCATTTGG GTCTTTGTTT TGATCCCTTT CGCTACCTTC TATTACGAGG
CCGACGATGG CATGCTCATG GCCGGCACCG CCTACGCACC CAATCCAGTC AGGCAGTCGC
GTATTGGCCA AGCCATATGT TATCAACTGT TCGTTTTTGT CATTATCGGT GTCATTTTTG
CCGTCACTTA CATTAGTCTG TCGGACTCGA AAATTCCTGT CCAAGAATAC GTGGGGCCAG
CATTAGGGAA GGTTAATCAA GGGTTCACCT ACTCCGCGCA AAGAAACGCA ACCGACGATT
TGCTTCCTTT CGATTCCGAC GGATTGCAAC CTTGGGGAGA CTCGGATACC ACCTACCTAT
CAAACGTCGT GGACAACGGC GAGCAGACCC TGGTATTGCA GGTGTCCTTG AGTACCTTTT
ATGCTGGACT CATGGCGTGG TTGGGCTGGT TCCTGTTTGC CATTTTTGGA GGTATCGGCT
TGGCGGCACT TCCATTGGAC TTGTACTTGA TGTTCAAAAA TCGACCGCGG CATATGGATG
CGGCAGAATT TGCCGAAGCC CAATTGTCCC TGCGGGAACG GGTCAACGAA ATGGTAGACA
TTGGCGAACT TATCAAGATT GAACGGGAAC AAAAGGCGCA GGCCGGGCTA ACGTCGGCGT
TTGCCACCTT CTCGCTAAAT TCGGATACAC GGAAGGCAGC ACGCGATGAA AATCAAGCTG
TTCTGGGTTT CAAACAGGCT GTCTACCTTT TGGAACAGGA TGTGGAGGAC TTTCAGAATG
CAACCGTGAA TTACAAGAAG TACAATGTCC TGATACCCTA CATTGCTTTG CTGCTGAGCT
TGTGTGCCTT TATTGTCAGT ATATTCTGGT TCATTCACGT AATTGTTTAC GTCTTCCCCA
GTCCACCGTT GGCCCCATTT CTGAACAATT ACTTCGAGTG GTTTGACAAG TGGTTTCCGT
TATTTGGGGT ATTGTCGGTC GCACTCTTTG TTTCGTATTT ACTTTTAGCG GCACTTAAAG
GCTGCTTCAA ATTTGGCATC CGTTTCTTGT TCTTTCACAT TCATCCTATG AAAGTCGGCA
AAACCTACAT GAGTTCCTTT ATGTTCAATA TTGCCCTGGT CCTATTGTGC GCCTTGCCCG
CGGTTCAGTT TTCGCAGGCG GCCTTTGCCG ACTACGCAGC CTTTGCAGAA ATTCGACAAA
TCTTTGGCGT ACAGATACAG TTTTTGCAAT TCTTTTCCTT CTTCTGGACG AACAACGTAT
TTATTTACTG CTTCTTAGCC TTCACAGTGC TAACGTCCAT CTAT
 
Protein sequence
MVDVFLIVAT VVAIVILLII ASYLLVHYQH PDDHNDAYVP KLIVLLGFVL AGATVLMLPL 
DVANNEGYAG YDTGLCGGLN MELMWDIVFW MIPIWVFVLI PFATFYYEAD DGMLMAGTAY
APNPVRQSRI GQAICYQLFV FVIIGVIFAV TYISLSDSKI PVQEYVGPAL GKVNQGFTYS
AQRNATDDLL PFDSDGLQPW GDSDTTYLSN VVDNGEQTLV LQVSLSTFYA GLMAWLGWFL
FAIFGGIGLA ALPLDLYLMF KNRPRHMDAA EFAEAQLSLR ERVNEMVDIG ELIKIEREQK
AQAGLTSAFA TFSLNSDTRK AARDENQAVL GFKQAVYLLE QDVEDFQNAT VNYKKYNVLI
PYIALLLSLC AFIVSIFWFI HVIVYVFPSP PLAPFLNNYF EWFDKWFPLF GVLSVALFVS
YLLLAALKGC FKFGIRFLFF HIHPMKVGKT YMSSFMFNIA LVLLCALPAV QFSQAAFADY
AAFAEIRQIF GVQIQFLQFF SFFWTNNVFI YCFLAFTVLT SIY
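
The sequences above are displayed in space-separated blocks of ten residues, so the spacing has to be stripped before the text can be used for length or GC checks. The following is a minimal sketch of that clean-up, not the method used to produce this record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and GC content is assumed here to mean the fraction of G and C bases over the full sequence length.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (assumed definition)."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First displayed line of the gene sequence, copied from the record above.
first_line = "CGGTCACGTG AAAAACCATG GTGGATGTCT TTTTAATTGT CGCGACCGTG GTCGCCATTG"

print(len(clean_sequence(first_line)))  # 60
print(round(gc_fraction(first_line), 2))
```

Passing the full gene sequence text to `clean_sequence` should give a string whose length can be compared with the listed Gene Length, and `gc_fraction` on the same text can be compared with the listed GC content.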