Gene PHATRDRAFT_47359 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47359 
Symbol 
ID7202511 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp386426 
End bp388002 
Gene Length1577 bp 
Protein Length502 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181547 
Protein GI219122428 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGGCCA AGGCCATGCC CAAGTTACGT ATGGACGGCT CTCCTTTAAA ACGGCCACGA 
TCCATTGATT TACCAAAGAT CGATGTAAAT GAAGCGATTG CACATCTGCA AAGGGCCTTG
ACGACAAAGC AACGCCTGAT GGCCTTGCAT CACGTTCAAA ATCTGATCGA GGGCGATCCG
ACGGCAATTT CTTGGATGGT AGATTCTGGT CTGATACGCA TTCTGCAACT GCAGCTTAGT
TATGCTCTTC AACGGCATGG ATCAACAAGT CAAGAGCTCG GAACACTTTG TCAAGTTTTT
GATCTTGCGC TCCGGACTTC GCCCGCTGGT TCGCTGGAGA GTGCTCTCGA CAAAGAAGCA
GGGCGATCCC TGGTAAACCT TGTTGCCGAT GCTTTCCCAT GGGGATTCCA CCATGTCATC
GTGTCAATTT TGCACACTAT TTCGCAAACG AGCTCTGGAG CTTTTCTGAT CCTTCACTGC
AGCAAAGCAA TGCATTGTGT TACTGAACTT TTCCGATGCT GTCGTGCGTC ATCCACCAGC
AAAGAAGCTG TCTTCGAGGC CCTGGGATTG CTTAAGAACC TAACCTACTT TTCGGAAGAA
TCACGTAATA TATTGCTTGA CTTACCAGGA ATCGTCGGGT CACTAGCGAA CGTGGCTGTA
TTTGTTGATC AAAAGGGTCA CGAGAGATTG TCAGCTATCT GGCGCAACCT CTCCGTATCA
ATGGAGACAC GACGGCGTTT GGCGCAGGAT CCTGATGTAT TAAACGGCCT TCTAGAGCTG
GCTGATTGTA CCTGCTCTTA CGCCCTACGA AATTTGCTAA ACACAACAAT CAGTCTTTCA
ATGGACCCAG AATCATGCGT GATACTTGTG TTGCACGGAG ACGGAATCTT TGTAAACGTC
TTGCGGAGAC TGCTAGTCAC CGAGACAGAT GCGCTCATTC GAAAGCGTGC AGCACGCGTC
ATTAAGCTTT GGGCTTCAAA CGATTTTGTC GGTCCAATAC TAGTCAAAGA CAGAGCGTTG
ATGGACGTTC TGTCTCAGCA GGCTTTGCAA GACCAAAACG TAGATGTTCG CCACGAAGCA
GCTGACGCAT TCTGCCGGTG TTCTCAAAGA ATCCAGTCAC CAATGCCTCA GCACCAACTG
GTGCTAGATG CGATCATGTT TCTGGCAGAG CAGTCAAGAT TACCCGCTGA AGTGCTAGCA
CGAACTCTGA AAGCGCAAGC ACTTCATCCC AGAAATCGCA TTCCAATGGC TGAGCGCAAT
TCGCTACTTT CTGCACTAGC TCGCATTGCT CAGCAAGAAG GTGTCCCCAA TTCAGCTCGC
GAAGATGCAT GCTGTGCTTT GGCTTATCTG TCAGACGAAG CCGCCAACCT GCCAAAGCTA
TCAACCGCTG GTATCGTTGA AGCAGTCACG GTCAATGCGA TTGGCGGTCG TGGCCTCAGA
AGATCTTATG CAGTGCAAAC TATTGTAAAT CTCACCAGTA CAGCAGAGAA TCTTCCCAAG
CTAGCTACAC ATACAAATCT TCTTCAGGCT CTGATACAAT TTGCGGCTAC TTCAATAGAA
GACCAACTCA AGTCTAA
 
Protein sequence
MVAKAMPKLR MDGSPLKRPR SIDLPKIDVN EAIAHLQRAL TTKQRLMALH HVQNLIEGDP 
TAISWMVDSG LIRILQLQLS YALQRHGSTS QELGTLCQVF DLALRTSPAG SLESALDKEA
GRSLVNLVAD AFPWGFHHVI VSILHTISQT SSGAFLILHC SKAMHCVTEL FRCCRASSTS
KEAVFEALGL LKNLTYFSEE SRNILLDLPG IVGSLANVAV FVDQKGHERL SAIWRNLSVS
METRRRLAQD PDVLNGLLEL ADCTCSYALR NLLNTTISLS MDPESCVILV LHGDGIFVNV
LRRLLVTETD ALIRKRAARV IKLWASNDFV GPILVKDRAL MDVLSQQALQ DQNVDVRHEA
ADAFCRCSQR IQSPMPQHQL VLDAIMFLAE QSRLPAEVLA RTLKAQALHP RNRIPMAERN
SLLSALARIA QQEGVPNSAR EDACCALAYL SDEAANLPKL STAGIVEAVT YSRESSQASY
TYKSSSGSDT ICGYFNRRPT QV