Gene PHATRDRAFT_47281 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47281 
Symbol 
ID7202366 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp172436 
End bp174601 
Gene Length2166 bp 
Protein Length690 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181672 
Protein GI219122687 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTCTCGACGC CATGTCGACA TACAGTTGAG TCGCTCCCAG GAGCTTTCTC TCTTTCAAAA 
CTACACACGT GCGTTGTGGT CCCCCAGCAC GAAATGGGGA AAGGCAAAAG ACCATCGATC
AGTCTCTTTG CTTTTTTGCA AGCGACCTTT TCGGTGGTTT CGATTCAAGC TTTTCAAATG
CCGACGACTT GTCGGACAAT GAGCACTGTC GTCTCGTATA GGGAGACTTA CCCCCAACAC
CCGTGTTTCT CGCAACAAAG ATATAACAGC CACATTTGCA TGGAGCCGGG GATTATTCAA
GTAGAAGAAC TTTCCGACCA AGACACCTTT TGTTCCTATA TCAACTGCAT GAAGGAAATT
GTCGAGAACC AGCGAGTTGT CAAAAAGTGC AGCCTCGACG AATGCAAGCC AGTACAAGAA
TACTTGTTTT CCTCCCAATC GTTGCTTTCA TTACCGGATA TCAGTGTGGA TGGATTTAAG
GATCGCCTTC GCGACCAGAA AAGGCAATTT TTAGAAGAGA CAGGCTTTTC GGAGGAGCAG
TATGAATATC TAGTCCGTTG TCTAGCTTAT CTTGGCGATC GCTGTGCGAA AATCCAACAA
ATTGCACCGG CACTCATTGC GTGGAAAAAG ATGAGAGAAA GTGGCATGAA ACCTCGAGAG
CGCTTCGTAA GCACATACAT GTACGTTCTG AGTCTTGACG AACGTAATGT GCAAGCGTGT
TTCGAAACAG CAACGTTTCA CGATGTACTA TTCGAACCTA CGGAGAAATC CATTTTCTTG
AGAATAAAGA CGTTAATCAG CCAGAGCGAT GCTAAAACTG CGGAATACAT TTTGTCGTCG
CTTCCAGATA GACAAGGAAA AAGCTCCGAA TGGAAGAAAC TTAGAACGTT CCTTCCTATT
CTCAGTCACT ATTGCGATGT GGGTGATATG ATATCCGCGT TGAACTTATT TCGTAAAATG
AGGAAATCTG AAGGTGTAAT TTTCGACTCG GAGACATATG GTCTTCTCAT CGGTGCTCTA
GCCAGACGCG GTTACTTCAT ACCCGGAGGT CTTTCTGTTG ATGGCGCTTC TACGCTTGGA
TCATTGGAAA TGGGAGGTCC TGCCCTTTTT GATACTATTT CTACTGAAAT GGCCGAGGAT
CTTCTCGAGC TGAGTCTAGA TGCGGCTGAA ACCATCCTGG ATGGATTCGT GTCTGGGTTT
GCAGGGGAGA ATTCTTATGA ATTCGATTGT GTCACTGATA AGTGTATAAA TGTCAGTCCA
TTGCTGACTA TAGGCCGTGT CACGGTGAAC GAAACCTCCG CCGTATGCCC AGCAACCGGT
GCCAATCTCC GTCTATTTGC TCTGACGGAA GAGCAACGGA TAAGTGTTCA TGATACCTTA
CTTGAAATGG CAGCGGTGCA ACACGAAGAC TTTAGCGAAA AGCTAAAAGC AAGAAACCAA
AACTTTAAAG GTAGAGGAGA CTCTGAGCGG GCACGGCGAA ACCTTTTTGA GTTTTCAGAG
TGGTTGCGTG AGCACGAAGG AGAACCATTT ACTGCAATTG TTGATGGAGC AAATGTTGCC
TATGCTGGAC ATGGCAATGT GCACTATAGC CAGGTCCAGC TCGTCGTGGA CAAACTTGAG
GACATGGGTG AGAAAGTGTT GGTCGTTATG CCTTCAAAAT ATGTTGGAGA AAAATTTTAC
GTGGCAGGTA TTGACTCCGT GCAACAACTT TCAGAGAGAG AAGTTGGCAT CATGAATGGT
CTGCTTGATG AGGGAAAAAT GTATCAAGTC CCAGCAGCAT GTCTTGATGA TTACTATTGG
ATGCTGGCTA GCGTTGCAAA TCAGACTGAG CATCAACTGC ACGTTTCGAT TGACAATAGA
CAGCGTCGAT TTCCAGGTCT CCGTCCCATG CTTGTCACAA ATGATCAAAT GCGCGATCAC
AAGCTAGCGC TTTTGGAAAA ACGGCTCTTT CGTCGATGGA CGAGTTGCCA CATTGTCAAC
TATGATCTTG AGTCCTATTC TGAAAATGAA TGGCAAGATA GGGATGTTCG CTTTGTTCCA
ACTGATTTCT TCAGCCGAGA GATTCAGGGC AACGAAGTAG GTGGGGAAAG AAGCAGCACG
GTATGGCACT ACCCTGTCAC AGGCTGGGAA GGCAGCGATA GACTTTGCAT AAATATTCGC
CGGTAA
 
Protein sequence
MGKGKRPSIS LFAFLQATFS VVSIQAFQMP TTCRTMSTVV SYRETYPQHP CFSQQRYNSH 
ICMEPGIIQV EELSDQDTFC SYINCMKEIV ENQRVVKKCS LDECKPVQEY LFSSQSLLSL
PDISVDGFKD RLRDQKRQFL EETGFSEEQY EYLVRCLAYL GDRCAKIQQI APALIAWKKM
RESGMKPRER FVSTYMYVLS LDERNVQACF ETATFHDVLF EPTEKSIFLR IKTLISQSDA
KTAEYILSSL PDRQGKSSEW KKLRTFLPIL SHYCDVGDMI SALNLFRKMR KSEGVIFDSE
TYGLLIGALA RRGYFIPGGL SVDGASTLGS LEMGGPALFD TISTEMAEDL LELSLDAAET
ILDGFVSGFA GENSYEFDCV TDKCINVSPL LTIGRVTVNE TSAVCPATGA NLRLFALTEE
QRISVHDTLL EMAAVQHEDF SEKLKARNQN FKGRGDSERA RRNLFEFSEW LREHEGEPFT
AIVDGANVAY AGHGNVHYSQ VQLVVDKLED MGEKVLVVMP SKYVGEKFYV AGIDSVQQLS
EREVGIMNGL LDEGKMYQVP AACLDDYYWM LASVANQTEH QLHVSIDNRQ RRFPGLRPML
VTNDQMRDHK LALLEKRLFR RWTSCHIVNY DLESYSENEW QDRDVRFVPT DFFSREIQGN
EVGGERSSTV WHYPVTGWEG SDRLCINIRR