Gene PHATRDRAFT_47869 details

Gene Information

Locus tag: PHATRDRAFT_47869
Symbol:
ID: 7202938
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011683
Strand:
Start bp: 296103
End bp: 298532
Gene Length: 2430 bp
Protein Length: 721 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002182197
Protein GI: 219123782
COG category:
COG ID:
TIGRFAM ID:
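
The reported gene length can be checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the values in this record; the function name `inclusive_length` is illustrative and not part of any database API. It also estimates the coding length from the protein length, assuming a single stop codon closes the reading frame.

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end are both 1-based and inclusive."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values copied from the Gene Information table of this record.
gene_length = inclusive_length(296103, 298532)

# Assumption: 721 codons for the protein plus one stop codon.
coding_nt = (721 + 1) * 3

print(gene_length)  # 2430, matching the reported "Gene Length"
print(coding_nt)    # 2166
```

Under these assumptions the coding sequence is shorter than the gene span, which fits a record that lists the gene as spliced.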


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
CACATTTTCG TTTACAGTTT ACTGTTACAC TGATTTCCCT GGTGAAAAGA ATTCTTCGAT 
GACGCAAATC TTGATCTTGA CTGCCCTCCT TGCCGTCATG GCAGAGGCTC TCACATTCTC
TCGCAGTGAC ATGACAGGGA ACAATTTTGC GGGCGATATT TTGACATTCA AAGATCGTCT
CTACGTCCCT TACGGTCCAG ATTTGATTTC TGATCCCCCA CAGACGGACC AACCTTGGAC
GGGATTTGGC TACGGATTGG GAGCAACGGA GCATTGGGCG TACGATCATA AGGAGAAGTA
CATTTATTCC CAAAGCGAAG CTGGTGGCTA CGTAACAATT ATTGATTACA ACGCTTTGCC
AGGAGTAGTG ACACCTTATA GTATGAATGT TGGTGGACGC AATGTTGACG TGCGAGATAT
TGTCGTCTGT TCAGAAGAGG GACTACTCTT TCTGACGCTT ACTGATCGAA GCAAGGTGCT
TATGTATGAA ACCGTGAAAC GGAGCTCCCC CGGAACCCCA ACGTTGCTCT CTGAGATTGA
TGCCGGAAAC TCTCCAGATG CCATGAAGTA AGTTTTTGAA GGATTTTAGC GTGTCACTTT
CAAGTTAACA CTCTGATCAT TACCTTGTGC TGAACCTGGT TGCAGGCTCT CGAACGATTG
CAGTATACTT GCAGTTGCAA ACCAGAACGA GGGAACCTCA GTTTTAAATC AAGGCGCTGT
CACTTTGGTG ACCAATTTTC GTTCAGCAAG TGGACCCGAA ACAAAAACTG TGCTGCTCAA
TACCTTTACG GATGAGTACC TGTTGGGTCG TGAAGTGCAC ATGCCGTTGA CACGAAATGC
CATGATATAC TGGAATGCCG AGCTCGGATT GGGCTGGGAT ACTCCAAATG GTCTGATTGA
TCAATACAAT CCAGCCCTTG CGTTCGACCC TGAATTCTTG GCGTTTAACA ACGACGGCAC
GGAGCTTTAT CTAAATCTTC AACAAAACTC TGCAATGGTC CGCATCAGTA CCGCTACCGG
TACTGCTTTG TCCGTTGATG GATATGGCCT TAAAGATCTC ACCGCCGGAT CCGGTGCTGA
TATTGTCAAA GACGGCGAGT GCAAGCTTGT GACCAATCCT TGTCTTTTCC TCGCACGTTC
ACCGGACGGT ATTGCGACCG TAGAGTACGA AGGCGTCAAC TATGTACTAT TGGCTGAGGA
GGGAAGTGAC TTTGATCTTG GTGACTATGA AGAAAAGGCT GACTCGAACG ATATCTTTCA
AGGCAATGGA ACTTTTGCGT ATTCTAATTT TACCTTCGAT GCGTCTTTCT TCGCGGAAGG
TGACTCCAGC GCTGGTTGCT CCGCTAATTT CAATGCGGAG TGTGAAAGCA ACGATCTTCC
TTGGTGCTCC AACTTTGAAC TTACAGTTGG ATCGTCAGCT GTCGACTATA CAGACCCAAC
TGCTCCCAAG ATGAACCGCA TCGTTGGGTT TGGAGGGCGA GGAATCTCAA TCTTTCGAGT
ACCATCTAAC GTTCAGCAGC AAATTACGAT GGTGTGGGAG TCCGGCTCCG AGTTTGAGGA
GCGTACCTGC GCCGACTTTC CGTGGGCGAA CAACGCTCTT ACGGACGAAG AGTTTGCTCC
CATTTGTACC GATTCAAACC AAGACTTTGA GTGCGCTCGT TGGATTCTTG TTTCCAACGA
CGATCGGGAG GGCATCAACG AAAGGTAAGT CTGTCGCATC TAAACGCCGA GCATTTCTTG
CGAAGAGTAC AGCTATGTCA TTCTCACCGC GGGTTGAAAA CACGAATCAG AAATGATCCT
CTTGGCGACG GATGTACCTT TAACAATGGT AGCACTGGTG CATGTCCGAT GGGATCAACA
GTGGATACAA AGGCGCAGCA AGACGGCCTA GGCGTCGAAA CAGTTGTTGT CGGAATTGCA
TGTGACCATC TCGTTGCTCT GGGTTGTGGC GAGAACAATG CGATGTGCTT TCTATACGAC
ATATCCGACA TTGAGTCTCC GGTCCATCTC AAAACTTTCA ACTTGAGCCC GTCATCTCGC
AATAGAAACC CCGAACAGTC TTATCTCGAC GATCTTGGTG ATATTGATGC TGAAACGATC
CAGTTTATTT ATCCCGGCCA GAGCCCTACC GGAAAGTCTG GATTTATATT TGGCGGTGCC
ATTAGTGGTA CCCTCTCTTT CTGGGAGTTT GAGTGCGCTA GCGAAGAAAC CGCTCAAAGC
GGCTCTGGTG GTGGGCAGAG CCAAGAGTTA AGTGACAGCG ACGAGAGTTT GGAAGGCGGG
GCGATCGCAG GAATAGTGAT TGGATCGGTT GTCGGCTTGG CTTTGCTTGC TGTCATTGCC
TTGAGGGCCA TGGGAGGAAA CAAGAAAGAA ATAGACACGG GCAAAACAGG CAGTAGCGAC
CATACCGAGA CCGTAGATGG TCTAGCTTAA
 
Protein sequence
MTQILILTAL LAVMAEALTF SRSDMTGNNF AGDILTFKDR LYVPYGPDLI SDPPQTDQPW 
TGFGYGLGAT EHWAYDHKEK YIYSQSEAGG YVTIIDYNAL PGVVTPYSMN VGGRNVDVRD
IVVCSEEGLL FLTLTDRSKV LMYETVKRSS PGTPTLLSEI DAGNSPDAMK LSNDCSILAV
ANQNEGTSVL NQGAVTLVTN FRSASGPETK TVLLNTFTDE YLLGREVHMP LTRNAMIYWN
AELGLGWDTP NGLIDQYNPA LAFDPEFLAF NNDGTELYLN LQQNSAMVRI STATGTALSV
DGYGLKDLTA GSGADIVKDG ECKLVTNPCL FLARSPDGIA TVEYEGVNYV LLAEEGSDFD
LGDYEEKADS NDIFQGNGTF AYSNFTFDAS FFAEGDSSAG CSANFNAECE SNDLPWCSNF
ELTVGSSAVD YTDPTAPKMN RIVGFGGRGI SIFRVPSNVQ QQITMVWESG SEFEERTCAD
FPWANNALTD EEFAPICTDS NQDFECARWI LVSNDDREGI NESTGACPMG STVDTKAQQD
GLGVETVVVG IACDHLVALG CGENNAMCFL YDISDIESPV HLKTFNLSPS SRNRNPEQSY
LDDLGDIDAE TIQFIYPGQS PTGKSGFIFG GAISGTLSFW EFECASEETA QSGSGGGQSQ
ELSDSDESLE GGAIAGIVIG SVVGLALLAV IALRAMGGNK KEIDTGKTGS SDHTETVDGL
A