Gene PHATRDRAFT_45518 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_45518 
Symbol 
ID7200597 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011675 
Strand
Start bp422181 
End bp425391 
Gene Length3211 bp 
Protein Length918 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179642 
Protein GI219117704 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGTTGCAAAA CATGAAGCTC TCCATCGGAG GATGGGAATG GAAGAAGAAA GAAGGGACCT 
ACAAGAAGGA AGTGTCAAAC AAGGAAGGAA ACAAACTCAC CAACACATCG TCAGCCACCG
AAAGAATTCT AGAGAGAAAT AGCATTATAC AATGAGCTTC TTCTGGACTT CCTCGATCAC
AAACGAACCT GAGAATTCCT TGGATACCGG TGTCACTGTC AGCAATTCGA GACCATCGTC
ATCGCTGCAA CAGCAGAACA AGAATATAAA GGAACGATTT GGAGGAATCG AGACAGAATC
CCGTCAAGTT CCATTCTTGC CTGACGAAAA CTTGACTATT GAATCAATAT ACGACTTTTA
TGTGCATACA CTATTCCCTG CTGTGATGAG TTCTTTGCCT TTATTGGGTC GTCGCCTTTT
CCATGACACG CAATCGGTAC TTCATGATAG TTTTCAACGT TTTGTAAACT TTATTCTTCC
GGACTATTGG ACGTCTTCCT CCTCATACTC ACATCCGCAG CTACCAAGCT TTCGGCGGAT
TTGGGAAGAA AACGCCGCTG ACCTGTCCGT TTCCGACTTT TTGCTCCGAC ATTTCGATAC
CAACCACGAC GGCAAGATAT CTCCCGAGGA ACTCTTTCAC GTGGACGAGA TGCTACTCTC
TCGCATACTA CCTCGCTCAG ACGAAAGTTG GTGGAGCTGG TTCTCCCGCG AATGGCCCCT
GTTGGATTGG AAAGTGGGAC TTTTCTTGTG GCGCACTTTT GGTGGCGTGC TCTTAACCCT
GGCCGTTTTA TCGATTGTAC CGGGACGGTT GCATGGCTGG TCGGCTCGTG TCTTGCGATG
GCCCATTCTC GGACTCACTT ACTTTCTCAT CGCCGTTGAA CTTATGGTCT ATATCGTAAT
TCGCATTTTC ATTCGGATTG CCGAATACAT AATCGCACGG CCCAAGCATC GAGCACTGCG
CCAACAAATG GCTAAATCCC AGTCCTATGA AGAGTGGTAC GGCTACGCAG AAGATTTGGA
CAGATCACAG AAACGAGATG TATGGATTCG ACGAATCAAA GATCAAACAT CTTTTCACTA
CAATTGGAGC TTTGTTCAGG ACCTCATTCG GGATTTGCGT AACGCTCGCG AGCAAGGCGA
CTCCATACTA GCGCTTGCCG TTATTCAACA ATGCACGCGG AAAAATGTAG GCGGTATTAT
GAGCGAAGAT TTATTTTCAT ATACGAATAC GGGGGAACCG AAAGCAGTCG TACGGGAGTT
CATCGAGGAA GCTGTCCAAA CCTTGCACTG GCTCACTGAT GAAGCATTGC ATATTCCAGT
AGATGATTCC GCACAACAGC AAAGCAACGA AAAACGCACC TATGAAGAGC AAATGGAAGA
AACTGTTCAA GCCGAAAAGG ACAAAATATG GAAATCATTG ACCAATTTGT TTGGTAATGG
CGAAGATGGA AAACAAAATG ACGATGTCAA GACGAAAGAT GGAAATGAAA CGCATTCGCC
GGAATCCGCC CCCCACCAAG ACATAGAAAG TCGGACGGAT AATCCTTTGC CATCCTTTCA
CCGACAAGAG GTTTTAACGT TTTTGAAGCG GGCTCGAGCA GCTTACGGTC GAACCGCTTT
GTGCCTGAGC GGTGGAGCGA TGATGGGAGT TTATCATTTT GGACACGTTC GGGCTCTACT
GGAAACTGGT TCTCTGCCCA ATATTGTACG TACCGGGATC GTGTTTTTAG ATCACACACT
CGTTACAAAT AGGAAACTTT GTATTTATGT GCGACAACGC CTGATGTTTG TTTGCTTCTT
TTTTGTGAAA AGATTTCAGG AACTAGCGCA GGAAGCATTA TTGGTGCGAT TTTGTGCACC
AGGAAGGACG ATGAATTGGA TCGTGATTTA CGCCCAGAAA TTTTAGTACA TAAGCTTACG
TGCTTTTCGC GTCCTTGGAG GGAGAGGATC TCTAGCCTTG TCAAAACCGG GAGCATGTTT
GATGTCGATG AATGGTTGGA GCTACTGAAA TGGTGAGTTA CTTGCTGCTG CTTGTTTGCA
TCCCTATACG TCAGGTCGTT CTTGTCTCAC CGTCATGTTT ATAAGGTTTT GTCGGGGTGA
TATGACATTT GCTGAAGCAT ATCGTTTGAC AGGGCGCGTT TTTTGTATTA CTCTATCCCC
TACTACGAAG AAGGCGCCGC CGGTGTTGAT AAATTATTTG TCTGCGCCCA ACGTTACGAT
AGCCTCTGCT GTCGTAGCGA GTGCAGCTGT ACCTGGGTTT GTTGCGCCTG TTCGCTTGCG
TATCAAGGAC ACCAATGGCG TAGTTCAGAG AGGCGGAGCA AAGGACGAAG CATACTTTGA
TGGATCGATC AAACAAGACA TTCCGACTAC AGGATTAGCG GAGATGCTCA ATTGCCAGTT
TTTTGTCACG GCTCAATGCA ATCCACATAT TGTTCCAATG TTTTACAACA GTAAAGGCGG
AGTTGGGCGT CCCAGCCGAT GGTCCAGTGG AGCACAAGAA GATTCCTGGA GAGGCGGGTT
CCTACTGGCC GCGTTGGAGA TGTATCTCAA GAATGACATG AAAGCCAAGT TTGTTTTTCT
TCGTGATCTA GAGGCGGCTG TTGGCTTTAC ATCGGAGCTA CTGACGCAAG ACTTTGTAGG
CACGACAACC ATCGTACCTC AAGTTTCCTT TAAGGATTAT TTCGGAGTAA GGACAAGCGC
AAATGAGATG TGTTGCCCAA ACAATCGCAT GAAAAACTAA CTGCTTCTTT CTGTTCGTTC
GTTCGCAGCT CTTTGAGAAT CCGTCTTTGG AGCAACTCCA GCGGTGCTGT CATGCAGGAT
CTGTTGCTGC GTACGAACAT ACTGTTATGA TCCAGATGCA TTACAGCATT TCGGATGCAC
TGGAGGAATG CATTGCAAAG CTTGAAACCA ATAAGCGCAA AGTACATATT CGGCGACGTA
CAAAGCTAGG ATCGGCGAGT ATGACGAGAG GCGATCCAAA GGGAGGGGTC GTGGAGGAAT
CAACGGAAGC GCGAATCCAG CCAACAGTGG AAAATACCCA GACGTTTCTG GTCGGTGGGT
TGACATCAGA CGGACTGAAA GTACGTACAG CGTTCAATGA ATCATCGGAT GATACAGATC
GAGAATCGGA GTACGATGAG TTCGAAGCTG ATTGGACGGA CCTTAAGTAA AAATTGCATC
GGCTTTAAAA TACAAAGAAA GTTGAACTGT T
 
Protein sequence
MSFFWTSSIT NEPENSLDTG VTVSNSRPSS SLQQQNKNIK ERFGGIETES RQVPFLPDEN 
LTIESIYDFY VHTLFPAVMS SLPLLGRRLF HDTQSVLHDS FQRFVNFILP DYWTSSSSYS
HPQLPSFRRI WEENAADLSV SDFLLRHFDT NHDGKISPEE LFHVDEMLLS RILPRSDESW
WSWFSREWPL LDWKVGLFLW RTFGGVLLTL AVLSIVPGRL HGWSARVLRW PILGLTYFLI
AVELMVYIVI RIFIRIAEYI IARPKHRALR QQMAKSQSYE EWYGYAEDLD RSQKRDVWIR
RIKDQTSFHY NWSFVQDLIR DLRNAREQGD SILALAVIQQ CTRKNVGGIM SEDLFSYTNT
GEPKAVVREF IEEAVQTLHW LTDEALHIPV DDSAQQQSNE KRTYEEQMEE TVQAEKDKIW
KSLTNLFGNG EDGKQNDDVK TKDGNETHSP ESAPHQDIES RTDNPLPSFH RQEVLTFLKR
ARAAYGRTAL CLSGGAMMGV YHFGHVRALL ETGSLPNIIS GTSAGSIIGA ILCTRKDDEL
DRDLRPEILV HKLTCFSRPW RERISSLVKT GSMFDVDEWL ELLKWFCRGD MTFAEAYRLT
GRVFCITLSP TTKKAPPVLI NYLSAPNVTI ASAVVASAAV PGFVAPVRLR IKDTNGVVQR
GGAKDEAYFD GSIKQDIPTT GLAEMLNCQF FVTAQCNPHI VPMFYNSKGG VGRPSRWSSG
AQEDSWRGGF LLAALEMYLK NDMKAKFVFL RDLEAAVGFT SELLTQDFVG TTTIVPQVSF
KDYFGLFENP SLEQLQRCCH AGSVAAYEHT VMIQMHYSIS DALEECIAKL ETNKRKVHIR
RRTKLGSASM TRGDPKGGVV EESTEARIQP TVENTQTFLV GGLTSDGLKV RTAFNESSDD
TDRESEYDEF EADWTDLK