Gene PHATRDRAFT_49851 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49851 
Symbol 
ID7198673 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011693 
Strand
Start bp91596 
End bp94601 
Gene Length3006 bp 
Protein Length703 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184645 
Protein GI219128912 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.165107 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCATTCC GGCAACACAA TCCCAATGTG ATCACTCCTC CCAACCCTTC TGTTTCTAAC 
AAGAAGGATG TGGGGGTGAT CGGTGAGAAG GAGCCGACTG TGCGATCGGA ATTCATCGAC
AAGGAAAACA TCCCTCCCTC TTTTTCTCCT CCTATGCCTC CTAGCGTCCA CGGTTTCCAC
GACGTCGAGA TGGAACGGCT CGTTCGAGGT GTTAATCGCT TCGGATTGAA CCATGGTAGC
GATAATCCCA GTCGGGTCGA AGGATCCATC CTTGGGTTAG CTCCGGCCTA TCCTCCCGCC
GTCGTACGAC TAACCTCCCT CAAAAAAGCA AACTCGGCTG CGGAGTCGTC TGAGCCCTTG
GCGCAGCATC GCTTCAACCG TTCTCTCTGG AACCGAGCTG GCTCCAACAC AACTCAGCAG
GACAGTATCA AAAAGCTTGA TGTGCTCGGT CGTGGTATCC GTTCACCGAA GAGGATTCCT
TTCGGAAACG TACACCACGG CGGCTGTGAC AGCGCACGCA ATGACGTCAA GAGCGCCGAA
ATTCCTCCTC CACGAGCCGA CTTTGCTAGC ATCGTCCGAT GCAACGCCCC CCAGAGTCCA
AACACTTTGC AAGATCAACA CCATCAACAT TTGCTCCTCA AAGAAGCCGC TCCCAAGGAT
CCGTCGATTC CCGCTGCTGT GGCGGATGCC ATCAAAGCGA GTCGGGAATG CCGTGGCACC
ACGAAAGTCG CTACGGTGGA AAAAGCGGAG CTTTATCTTC GCCATGCTAT CGCCGAATAC
CATGCTGGAC GCATCACGGA GGCTCCGAAT GGATCGTGCT ACAACAACAT TGTACATGGC
TACGCCGATC TCAAGGAACC GGCAAAGGCC GAAGCCATTT TGCATCTCAT GTGGTCCGAT
TTTCAACAAG GCAACGAGGT AACTTGAACC GCTTGTACTC GAGTGCTCAT GGCTCTTACA
ATACCCTCTT CTAACCAATT TCTTTCTTCG CACAAGTTGG CAGAGCCCAA TGTCCGCATC
TATACCAGTG TCTTGTATGC CTGGGAAAAA TCGAAAAAGG AAAGTGCTCC AGAACGCTGT
GAAGCCATCC TTCAGCAAAT GCATCGCCTC CACGATTCGG GAATTGCCAA ATTATGCAAA
CCAGATCTTT ACGCGTATAC TGTGTGTCTT CATACGTGGG CCGACTCAAA ACGCCCTGAC
GCTCCGAAGC GAGCCGAGCA ACTCTTTCGC AAGATGAAAG ACCGCTATCA TAACGGTGAC
ACAGAACTCC AACCAGACTC AGTGTGCTAC GTCAATCTCT TAAACGCTTA CGCTAACTCC
GCAATTGAAT ACGCCCATAC TGAGGACTTA CTCTGGGAAA TGGTGGATGA CTTTATTGCC
GGTAATGAAA GTGCCAAACC CATTATTCGC AACTTCAATA CCGTTTTAGC GGTGTGGTCT
AAATCTGGTT CAGCAGAGGC TCCGGAACGC TCCGAAGCGA TAATCCGACG CTTGCATGAG
TTGAACAAAT GGGGAGCCTT GGACACCAAA TCTGACCAGT ACACATACTC GCTGCTTCTA
AAGACTTGGT AAGTTTTCCC ACGGTAGTTT GTCTGAGCAT GACATGGCAC CTCACCAACA
CAACCTATTC CCCATTAGGA CGACGTGTAA TCGTCACAAT TCTGCTCAGG AAGCCGAACA
GGCTTTGTAT TGGATGGAAA GCCTTCACAG TGAAGGTGAC CAAGGGGCGC GTCTTGACGT
CATTAAATAC ACAACAGTTA TCAGTGCTCT CGCACGTTCC GGCAATCCAG GCAGCGCCGA
GACTCTTCTA GAGAAGATGC TCGAAGATTA TCAAAAGGGA AATTCGAAGG CCAAGCCCGA
TGCGAAGTCA TTTAACATGG TTCTCTCTGG ATGGTCTCGT TACCACAACG CAACGGTCGC
CGCTGCTCGT GCCCAAGCTC TCCTGCATCG TATGTGGAAC TACAGAGCGG TCCATATCGC
CCCGGACACG TGGTCCTATA ATACCGTTTT GTTCTGCTGG AAGAACGCGA ACGGTCCTAA
ACAGGGGGAA TCGCTGCTAC TCGACATGGA CCGCATGGCA GCAAAGGGGT TTGCCAAAGC
TCGTCCTAAT AGTACAAGCT TTCAGGCCGT CATCGACTCT TGGAAGAAAT CGAACTTTCC
CTTCAAGCAC CAGCACATTC ATCAGCTTCA GGAGGAGTCT CAGAAGCGCT TTGGTGAAAG
TGCAAAGTCC AAAAACATGG ACTCCCGCAG CTTCAAATGA GTAGCGCGGA TACAATTGAG
AAGAGCCGTC AAGGTTATGT TGCTCAGAAC CACACACCTT CCAACACCTA TCCGCTACGG
GCTCGAATAG CATAGTTCTG CTTTCTGTAT CGCCCTAAAT CTCAACGTCC AATCTCACTG
GTAGTAATAT CTGCAACCCT TAAAGACCGC GTGGTCAATT TGTCCTCTGT ATGTTTTAAC
GGAAATTGCA TTCGGTCTCA TTTGCGAGAG TATATTCGCG AACCCAAAGT TTCAATGGCA
ATCCGTACTA TACGCGTCTC TCGGCGACAG GAGGAATAGC GAAGCGAGCT AGAAAAGTGT
TTCGACAACA GAAGAAATGT CGGATTGTGC GGCTACAAGT AAGCTTTTGC GTTTCATAGG
TTGTCGTAAA CAGCTTTGGG AATAACATAA ACACTTGATG CTTCGCAGAA AATCGTCCCG
GTTTCGGGAC ACTCCACAAC CACCTTCCAG ACCAGTTTTC GCCGCACTCG TTCCACGAGG
TGACATCGTA TATAAAGATG TGTATGAGCG GGAACGAATT TTCGGTAGTC GATCGATAAA
TTGGCAGTAT ACGCCATTGG CACATCCCCC AAGGCTTCGT AACCGGCACC TAAAACGTCG
TCTATAAGCA GCGCCAGGAT ACCACCATGC ACAACCCCCG GGTGGCCGTT CACGTGCGAT
CCAATGAGGA CCGATGCCAC CACAACGTTC TCACGGACAG TGTTGCCTTT TGGAGCATCC
AATGCG
 
Protein sequence
MAFRQHNPNV ITPPNPSVSN KKDVGVIGEK EPTVRSEFID KENIPPSFSP PMPPSVHGFH 
DVEMERLVRG VNRFGLNHGS DNPSRVEGSI LGLAPAYPPA VVRLTSLKKA NSAAESSEPL
AQHRFNRSLW NRAGSNTTQQ DSIKKLDVLG RGIRSPKRIP FGNVHHGGCD SARNDVKSAE
IPPPRADFAS IVRCNAPQSP NTLQDQHHQH LLLKEAAPKD PSIPAAVADA IKASRECRGT
TKVATVEKAE LYLRHAIAEY HAGRITEAPN GSCYNNIVHG YADLKEPAKA EAILHLMWSD
FQQGNELAEP NVRIYTSVLY AWEKSKKESA PERCEAILQQ MHRLHDSGIA KLCKPDLYAY
TVCLHTWADS KRPDAPKRAE QLFRKMKDRY HNGDTELQPD SVCYVNLLNA YANSAIEYAH
TEDLLWEMVD DFIAGNESAK PIIRNFNTVL AVWSKSGSAE APERSEAIIR RLHELNKWGA
LDTKSDQYTY SLLLKTWTTC NRHNSAQEAE QALYWMESLH SEGDQGARLD VIKYTTVISA
LARSGNPGSA ETLLEKMLED YQKGNSKAKP DAKSFNMVLS GWSRYHNATV AAARAQALLH
RMWNYRAVHI APDTWSYNTV LFCWKNANGP KQGESLLLDM DRMAAKGFAK ARPNSTSFQA
VIDSWKKSNF PFKHQHIHQL QEESQKRFGE SAKSKNMDSR SFK