Gene PHATRDRAFT_43772 details


Gene Information

Locus tag: PHATRDRAFT_43772
Symbol:
ID: 7197282
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011670
Strand:
Start bp: 1431877
End bp: 1435168
Gene Length: 3292 bp
Protein Length: 1013 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002177831
Protein GI: 219112159
COG category:
COG ID:
TIGRFAM ID:
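
The gene length listed above is consistent with the start and end coordinates if both are read as 1-based and inclusive. That reading is an assumption, since the record does not state its coordinate convention. A minimal sketch of the check, with the helper name `span_length` chosen only for illustration:

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and the interval is inclusive.
start_bp = 1431877
end_bp = 1435168


def span_length(start: int, end: int) -> int:
    """Return the length of a 1-based, inclusive coordinate interval."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


gene_length = span_length(start_bp, end_bp)
print(gene_length)  # compare against the "Gene Length" field
```

Under a 0-based, half-open convention the same coordinates would give a length one base shorter, so the agreement with the listed 3292 bp supports the inclusive reading.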


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.400632
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CCTCTATCGA GAACGTTTCG ATCGCTTCCT AGCTCTAGAT TAAAGAGCTT AACATGGATA 
CTCGCCACCT TAAGTACAGC TTTTGGACCG GAGGAAATTT CAGCAATAGC AACAATAGCA
CTGACTTTGA TTCGAATTTG GAACGCCCTC AAGATGTTTC CGTGGATTCC GTCTTGACAT
CGTTGTACTT CAATAGCATT GTATTTGTCA TGCTCATGGC CAGTTATGAA ATCTTACGTC
GTGTGTTCCC TGCTGTATAC TCATCCCGTA AACGAATTTC ACACGCTCGT CCCGATACAC
AGAATGGTCA TCGACCGGAA GCTCCCTTGC ACGAGGATGC CACAGTGCCC GACCCGAATG
GCACCGACTA TCCCAAAATT CATCACGAGC GTCATGCTTC GTTGACTTCT CTGCCGGACG
ATCGACCACT CGACTGGCTT GGGCCCGTAT TTGGCGTGCC CTGGAGCAAG GTTCGTCGCA
TTGCGGGCTT GGATGGGTAC TTCTTTCTAC GCTATATACG TATGAATGTG AGAATTACGG
CCGTTTCCAC GTTTTGGTTC TTTCTGATAC TGGTCCCTAT TTACGCGACG GGTAGTTCAA
AGGAACACTC GGCGGAGGGA TGGTATCATC TGTCGGCGGC GAATATTCCA AGAGACGGTT
GGCGTATGTG GATACCTTGC CTTTTCGCGT ACTTGTTCAG CGCCTTTGTT TGCTTTGTTG
TCAAGCAGGA GTATCGGCAC TTTCTGGACC TCAGACAAGA CTTTTTGGCG AGAGGAAACA
TGCACGTCGA TCCGCAACAC CATCATTCCT TGGAAATTGA AAATATTCCC TACGAATTAC
GATCGGATCG TGCCTTGAAA GAATATTTCG AAAAGATGTT TCCCGGACGC GTTCATTCAG
CCAGTGTCGT TCTCAATCTA CCGGAACTGG AGGACGCCTC CGTCAGATGT ATGAGAACAT
GTCGTCGTCT CGAAAAGAGT ATTGCTTTTT TGCACGCAAC GGGTAGCCGG CCCACTCACG
TTGTTGGCCG TGGCCGAATA TCTTGCTTGG GAATCGAGCT ACAACCGTTG GACTGCAATT
GCACAGCGAG TCAGGAAACC TTGTTCGTCG AGAACGATAT GCGAGCAGAA CGACCGAAGA
GAGGAACTCG TGTCGACTCA ATTTCGTACT ACGCACAAGA ACTGGCAGCC GATAGTCGAT
CGTTATTTCA GATGCAAAAA CGCAAATCAC GGATTGCGGA ATCTGGAAAT CAGTTAAAAC
AGGTGGACAA CTGGTTGGAC AAGGCGGTCC GCCAAGCATC AGAGGTTGCG AATACAATTT
TGGAAGACTC AATAAAGGAC AACCATTTGA CTTCGCCGTA CGAGAGCTTT GACGAAGGTG
GAACAGTACC TCCAGCCGAG AGCATGACTT CCAGGTATGG GTCCTTCAGT CAGGCAATCA
GCCATCGAGC AATATACGGA CGGAGCAAAT TTGACAAAAT TCCTGAGGAA AGAAAGGAAC
CTCTTGTTTG TGATGATGAA ATGGTACGCA GAGATCGCAA CCCCTCGACT TTATCGCATA
TTTTGACTTA CCTGAATGAT CTGCGTCAAT TTACAGTCAG ACTCGACCGA AGACTCTGAT
TTTCCAATCC CCTTTAGTAG AGATAGCTAT CAAAACAGGT GGAGACGTTG GGCAGGTCGG
TTAGGTTTAG ACTTTGCCAT TGCCGGTCTT AAACTTGTTA ATAAGCAGCT CGACGTTGCA
CTTGAAGAAG TCGTCGGAGC TACAATGTCT TCTACTGGGT ATGTCACGTT CTTGGATCTT
TCCTCGACAA CATGCGCGGC GAGTGCACCA CTGACGGTGA AGGCCAATGT TCTCGATGTA
TCTGTTGCTC CCGAACCTAG GGATATCATT TGGAAAAATG CTCATATTTC CAAGAGATCA
CAGTTGAGAC GTGGCAATTT CACGAACTTC TTTCTATTTC TTGGCGTTAT TCTATGGAGT
TTCCCTCTGG CTGCTATTCA AGCTTTCGCA AAAGCTGAGT TTTTGGCACA AATTCCTGGA
ATGGAATGGA TTTTAACTTT TCATGGGGGA ACTTTTACAA ACTTTATGAA CGGCTACCTT
CCAGTGGTGG CCCTTTTGTG TCTGATCCTT ATACTTCCGT TGATTTTTGA GTATGTGGCT
GTGAGTTACG AGCATCGCAA GACTTATTCG GATGTTCAAT CATCAATGCT GAGCCGTTAC
TTCTATTATC AGCTTGCCAA CATCTATGTG TCTGTGACTG CAGGATCAAT TCTGAAGTCT
CTTTCGGACA TTCTTGACCA TCCATCGAAC ATTTTGCAAC TTTTAGGGGA CTCCTTGCCT
ACCATGGTCG GCTACTTTGA TGCTCTATTA GTCACAAAGA TTATGGCCGG TCTACCAATG
ATTTTCTTAA GGTTTGGTGC ATTGTCCCGT ATGCTTTTTT TGAAAACACT GTCAAACGAA
AAGAAAATGA CACAGCGTGA ACTCGATGCC GTGTATAGGC TGGAAAATGT CCAGTACGGG
TGGGAGTTTC CAACACAGCT TCTTGTGGTT GTGATAGTTT TTACGTATGC CATTATTTGC
CCCGTCATCC TCCCGTTTGG CTTGCTTTAC TTCCTCGGAG CACTTTTGGT GTACAAAAAG
CAAGTACTAT ACGTCTACAG TCCGGTATAC GAAAGCGGAG GTGCTATGTT TCCCGTTGTA
GTCCAGCGAA CGCTTTTCGG ATTGGTGTGC GGCCAGATGA CATTTATTGG ATATGTGGTA
ACACGAGGTT GTTACTATCA GCCCATTTGC TTATTCCCTT TACCTATTGG CACAATTTGG
GCAATGAACT TTTTCCGACA AAATTATGCA GATCCTAGCA CTCGGCTAAG TCTGGAACGG
GCCCGCGAAT GCGATCGGTT GTCCTCGTCT AAAGCGGCAA CGGAAGAGGA TGGATTGGAC
AGCAACATTG ACCGTGGCGT AGAATTGCGA AGAACGAAAT TCGATCGCAA GTCATACCGG
CAACCTGTCC TCACAGAGCT CGCCACGGAA CCAGAGTTCT ACCGCTCAGG CTTTCAAGAC
GACGAAACCT TCGCTGTAAG GAAACAGCTT CAACGAATTA ATCGATACAT CAAGGAAGCG
ACTTTGGAAC ACAATGATGG TCTCAAAGAT GCTTTGTTTC CAATATAGAA AAAAATTATC
ATTTCCGTGG GCCTCAAATA CTTTTTACAC CATTCGTTTA TGAGAGCGGA GACAGTGAGA
GTTAGCCTTT TCGGCCAAAG CGCTGTCTAA TCTATACTGT GGTCCGTCGA AT
 
Protein sequence
MDTRHLKYSF WTGGNFSNSN NSTDFDSNLE RPQDVSVDSV LTSLYFNSIV FVMLMASYEI 
LRRVFPAVYS SRKRISHARP DTQNGHRPEA PLHEDATVPD PNGTDYPKIH HERHASLTSL
PDDRPLDWLG PVFGVPWSKV RRIAGLDGYF FLRYIRMNVR ITAVSTFWFF LILVPIYATG
SSKEHSAEGW YHLSAANIPR DGWRMWIPCL FAYLFSAFVC FVVKQEYRHF LDLRQDFLAR
GNMHVDPQHH HSLEIENIPY ELRSDRALKE YFEKMFPGRV HSASVVLNLP ELEDASVRCM
RTCRRLEKSI AFLHATGSRP THVVGRGRIS CLGIELQPLD CNCTASQETL FVENDMRAER
PKRGTRVDSI SYYAQELAAD SRSLFQMQKR KSRIAESGNQ LKQVDNWLDK AVRQASEVAN
TILEDSIKDN HLTSPYESFD EGGTVPPAES MTSRYGSFSQ AISHRAIYGR SKFDKIPEER
KEPLVCDDEM SDSTEDSDFP IPFSRDSYQN RWRRWAGRLG LDFAIAGLKL VNKQLDVALE
EVVGATMSST GYVTFLDLSS TTCAASAPLT VKANVLDVSV APEPRDIIWK NAHISKRSQL
RRGNFTNFFL FLGVILWSFP LAAIQAFAKA EFLAQIPGME WILTFHGGTF TNFMNGYLPV
VALLCLILIL PLIFEYVAVS YEHRKTYSDV QSSMLSRYFY YQLANIYVSV TAGSILKSLS
DILDHPSNIL QLLGDSLPTM VGYFDALLVT KIMAGLPMIF LRFGALSRML FLKTLSNEKK
MTQRELDAVY RLENVQYGWE FPTQLLVVVI VFTYAIICPV ILPFGLLYFL GALLVYKKQV
LYVYSPVYES GGAMFPVVVQ RTLFGLVCGQ MTFIGYVVTR GCYYQPICLF PLPIGTIWAM
NFFRQNYADP STRLSLERAR ECDRLSSSKA ATEEDGLDSN IDRGVELRRT KFDRKSYRQP
VLTELATEPE FYRSGFQDDE TFAVRKQLQR INRYIKEATL EHNDGLKDAL FPI
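
The sequences above are printed in blocks of 10 characters, 60 per line, so the spaces and line breaks have to be stripped before lengths or base composition can be compared with the Gene Length, Protein Length and GC content fields. The following is a minimal sketch of that cleanup. The function names `clean_sequence` and `gc_fraction` are illustrative, and the record does not say how the database itself computes or rounds GC content.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence from the record, used here as sample input.
first_line = "CCTCTATCGA GAACGTTTCG ATCGCTTCCT AGCTCTAGAT TAAAGAGCTT AACATGGATA"
seq = clean_sequence(first_line)

print(len(seq))                       # number of bases on this line
print(f"{100 * gc_fraction(seq):.1f}% GC")
```

Pasting the full gene sequence block into `clean_sequence` instead of the single sample line gives the values to compare with the record's listed length and GC content.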