Gene PHATRDRAFT_44942 details


Gene Information

Locus tag: PHATRDRAFT_44942
Symbol:
ID: 7199841
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011673
Strand:
Start bp: 707669
End bp: 709485
Gene Length: 1817 bp
Protein Length: 556 aa
Translation table:
GC content: 55%
IMG OID:
Product: predicted protein
Protein accession: XP_002178831
Protein GI: 219116072
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GGACATTTTC GTTAATTCCC AAAAAAGCAG CATTGACGAT ACTATACTAG CGTTGTCTAT 
TGTCTCTGGA AATTTTATTC CAACACCAGA ATATCACCCT CCTCAAGAAC GGAGTCATAC
ACGTCACGAC AGTGCGACTA GGAAATCATG GCTACGATCA ACGCATCTAC CAAAATCAGC
GTTCCGAAAA AGAAGCCCTC CATCGCGGTG GGCGCGGCCA AATCGGTCAT CGGGGCCAAG
AAACCCAGCG GCACCGCGGC GCGCCGCGGC TCCGTGGCCC CGCCACTGAA ACCCGCGACA
CTCTCGAACG CGTCCAATTT GGCGCGTTTG CAGGAGTCGG ACTCGGCGGG ACAAATTGCT
CACGTGCGAG ACTTGTTCCG ACAAGCGCTG GGGGAACACA CCACGTTGAA TTTGGGGTCC
CCCAAAGCCG CCAGTACTAC TCGCGATCGG GCGGGAGCTG CCGCCGATGT TGCCGTTGCT
GCTAAGACTC TGGGCGTGCG CTGGATACTC AAAAAGTGCG GCATTGTTGA TGAAATGCAG
CGTATGCTGT TTCCCGGCGG AATTGAAGAG TTCTTGGCGC GCAATGCGGA CAACGAAAGT
GAAATGAACG GGAATGGTAG CACCCCCATG ACGGGAGGTC TCAAAGCCAG TGCTTCGGCG
GTAAGTCTCG CGAGTATGGA CGAAGTGACG ACCGTCACCT CGGCTAATTC GCTAGGGACG
GATACGAAAC GTGGAAAAAC GACACCGGCC AACGCCCGTG AAGGGTGTTT ATTGGTGATT
CGAGCCTTGT GCCAAATTGT CGGTAAGGCG GCGGAATCGT TCGTGGTAGG GGCCTTTTTG
GCCGCCGCTT TGGATGAATG CGCGAGTTCG TCCGGTGCCA TTCGGGAAGC GGCCGAGGAC
GCGTCGACGG CGATTGTAGC CTTGGCCAAT CCATGGGCCT TTCGCACCGT CCTGTGTCCG
TTGCTGCTGC AATCGCTTAA GTCAACCGAA TGGCGGACCA AGGCGTGCAC GTTGGAACGA
TTGGAGCAGT GTGCCTCGAC CGCATCCGCA CAAGTGTACA AAATGATTCC TACCCTGATT
CCTGCCGTGG GGAACCAAGT GTGGGATACC AAGGCTCAAG TTTCGAAAGG TTCCCGCGCG
GCACTGTTGG CCATTTGTAA CACAAACAAC AACAGGGACA TCAAAAAGAC CATTCCTGCA
ATTGTTAACG CCATGTGCAA GCCTTCTGAA ACCAACAAGG CCGTGTCGGA GCTCATGGGC
ACGACCTTTG TTGTCCCCGT GGACGCTTCC ACGTTGGCCA TGTTGTGTCC GATTCTAGCC
CGAGCATTGA AGGAAAAGCT CGCCATACAC AAGCGTGCCG CTTGCATTGT CATTTCCAAC
ATGAGCAAGC TGGTGGAAAC GCCCGATGCG GTGGCTCCCT TCGGCTCCTT GCTCGTGCCG
GAATTGCAAA AAGTGTCGCA CAATGTTCAG TTTGAAGAAA TTCGGGACGA AGCACTCAAA
GCGTTGGCCA ATCTGACCAA GGCTTTGGGA GACGCATACA AACTGACCGA TGAAGATGAC
CAAGCGGCGG AAATGGCCAA CGAAAAGGCC GAAGTAGAGG CGGAACAGAA ACGTATCGAA
GACGTGCGGG AAGCAGAACG ACTGAAAGAA GAAGCCGTAC AGAAAAGGGA GGAAGAGGAG
CGCAAAAAGT TCAAGGAAGC CATGGATGCA CAGCGGGAGC TGACTCGCTT GGAGGCGGAA
GAAGCGGAAC GTCAACGCTC GGAAGAAGAA ACCAAGCGCG AAGCCGCAAG ATTGAGTACG
AAGGGCGGTA CTGGGAA
 
Protein sequence
MATINASTKI SVPKKKPSIA VGAAKSVIGA KKPSGTAARR GSVAPPLKPA TLSNASNLAR 
LQESDSAGQI AHVRDLFRQA LGEHTTLNLG SPKAASTTRD RAGAAADVAV AAKTLGVRWI
LKKCGIVDEM QRMLFPGGIE EFLARNADNE SEMNGNGSTP MTGGLKASAS AVSLASMDEV
TTVTSANSLG TDTKRGKTTP ANAREGCLLV IRALCQIVGK AAESFVVGAF LAAALDECAS
SSGAIREAAE DASTAIVALA NPWAFRTVLC PLLLQSLKST EWRTKACTLE RLEQCASTAS
AQVYKMIPTL IPAVGNQVWD TKAQVSKGSR AALLAICNTN NNRDIKKTIP AIVNAMCKPS
ETNKAVSELM GTTFVVPVDA STLAMLCPIL ARALKEKLAI HKRAACIVIS NMSKLVETPD
AVAPFGSLLV PELQKVSHNV QFEEIRDEAL KALANLTKAL GDAYKLTDED DQAAEMANEK
AEVEAEQKRI EDVREAERLK EEAVQKREEE ERKKFKEAMD AQRELTRLEA EEAERQRSEE
ETKREAARLS TKGGTG
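
The length and GC content fields in this record can be cross-checked against the coordinates and the sequence blocks. The following is a minimal sketch, not part of the original record. It assumes the start and end coordinates are 1-based and inclusive, which is consistent with the listed values (709485 - 707669 + 1 = 1817). The function names gene_length, clean_sequence, and gc_fraction are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence (10-letter groups, line breaks)
    into one uppercase string with all whitespace removed."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a sequence; 0.0 for an empty input."""
    seq = clean_sequence(text)
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # Coordinates taken from the Gene Information fields of this record.
    print(gene_length(707669, 709485))  # 1817

    # Paste the full "Gene sequence" block here to compare the result
    # with the GC content field; only the first line is shown.
    block = "GGACATTTTC GTTAATTCCC AAAAAAGCAG CATTGACGAT ACTATACTAG CGTTGTCTAT"
    print(len(clean_sequence(block)), round(100 * gc_fraction(block)))
```

Applied to the complete gene sequence, clean_sequence should return a string whose length matches the Gene Length field, and gc_fraction multiplied by 100 can be compared with the GC content field after rounding.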