Gene PHATRDRAFT_44969 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_44969 
Symbol 
ID7199646 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011673 
Strand
Start bp841999 
End bp844170 
Gene Length2172 bp 
Protein Length561 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179074 
Protein GI219116558 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TGAGCCATCC ATCAAATGGA GTAAAGTACG TCAACGACTC CTTTGACAGT CCGTGAGAAT 
CTCATTGTGA ACTTTCTCGG CCCATACAAT GATCCTCTCG CATTTACAGT GACAGTGAAA
AAGACGTGTT ACCTCTACAT TGGATGGATA GTTTGATGGT TGTCTGTCTT TTATCGAGTG
CGGACGGATT AAAGACCACA AAATCACGTG GCAGGCTTAG GCTTCGGAAT GGTATGAGAC
CTCATTTCAT TCTTGAATGT GCTTTGGCTG CAAAAATTGT TGACGAGCCT TCTGCCTTTC
CCGGTTGACC TGTAAGCGTG AAACTGTCCA CAGATTGATT GTTTATTGTT CCACTGGCTG
ATATGCTGTT TGCTTGTTTG TTTGTTTTTT TGCAAAAGTG GTGCACATCT GACTATGAAG
AAACAAATCG GTGTTCTGTT CGTTTTATTG CACCTGTGTA ACGCGTCGTC TAGATCCGTA
GTTGCGGCTG CAAATGTTCG TACAAACCAG AAGTGCTCGA CAAAGACTCG TGAACCACCT
TTCGGTAAAT GGATTGAGGA GGCGCGGCTC TTGGGATGTT GCTCGCCACT GATTTCTGCA
TCAATGTTTG GTGCGGCAAG TGCAGTACTA CTTGCTGGAA CCGCAATGGA GCCAGTCTCA
AGAGCTCTTT ACTTTTGGAG GACAGCCGGG CCCGCAATTT TCCATTATAA ATTTACGCAG
TGGTGGTTGG AGGCATCCAA GGCTGACATA GAAAAGCGCG ATCTAGTTTA CGAAAGCTTG
CACGATCGAT ATGCTGAACC CGCCTTGAAG ATGATGATTC GCCAAAAAGG TCTGTACGTG
AAACTCGGAC AGGTTCTTTC CTCGCGACCA GACTTTTTAC CATCCCAGTA CATCGAACGT
TTCGCAACTG TTCAAGATTC GATACCGCAA TGGCCTATCG ACCAGGTACG CGCGATAGTG
GAGAAGTCCT TGATAGTCGA ACTCGGCCTG TCTTGGGGGG ACGTCTTCGA ATCCATGGAT
GATATAGCAC TTGGGTCGGC AAGCATTGGG CAAGTCCACA GAGCCGTGTT GACCGAGAAG
TGGGCCAAAA CTACAGGATA CAGAGGAGAT AAAGAAGCGG CTGTGAAAGT CATGCATCCC
AACTCCCAAA AGCTATTTGC ATATGATTTT GATGTGTTTC GATGGGTTTG CCGTATTGCA
TTACCCGGTT GGAAAGGTTT CTTGGACGAG CTCGAGCGAA GAATCATGAG CGAGTTCGAC
TATCGGCACG AAGCCACTTC GTTGGATGAA GTTCGTTCTC CGTACAAATC TAGAGTTTAT
ATACCGCAAC CTTTACAGGA GCTCTGTTGC CGTCATGTGC TTGTAATGGA GCTTCTAAAA
GGGCGGAAAT TAGTGGACTC CTTTGAGGAC GGCCTGGCGA ATGCGATGGG AGGGCATGAT
CTTGCCAAAG CATATTTGGC GAAGAAACAA AGAGAAATAT TGCTAGGTTC TTGCGATGGT
ACAGATTATG ACGCCATCTG GCGGCTACCC ATAGTCAACA AACTCAAACT TTTGTGGCTG
AGAAGAGTGG CATCCAAATA CATCGATCTT TTGCTAGACG TCCACGGGTA CCAAATCTTT
CAGAACGGAT GCTTTAACGG AGATCCTCAT CCTGGAAACT GTTTGCAACT AGAGGATGGA
CGTCTTGGCT TGATTGACTT CGGTCAAACC CGCCGTCTTA CAGAGACAGA AAGATACGAT
TTGGCCCGAA TTGTATGTGC TTTAAATGAT CCTTCCACGG ATGCTATCGG AATTGATTAC
GCAATGCAAA CGGCTGGCTT TCAACTCAGA GACGCAAGTG CGGAGATGAT GGTCAAATAT
GCTACGATTT TCTTTGACTC TGACGAGGAT AGCAAACACC TGGGCTTCGC AACACCACAA
CTTTATTTTG CTAGTTTAAT GGCAACCAAT CCTTTGGTTG TCATTCCAGA TTCAGCTGGT
ACGTAGTTTG ATTGAAACTT GAGCTATGAA TTGTCGAATT GATTGCATCT GAAATTGCTT
CTTTGTACAG TGTTTGTCGC TAGGACGAGC TTCTTGTTCC GTGGCATCGG TAGTGGTGTC
GGCTCCGGCC CACTGAGAAC TTCACAAAGG TGGCAAGAGC ATGCTGCTGC TGCAATTAGC
CAGGTCTCAT AG
 
Protein sequence
MKKQIGVLFV LLHLCNASSR SVVAAANVRT NQKCSTKTRE PPFGKWIEEA RLLGCCSPLI 
SASMFGAASA VLLAGTAMEP VSRALYFWRT AGPAIFHYKF TQWWLEASKA DIEKRDLVYE
SLHDRYAEPA LKMMIRQKGL YVKLGQVLSS RPDFLPSQYI ERFATVQDSI PQWPIDQVRA
IVEKSLIVEL GLSWGDVFES MDDIALGSAS IGQVHRAVLT EKWAKTTGYR GDKEAAVKVM
HPNSQKLFAY DFDVFRWVCR IALPGWKGFL DELERRIMSE FDYRHEATSL DEVRSPYKSR
VYIPQPLQEL CCRHVLVMEL LKGRKLVDSF EDGLANAMGG HDLAKAYLAK KQREILLGSC
DGTDYDAIWR LPIVNKLKLL WLRRVASKYI DLLLDVHGYQ IFQNGCFNGD PHPGNCLQLE
DGRLGLIDFG QTRRLTETER YDLARIVCAL NDPSTDAIGI DYAMQTAGFQ LRDASAEMMV
KYATIFFDSD EDSKHLGFAT PQLYFASLMA TNPLVVIPDS AVFVARTSFL FRGIGSGVGS
GPLRTSQRWQ EHAAAAISQV S