Gene PHATRDRAFT_43176 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_43176 
Symbol 
ID7196769 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp2234459 
End bp2236609 
Gene Length2151 bp 
Protein Length464 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002176938 
Protein GI219110373 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGTGCGTGTA GGTGAGCATC GGACCATGGT GGGATATTGT GACATTTTCG GGCAACCTGA 
AAGCGTGATT TAGTGTTTGC GTGTCGATAA TGCTTCTAAG CGTCGCGGAG CGTCTTTGGT
CCCGTCCAAA ATGGTCTTGG ATACGTCGAG GGTTGCCTCT GGTTGCGACG TACGTTTTAC
GTGAACGCAG TACTACGCGC CGCTAGTCTG CGCCCGAAAA ATGAACCCGT TGCTTGCTGC
GGACCCAGCC GCTTCGATCG TCGGTATCCT TTGGTTGAAT CGTTTCGACT TCAATCCTAT
GACTTGAATA TGTCAAGTCA TCGCACCGTT GATAAATATG CACTTGATCC CGTAGAGAGT
CGAGGACGGA AACACATGAT TGCCTGATTC TGACTGACCA TTATTGTGAA TTCTGTTTTT
GCCATTTTGT AGAAGTGGTA CATTGATTGA TCGACGGGGG TTCTTGACGC GTGGGTACTA
GAACGCTCAC ATGGAGGGGG CTAGTACAAC ATCCTCGGCG ACGTCTTCTG TCGACGGACG
AAGCGCCGGG GGCGGTACCT CCCACAACAC TACCGCATCC GCGTCGGATT ATTTGAACAT
CGGAATGTTT GCCTCGGGAA ACACGCCCGG GGGCAGCGAC TTACTCGGAA ACAGCAACGC
GGTCTTCTTG GGGAATGGCT TTGGAAATAT TGATTCTACC AATACAGCAG CAGTATCAGC
ACTAGTGAAT TCGACAGGGA CAAGCTTTGA TTTGACAGAT TTTCCTAGTT TAGCGGGAGG
AATAGGTGGG GCAAATGCGA GCGTAGCCGG GAATGGCCTA GCTGCCGCGT TACGACAGCA
GCAACAACAA CAACAACAAC AGCTATTGGC GCACCAACAA ATGCTGCAAA GCAAGGGAGG
TGTAAGCAAC GCTTCAAACT TATACCGACT GGCTATGTCG GGAGCCAACG GAAATTTTAA
CATGGCAACC GAAGACTTCC CGGCACTAGG GGCCAACGCA CCACAGCCAG CACCGAGTGG
ATCCTCAGCG CTGAATCCGT CATCGCTGTT GTCGGGAAGC ATGCCTGTAT CGAGAGGCGG
CAATGCCAAC GGAAACGTTG GTGGCTTGTA CGCCGATATC GATACCAACA AGAACAATAC
TGCTAGCAGT GGCTCCCAGT TAGATGTTTC AGGTGGTCTG TTAGGTGGTA CAGGTCTTGG
TGGACTAGGA GGTATTCGAG GTCTTCAGCA AGCCGGTATG ACCGGGGCTG GAAATGCAAT
GGGACGAGCG CCGTCATCGA CGGTACCGGG TGCGGGAGCA ATAGGTTCTT CGAGCTCCGG
AGGGGCAGCG GCTGGTGGTG CACTGACTGG TGATTATGGT TTGCTGGGAT TACTAGGAGT
CATCCGAATG ACCGATGATG ATAGAAATAC TCTGGCACTG GGGTCAGATC TGACAATGTT
GGGTCTGAAC TTGGGATCGA CCGAACAGAT TTACAGTACA TTTTCTAGTC CATGGTCGGA
CAATGTCGCA ACAAAAGAAC CGCATTATCA GGTACGTTAG TGAAGTCGCA CTTTGATTTC
TACCGGAAAC TATGGCTGGC CCTAAACCGA CGATTTACGC ACCTTTACTA GCTTCCTGTG
TGTTACTATA TGCAACCACC AGCACTGAAA ACAGGCCACC TGTCAAAGTT CCAACTCGAA
ACCTTGTTTT ATATCTTTTA TGCTTTGCCA AAAGATGTTT TACAAGCGTA CGCAGCACAG
GAACTATATT CACGGGAGTG GAGGTATCAC GGAGAGCTTA AGTTGTGGTT CAAGCGAGCA
AGTCCTTCGG ACGGCGTGTC TAGCAGTTCA AGTGGATCAC CGCAGTACCT CTACTTCGAC
ATTAACTCAT GGGAGCGACG CCTTTTTAAT GGCAGCATGA ACCAGAACAT TACTAGCGGC
TTCATTACGG AAGACGAGGT ACAAGTCAAG TTCCCAAGCT CATGAGTTCA TTGTCTCAGT
TAAGTTGGTT TCATGGCTAA ATTAGCTAAG GCGTCGTAGT CAGGCAAGCT ATGGAACTAA
TGATATACTA GTTGCAAATG AAAAAACGCA TTCTGCTCTT TGCTGACTTA CTGCCAATAA
ATTGTTAGTC GATAGGTATA ATTCCAAGTA ATTTAACAAC CAATAAGAGC T
 
Protein sequence
MEGASTTSSA TSSVDGRSAG GGTSHNTTAS ASDYLNIGMF ASGNTPGGSD LLGNSNAVFL 
GNGFGNIDST NTAAVSALVN STGTSFDLTD FPSLAGGIGG ANASVAGNGL AAALRQQQQQ
QQQQLLAHQQ MLQSKGGVSN ASNLYRLAMS GANGNFNMAT EDFPALGANA PQPAPSGSSA
LNPSSLLSGS MPVSRGGNAN GNVGGLYADI DTNKNNTASS GSQLDVSGGL LGGTGLGGLG
GIRGLQQAGM TGAGNAMGRA PSSTVPGAGA IGSSSSGGAA AGGALTGDYG LLGLLGVIRM
TDDDRNTLAL GSDLTMLGLN LGSTEQIYST FSSPWSDNVA TKEPHYQLPV CYYMQPPALK
TGHLSKFQLE TLFYIFYALP KDVLQAYAAQ ELYSREWRYH GELKLWFKRA SPSDGVSSSS
SGSPQYLYFD INSWERRLFN GSMNQNITSG FITEDEVQVK FPSS