Gene PHATRDRAFT_43631 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_43631 
Symbol 
ID7197347 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011670 
Strand
Start bp1052431 
End bp1054071 
Gene Length1641 bp 
Protein Length479 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178059 
Protein GI219112615 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.370877 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGTAATTTCA TGCTTGCAGT TTATGTAAGA CGCAAAGAAA AACGCTTTTG TAGCTTTCCG 
CTGCTTACAA CGCAAGAATT CAAGCCACTA TGAAGACAAG AGGGCATTGT GGTGTTTTTT
CACTCCTCCT TGCTACTGTT GCTTTCAATG AATTTCTAGC CGTCAGCGCA TACTCAGCAT
CTTTTGCACG GGCAGCGGGG CTGGACAGCA TCGCCCCGTG GAGGTATTCC CAATACCCTT
CCCCTTTCGA GTATCCAACA GTGTGTCGGA CAACTTCTGA GAAACTTTGT GATCCAGACG
GGATTTTGAA CGACAGTGAA GTTGAGCGTG TCGATCGTGT ATTGAAGACT AGCCGTGAAT
TTGTACTCCC CTGTAAAGCA GAAAGCAACG TCGAAGGAAT AAAAGGGAGA GTGATAAATA
TTGAGATAGC AGTGGCTCTC GTAAAGCAGG TGCGTCGCCA GCGCCTTCCT GCACCATTCC
TATGTTGCGC TGGCTGATGC TTCAACCTTA TTATAGATGG ACCTCCTCGA ATTCGAAATG
AACAGCAAAC AACATGAACG CGCTGCGGAA GTATTTGCAA GATCACTGCA TAATGAGTGG
GGAGTAGGAG TTACTAACAG CTGTGGAGGA ACCGGTATTT TATTGTTTCT TTCCGATTTA
GATCGTGTGA TCTATGTTTC ACGTGGAACA GCACTTAAAA CAATTTTGAC CGATCGTCGA
TTGGATCGGG CTATGAACAA GATGAAACCG CTGTTACAAG AGAAGAAATT CGAAGAGGCC
ATTTTGAGCG CTGTGGAAGA GTTTGAATTT CTTATTCAGT ATGGCAAGCC GCACACATGG
GAACTGATCA ACGACTATAT TACCAGGTAC GGCGGTCTCT GTTGGGTCGC TGTTTTTTTA
GTTTTTGCAG GCAGGAATAT CCATGTACAA ACCAAAAAAC AGAGAGAATA TGCCAAGGTT
CGCAGTCATC TGTCAGAAAT GGATCGTGCG CGAGCTGAAG CGCTGCAAGG TCGTTTCTGT
GCGACGTCAT GCCCAATCTG TCTCGAACCA TTCCCAGATC ATGCCACCAC CAGCACCCGT
ACCCCGGAAC AATTGGGCTC CGATAATCTC CCAATCAAAC TACTGCGCTG CGGACACGTC
TTTGACCATA ATTGTTGGCT AGAATGGGCA AGCAAAGGTC AAGGTCAGGT TACCAAATGC
CCTATTTGCC AGCAAGATGT AGGCATGGGG GAAGACCTCA CAACAGCCAG AAATACTCAG
TCACTCTCAC GGCGGTCAAG TCGGGTTGTC AGTGATGATC TTGATGACAG TATTGGACAT
CGAGGCTTGG CTGCGGAAGG AGAGCGATTT CTTAATCTCC ACAATCGTGA ACGCAGTTTT
CGTCTCACGC AGCTAGGATA CCAGTTTCCT CAGATCATTG GGCCTCACCA AATTCAGCAG
TGGTCACAGA ATGATTACAA CGGAATGTTG GTACAGGATC CCACTTTTAT AAGCAGTGAT
CCGGTGTCTG GTGTGGGGAG CTCTGCCCGC GGTGTTGGGA TCAAAAGCAG TTTTAGTGGC
GGGTCCAGCG GAGGCGGTCG TTGTGGTCGT TGGTGAGATA CTTGCAAATA GCAACGCTAT
TAGTGTTGAA GCCATGACTT T
 
Protein sequence
MKTRGHCGVF SLLLATVAFN EFLAVSAYSA SFARAAGLDS IAPWRYSQYP SPFEYPTVCR 
TTSEKLCDPD GILNDSEVER VDRVLKTSRE FVLPCKAESN VEGIKGRVIN IEIAVALVKQ
MDLLEFEMNS KQHERAAEVF ARSLHNEWGV GVTNSCGGTG ILLFLSDLDR VIYVSRGTAL
KTILTDRRLD RAMNKMKPLL QEKKFEEAIL SAVEEFEFLI QYGKPHTWEL INDYITRYGG
LCWVAVFLVF AGRNIHVQTK KQREYAKVRS HLSEMDRARA EALQGRFCAT SCPICLEPFP
DHATTSTRTP EQLGSDNLPI KLLRCGHVFD HNCWLEWASK GQGQVTKCPI CQQDVGMGED
LTTARNTQSL SRRSSRVVSD DLDDSIGHRG LAAEGERFLN LHNRERSFRL TQLGYQFPQI
IGPHQIQQWS QNDYNGMLVQ DPTFISSDPV SGVGSSARGV GIKSSFSGGS SGGGRCGRW