Gene PHATRDRAFT_50345 details

Gene Information

Locus tag: PHATRDRAFT_50345
Symbol:
ID: 7198996
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011696
Strand:
Start bp: 373373
End bp: 375034
Gene Length: 1662 bp
Protein Length: 549 aa
Translation table:
GC content: 52%
IMG OID:
Product: predicted protein
Protein accession: XP_002185185
Protein GI: 219130045
COG category:
COG ID:
TIGRFAM ID:
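
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, which is consistent with the listed gene length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_bp(protein_aa: int) -> int:
    """Bases needed to encode a protein plus one stop codon."""
    return (protein_aa + 1) * 3


# Values copied from the record above.
length = gene_length_bp(373373, 375034)
non_coding = length - coding_bp(549)

print(length)      # 1662, matching the listed Gene Length
print(non_coding)  # 12 bases not accounted for by the 549 aa protein plus stop
```

The 12 remaining bases agree with the listed gene sequence, in which 12 bases precede the first ATG.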


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0816818
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGATATTTTC CCATGTCTTC TGGTTCTTCT CTTTCCCCGC TGTACGCGCC ACGCTTTGCG 
GAAGGTCTCC AAATGCCGGC GGAAGACTGT CGCGACTATC TACGAACAGG CCGATGCAAG
TATGGGCCGT CATGCAAGTA TAATCACCCA GCTAATGTAC AAAGCGGAGG GGGTATGCGG
GCGCCTATTG ACCCTTCGGA ACCACTCTTT CCCGTGCGTC TCAACGAGCC ACTCTGCCAA
TACTACATGA AGCACGGCAG CTGTAAATTT GGACAAGCAT GCAAGTTCAA CCACCCTCCT
CAGCTTAGCC ACAGTTCACA AGTTGCAGGG GACACCCCTG TTACCGGCAA TGGGCGTAGT
ACCGACGTAC CTGTCGTCTT CAGTCAATGC GATGGTCCAA TGATGCTACA ATTTCTTCCA
CAACGCCCCG ATGAACCGGA CTGCATCTAC TTTTTGAAGA ATGGACGATG CAAGTACGGA
GCAACTTGCC GCTATCATCA TCCGGTGAAC TATCACAAGC ACCGCGCGGA GGAATCTCGT
CGTCAACATC GAGCGCAACT TCAGGAGCAG TACGCGCCTC AAAAGGTACA ATACATTGCC
CAAACAGTGC CCAACGGAAA CTTCAAGGGT CAGCACGTGA TGTCTGATAA CCCTTTGACT
TTTATGAGCT ACGATGTGCC ATCCGGAACT CCAGGGTTCC AGCCAATGTC TCTTGTCATG
GGAGCCGATG GTAGTACTTC GTACGCAACT CACATCGGCC CAAATATAGT TACCGAGCAA
GGATCTTCGG CTTCATCTAT TGCTTCTTCT TACGAAACGG CGCCAACAGG CTTTGATCAA
TTCCAAGGCG ATCCATCCAT GTGGGCTCGT GCTCGACGCA ACGGCAGTGG AAATAGCCTG
ACAGCATACA CGATCGACTC ATCTAATCGA GGAGCGCGTC TTGCCATGAC TCACAGTCCA
AGCGAGGGTA GTATGGCTTC GCGCAGACAT CGTGCGAGCT CTCACGGAAG CGCGAGCGAG
AGCTCCTATC ACGATGTGAA CCAATCTGGA TTGAGTCGAA GCGGTTCGGT TGGTTCGTGG
CGCAATGATC GAGTTCCTTC CTCTACCTAT GATCGCCGGC TTCCGACACA ATACATTTCA
AGAATAGACG GAGTTGTGAG CGATCAACAA CCGCGTGGAC GACCCCCTTC TATGTCCATG
GCACCAGGAC ATCGACCCAG CCCCAGAAGC CGCAAACCAA GGGCGCACGG AGAAAATGAT
GAGGGCTTTA CCATGATGAC CTCCGCTCTG TTGAATATGC TTGACACACC GGAAGAGGCA
TCGACTGAAA GCTTCAGCGA CGAAGACAAC AATCGCTATC GCTTACAAGA GCCGTGTGAA
GAGCAACGCC CCCTATACGG CGACCCGTTG GACGTCGAAT CATCCATGTT TGAACGTTTG
TCTTTGAATG GTGTAAAGCA CAATTACCAA ATTCGATCCG TATCAGATAC AAACACGAGT
GATTCATGGT CTCCAACGTG GCAGGGTTCC TTAAGAGGGC CAGCTTCCCC TCCTGCATCC
TCACTCGATG GCAATGCTCA AGCTTTGTCG GCTATCCCAC CACGCCATTC GCAAGGTCAT
AACACCCCAC CATCCTCTGA TATCGGTCTC TTTATACCCT AG
 
Protein sequence
MSSGSSLSPL YAPRFAEGLQ MPAEDCRDYL RTGRCKYGPS CKYNHPANVQ SGGGMRAPID 
PSEPLFPVRL NEPLCQYYMK HGSCKFGQAC KFNHPPQLSH SSQVAGDTPV TGNGRSTDVP
VVFSQCDGPM MLQFLPQRPD EPDCIYFLKN GRCKYGATCR YHHPVNYHKH RAEESRRQHR
AQLQEQYAPQ KVQYIAQTVP NGNFKGQHVM SDNPLTFMSY DVPSGTPGFQ PMSLVMGADG
STSYATHIGP NIVTEQGSSA SSIASSYETA PTGFDQFQGD PSMWARARRN GSGNSLTAYT
IDSSNRGARL AMTHSPSEGS MASRRHRASS HGSASESSYH DVNQSGLSRS GSVGSWRNDR
VPSSTYDRRL PTQYISRIDG VVSDQQPRGR PPSMSMAPGH RPSPRSRKPR AHGENDEGFT
MMTSALLNML DTPEEASTES FSDEDNNRYR LQEPCEEQRP LYGDPLDVES SMFERLSLNG
VKHNYQIRSV SDTNTSDSWS PTWQGSLRGP ASPPASSLDG NAQALSAIPP RHSQGHNTPP
SSDIGLFIP
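
The relationship between the two sequences above can be illustrated with a short translation sketch. The record leaves the translation table field blank, so the standard genetic code is an assumption here. The helper name `translate_from_first_atg` is hypothetical. The example translates only the first listed line of the gene sequence, starting at the first ATG.

```python
from itertools import product

# Standard genetic code (assumed; the record does not name a translation table).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate_from_first_atg(dna: str) -> str:
    """Translate from the first ATG until a stop codon or the end of input."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    start = seq.find("ATG")
    if start == -1:
        return ""
    protein = []
    for i in range(start, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as listed in the record.
first_line = "CGATATTTTC CCATGTCTTC TGGTTCTTCT CTTTCCCCGC TGTACGCGCC ACGCTTTGCG"
print(translate_from_first_atg(first_line))  # MSSGSSLSPLYAPRFA
```

The output matches the first 16 residues of the listed protein sequence. Passing the full gene sequence should work the same way, since the function ignores whitespace and line breaks.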