Gene PHATRDRAFT_37556 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_37556 
Symbol 
ID7202416 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp400149 
End bp402348 
Gene Length2200 bp 
Protein Length703 aa 
Translation table 
GC content55% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181720 
Protein GI219122786 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.388143 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCCCTT GGGAGCATTC TCCGCGACAG AAACCTTCCG CTCACTCTTT CTCCCACACC 
CACGGTACTT GCACTGCGAG TGCTCGCATG GCTACCGCTT CCATCCTCAC GCGATCCACC
AGTCCACCCG TCGTGGCTAC TTCTACGCCC GACGAACTCG ACGACGAAGA CCGACCATGC
CGTGCGTGTC GTCGCACCGG TACGTTACGG ACTGACTGGC AGCAGGGTGA TCGGGTTTGC
ACCCAATGCG GCGTCGTCGA CCAGGGTCAC GTCATCGACA CGCGTCCGGA ATGGCGAGAG
TTCCACGACG ACGCCGATCT CGCCAAAGGC CGTCCTTCGC AAGCGCGCTC GGGTTTGACG
GTGGTGGACG AATCGCGGTA CCTTGGGGGA CTCCAACCCA CCATGCTCTC CACGAACGTC
TACGGCACGG CCTCGTCCAC GTCGCAGCAA ACACGGAAAC GCCTCTTGGT GGCCGCGCAC
AAAATGGACC GACTCATGGA ACGATCGCAC GCGCGCGCGT TGGAGGCGGT CCGTGTCTCG
CGCGCGGCGA ACCGGAAACG TCGACGAGTG GAAGGACACC CGGTACCGGG AGCGGTGGAT
CCGGACGAAG ACGACGCGGA TATCGACGCC ACGATGCGAC CAGAATACGA AGATTTCGTA
CAGTTGGAAG AGCAGGAAGC TCAACGGTTG CAAATTGCGT CGTACGGCGA CAAATGGAGC
CTGGAACGGG CCATCCGGTT GTACGGATCC GCCTTGGAGC AACAGAACTT GTCGACCGAC
GACACAATAG ATGACCGGGG CCATCTGGAT GACGGATTAA AACGTGCGTC TCGAGATCTC
TATCAAGCGT ACACATTTCT GTCCACAGCG GTGCAAACGC TGGAGCTCAC GGATCGGGTG
CAGCATGAAG TGGTCGGACT GCTGGTTCGG TACGCAAAGT GTCGCGACGG CCTCCAAGTT
CGAGGTGTGT CCTCCACGCT ACAAAAGCGC CCCTCCACCA AAGCTACCTC TCCCAACGAG
ACGCAGCGGG CGCGTCGGAG TTTACGCGAA TACAATCAGG CCAAGCAAAC CGGTGCGCTT
CTGGCCGCGC TCCTCTTTTA TACCGCCCGC AACCTCGGCT GGCCCCGCAC CCTCGTCCAA
GTCTGTCACG CGATTCCTTT TCCATCCCAG TCCTTGCCTC ATTTGGATCT CAGATGCGAG
GACGGGGAGT TTATTAAGCG AAAACACTGT TCCAAAGCCA TGACGGAAGT AAAACAGGTT
TTCCCAGATA TTTGCCGGGT GACGGCCACG TTGCATGCGG TATCGAACGT TTCGTCTGCC
AGTAGCAGCA GTACCAATAG CAACAGTAAC AACTACAACA AGAGGAGCGA AATTCCGCAA
CCACAGAGAC TTCAAGACCA TGTTTCGGTA ATCAATTTTG TCGATCACGC CATTCGCAAA
TTACGTCTAC CACCTGTTGC CGAAGCATGC GTGCGGATCT TGGTATTGCG GTATTGCCAC
GGAACAAAAG ACTCTGCTTT GCGATTAGGT GCCATAACGG CTTCCTCAGT GTATTTTGTT
ACACAGACGG GAGACATTAT GCAGCGATTG GCCAAGCAGG CTGTGAGCGG TAGCAAACCG
TCCTTGGCAT CGAAGCACGA CAAAGTGACG AGAACGAATT CCTCGCCTAC AGGTTTCCAC
AGGACAAAGT TGGAACTCGA CGGTTTCACT GCCGCACGAG ATCCGTCGGC TTCCGCAAAC
GTCAAGCATG AAGATTTGTT CAGCGCGGAA GCAGTGCAGG AATTTGCCTC GGAGCAAAAG
GTGTACGAAA TGCGACGGGT GTGGGACGCT TGGTCGGAAC AAACAACCTG GATGCGCAGC
TTGGGTGAGA TTGAACGAGC AATGGGAGTT TCGAGACCAA CGCTTGTGGA AGTCTTCAAG
AAAGAGATTT TTCCGAAGCG AGTTGAGCTC TTGCAGGCTC TTCAAGATTC TGTCGAGACG
AGTGATACAG AGCAGAAGAC TGTTTTGTCC GAGACACCAT TGGCCTCGGT GTTAGTTCCA
CACATTGCTG CTGCGGCGCC CTTGCTCAAA GCTTCTAAAT TGTAAAAGTA TGTAATGTAA
ACTTTTGTTA CAATCGAGCG GGATCAGTTA CTACATTGAA AAGCAAGAGA AAACAGCGCA
GAGGTGGATG GATGTCTCTG TTACTGTCAG CGCTTCTTAA
 
Protein sequence
MSPWEHSPRQ KPSAHSFSHT HGTCTASARM ATASILTRST SPPVVATSTP DELDDEDRPC 
RACRRTGTLR TDWQQGDRVC TQCGVVDQGH VIDTRPEWRE FHDDADLAKG RPSQARSGLT
VVDESRYLGG LQPTMLSTNV YGTASSTSQQ TRKRLLVAAH KMDRLMERSH ARALEAVRVS
RAANRKRRRV EGHPVPGAVD PDEDDADIDA TMRPEYEDFV QLEEQEAQRL QIASYGDKWS
LERAIRLYGS ALEQQNLSTD DTIDDRGHLD DGLKRASRDL YQAYTFLSTA VQTLELTDRV
QHEVVGLLVR YAKCRDGLQV RGVSSTLQKR PSTKATSPNE TQRARRSLRE YNQAKQTGAL
LAALLFYTAR NLGWPRTLVQ VCHAIPFPSQ SLPHLDLRCE DGEFIKRKHC SKAMTEVKQV
FPDICRVTAT LHAVSNVSSA SSSSTNSNSN NYNKRSEIPQ PQRLQDHVSV INFVDHAIRK
LRLPPVAEAC VRILVLRYCH GTKDSALRLG AITASSVYFV TQTGDIMQRL AKQAVSGSKP
SLASKHDKVT RTNSSPTGFH RTKLELDGFT AARDPSASAN VKHEDLFSAE AVQEFASEQK
VYEMRRVWDA WSEQTTWMRS LGEIERAMGV SRPTLVEVFK KEIFPKRVEL LQALQDSVET
SDTEQKTVLS ETPLASVGIS YYIEKQEKTA QRWMDVSVTV SAS