Gene PHATRDRAFT_35554 details


Gene Information

Locus tag: PHATRDRAFT_35554
Symbol:
ID: 7200788
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011676
Strand:
Start bp: 210654
End bp: 212486
Gene Length: 1833 bp
Protein Length: 610 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002180192
Protein GI: 219118851
COG category:
COG ID:
TIGRFAM ID:
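The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the helper names `cds_length` and `protein_length` are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the last codon of this unspliced CDS is a stop codon.

```python
def cds_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


gene_bp = cds_length(210654, 212486)
print(gene_bp)                  # 1833
print(protein_length(gene_bp))  # 610
```

Under those assumptions both values agree with the Gene Length and Protein Length fields reported for this record.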


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.190884
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATTTAC CGCCACGGGA AGCTTCTCGG GACCTTACAA CAAGCCTTCG AATTCGACTC 
TTCGCTGCGA ACCTTCCGAA GCAAGGACGA GGATTGTTGA ACAAACAAAA CCCAAGCGCG
TACGCTGTGG TAACGTCGAT CTCGACGAGC GATGGCTACA AAGTTCCCTC CACCGACGCA
TCGTTTCAAA GGCAGGGTTC CTTTGAAGGT TACCGGTGGG GAGATACTGA GATTGTAAAT
AGCAGGAACC CACAATGGAC AAGGACGATT CCACTGGAGT ACGAATATGG ATCCGAATCC
TATTTTTATG TCCATGTCTT GCAAAGCAAC TGTGACGGCC AGGCCGTGCA TGGCCACGGT
TCGACGAAGA GTCTCGACAG CGATGCTTTT TCTACTAGCT TTGGCACAGC ACTGTTTGAA
GTAAGCGATG TCCTGGGGAC TCGAAATACA ACAAAGGTGA AGCGATTGCG CTCTGGTGGC
TGTGTGTTCT GTAAAATTGA GCCAGTTCAA CAAGGTGAAG CGGGAATGCG TGTGTGCTTG
CAGGTTGAAG CCAGAGGTCT CGTAATCTCG CACGGGAATA GACGAGCTTG GACAAGCAAT
TCTTTTTATC GAAAACCGGA CGCGTTATTT GAAATTGCTA AGCAACACGC GAGCAACAGC
GAAGGAGCCT ATGTTACTGT TTATCGTTCC ACGCCAGCCG TCAACACACT TGATCCAGTG
TGGGATGCAA TTGATCTTGA TTGTGGAACA TTATGCAACG GCAACATCGA CCAGCATCTT
CGTTTCTCTG TTCTGCTCCA GAAACAAAAG GGAAATCGGG AGCTGATCGG GCTGGCCGAA
ACAACACTTC GCCACTTGCT GCAACAAAAC AGTTCTTACA TCGGTGACAC CGAATGTGCA
AATGGAGGCG ACAATGACCC AGTAGAGAAA TACAAGGAGC TGATTCTGCA GCGCAACTCT
TCAAAATTAA AGCAAGTCGG GTGTCTGCGA ATTGGTGGGT ACGAACTTAT TCCGGAGTCA
ACGAATCGAT CTTTGTCCCT TCGCGAAGTG GGTTCTGTGG AAGGAACAGT TTTGGAAATT
GTGGACTTAG CAGAACTGTC ACCTATCGGG ATACCTTCGT CCGCCACAGG GTTTCAACAC
TACATTGAAA GAGACTGCGA AATCAAATTC TGTGTCGCCA TTGATTTCAC GAGCTCAAAT
GGAGACCCAA GGTTTGAATC TAGTCTTCAC TATCAAAGCC CTCAAACTTT CAACGATTAC
GAGGAAACTA TTTCTTCCAT TGGGCGGTCG CTTTCGGCGT ATATTAGAAC GGAGGAATTT
GCGGTATGGG GTTTCGGGGC AAAATTTGAC GGCAAAATAA GTCACTTGTT TCAGTGTGGG
CCGGACCCGA CTGTCAAAGG AGTGGATGGT ATTTTAGAAG CGTACAAGAG CGTGATTCAG
GGAGGTCTTA CAATGAGCGG ACCTACTGTC TTTTGTAAAG CACTTCAAGC AGCTGCAGTA
CGCGCCAAGA GAGACCATGA AATCATGACT CCCCAGACAT TGTCTTACAC TGTCCTCCTG
GTGATCACTG ATGGAAATGG AGATAGTTTA GACGAAACCC GCAGAAAGCT GCTCGTTTAC
AATCAACTTC CGCTATCAGT AATTTTTGTG GGTGTCGGCC GTTCTGACTT CGGGCAAATG
TACAGTTTTC TCCAAGAATC GACCCGTGAA AGCATGAATT GCAGTTTTGT GGAGTTCCGA
AAACATCAGT ACAACCCAGC TGCGTTGGGT CGAGTTGCAC TGTGTCAACT TCCTGACGAT
CTCTGTGCAT ACATGCGGCG ACGGGGTTTC TAA
 
Protein sequence
MNLPPREASR DLTTSLRIRL FAANLPKQGR GLLNKQNPSA YAVVTSISTS DGYKVPSTDA 
SFQRQGSFEG YRWGDTEIVN SRNPQWTRTI PLEYEYGSES YFYVHVLQSN CDGQAVHGHG
STKSLDSDAF STSFGTALFE VSDVLGTRNT TKVKRLRSGG CVFCKIEPVQ QGEAGMRVCL
QVEARGLVIS HGNRRAWTSN SFYRKPDALF EIAKQHASNS EGAYVTVYRS TPAVNTLDPV
WDAIDLDCGT LCNGNIDQHL RFSVLLQKQK GNRELIGLAE TTLRHLLQQN SSYIGDTECA
NGGDNDPVEK YKELILQRNS SKLKQVGCLR IGGYELIPES TNRSLSLREV GSVEGTVLEI
VDLAELSPIG IPSSATGFQH YIERDCEIKF CVAIDFTSSN GDPRFESSLH YQSPQTFNDY
EETISSIGRS LSAYIRTEEF AVWGFGAKFD GKISHLFQCG PDPTVKGVDG ILEAYKSVIQ
GGLTMSGPTV FCKALQAAAV RAKRDHEIMT PQTLSYTVLL VITDGNGDSL DETRRKLLVY
NQLPLSVIFV GVGRSDFGQM YSFLQESTRE SMNCSFVEFR KHQYNPAALG RVALCQLPDD
LCAYMRRRGF
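
Both sequences above are printed in groups of 10 characters, so the whitespace has to be removed before any length or composition check. The following is a minimal sketch of how that might be done and how a GC percentage could be computed for comparison with the GC content field. The function names `ungroup` and `gc_percent` are hypothetical, and only the first line of the gene sequence is used here as sample input.

```python
def ungroup(seq_text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence and upper-case it."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an ungrouped nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied as displayed (six groups of 10 bases).
first_line = "ATGAATTTAC CGCCACGGGA AGCTTCTCGG GACCTTACAA CAAGCCTTCG AATTCGACTC"
seq = ungroup(first_line)
print(len(seq))
print(gc_percent(seq))
```

Passing the full gene sequence block to `ungroup` in the same way would give a string whose length can be compared with the Gene Length field and whose GC percentage can be compared with the reported GC content.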