Gene PHATRDRAFT_43688 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_43688 
Symbol 
ID7197235 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011670 
Strand
Start bp1190295 
End bp1191483 
Gene Length1189 bp 
Protein Length378 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177774 
Protein GI219112045 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.569479 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCAGTC GACGATCCAT TCCAGCTGCC CTGCTGCTGC TGCTGGTCGC TGGGTTAATA 
TCAACCATCA ATGGTTTTTC CAGTCGACAA ATGAGGTCGC ATTCTTGCGC AAACTTGACC
CAGTGCTGGA GCAAGGGTTT TGGCGGTGAG GGTCGTGCTG GGTTCGGCGC AAAGACACCA
CCCGCAATCA AAAAGGCGAA TCAGAGATCG GCCGTGAAGC GAGCGCAAAA ATCGTATGGT
GGAGCCTCAG CTCGTGAGAT CGCACAGGCC ACTCAGCAAA AGATTGAGAA TGAAATGATG
AATTTACCGC CACACTATCA AATGGCGACA CAGCTATACC AGCAACTTCA AACAAGGAAT
GCACACGTCG CAGATCTGAC GGTCTTGGAG CAAGCCGGCT TGAGCCTCCA GGAATTGGAT
GGGGCCAAAC GAGCGCAAGA CAAGTTGGAG CGACTGTACC TTGAGTACGA CTTTTCGGAG
AATGACTTAC ATAATGTTTT CCAAAGGATA ACTTGGGACG CTTCGGCTGA CGCCAAAGCT
GCGAAAGCTA TGCTGGGTGA AATGCCGAAG GAGATCTCGG ACCGCGTGGA CCGTGCATGT
AGTTATGTTG CCGATGGCGT ACTAGCTGCT GGTCCATCCG GCCGCTGTTT AGACGTCGGA
TGTGGATACG GTGTTTTGGT TCCCCATCTT ATCGAGAGCG GGATTGCGCT CTCTCAGATC
TACGGCGTTG ACCTGAGCAC AGAAATGATT CGCAATGCTC GAGAGCAGCA TCGCGGAGCT
ACGTTCGAAG CCGCAGACTT TTTAGAAGAA TATCAAGATT TGAACGATGA GGTCGGATTC
GACAGTATAA TATTCTGCTG TTCATTGCAT GATCTACCTG ATCTTCCCAG GTCTTTGCGT
AAAGCTGCAT CTCTACTACG CTCTCAAGGG AATCTGATAG TTGTTCACCC ACAAGGTGCA
TCACACGTGA CCAAGCAAAT GAAGTCCAAC CCTGTCATGG TGAAAAGAGG TTTGCCAAAC
GCGGAGGAGC TGCGTGCTAT GAAACTTGAA GGGCTTGAAT TGCAAATCGA GCCTACCAAA
GAGGGCTCAC GAGAAGAGCT AGAAAGAGGC TATCTAGCGG TTTTTCGCAT AATATAATGT
CACTTGCGCT ACGGCTAGAG AATGTAGATA GGATGAAAAG TTTTTATGG
 
Protein sequence
MASRRSIPAA LLLLLVAGLI STINGFSSRQ MRSHSCANLT QCWSKGFGGE GRAGFGAKTP 
PAIKKANQRS AVKRAQKSYG GASAREIAQA TQQKIENEMM NLPPHYQMAT QLYQQLQTRN
AHVADLTVLE QAGLSLQELD GAKRAQDKLE RLYLEYDFSE NDLHNVFQRI TWDASADAKA
AKAMLGEMPK EISDRVDRAC SYVADGVLAA GPSGRCLDVG CGYGVLVPHL IESGIALSQI
YGVDLSTEMI RNAREQHRGA TFEAADFLEE YQDLNDEVGF DSIIFCCSLH DLPDLPRSLR
KAASLLRSQG NLIVVHPQGA SHVTKQMKSN PVMVKRGLPN AEELRAMKLE GLELQIEPTK
EGSREELERG YLAVFRII