Gene PHATRDRAFT_40158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_40158 
Symbol 
ID7195931 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011690 
Strand
Start bp280653 
End bp283030 
Gene Length2378 bp 
Protein Length643 aa 
Translation table 
GC content56% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184220 
Protein GI219128017 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGGACT ACGGTAGCCC CGGATCGCCT CGTACTCCAC AAACGTATCA GATTGACTTT 
TCGGCGGTAG CGAGCGGTAA ACGCATTGCG AGCTCGAAAC GTCGCGTCCG ATGGTACGTC
TAGCCTCGGG TACCACCACT TTATCCTCGT GTGAAGCTTC CGTTCGAACC GATACAGCCA
GATTGTTGGT GTTTGGGAAC AGACAGTCTG TGTTGTGTTG TGTCGCATTG TTTCGTGTAA
CTGGTCCGTT GCTTAGATGA TGGCTGCGTG GACTGGATCT ATTAGGAACA CGAGCGATCC
CCATCGAACC AAATCTCACA GTCAACGTGG TATTGCGACT TGCTCTTGCT GTTTTCTTAG
GCGCTTTGGA TTCGCCAATT CGGAGGCTCT CGCCAGCGGT GAGACCGGCA CGGCTTGTCG
CGGTGAAGAA CACGACGTGA CGCTCGTTTG GTCGATTGCC TCGGGCAAGC GCCTCATTCT
CGTCGACGGA CACGAAGTCC ACTACTCCAG CAGTCGCAAC GCCATCTTTG ACTTTTCCTG
GACCATGCGC GGCAATCACG TCCTCAAGAT GGTAGCCCAC GTGTCGCCGC CACTCTCTCC
AACCCCCGGG TTCCGGCAGT ACGATTTCTT CATTGACGGA CAGTCCTTTT TTACCTTTCC
CAAGGTCTTT CGCCTCGGAT TGGCACCCGG ACAGGCTCCA GCGAGTCCAT CAGGCGCATC
AACATCCTAT GCTGGTATGG CTCCGACTTC CGCCAGGCGT TCCGCTAGTG GCGAGATTGT
TTCGATGGAA GCCCCCCACA ACCCCGATGA GGTACGTCTC ACTCGTTGTC CGGGATTTTG
GTGATGAAAC GCAGCATCGG GACGCAAACG AATACAGTGA AACGGATCGT TTAGACGCAA
ACGAATACAG TGAAACGGAT CGTTTAGACG CAAACAAAAT TGACCGTCTC GTCTCTTACC
ACTCGCATGC TTTCGTTCGT CCTTTAGGAG GAAGCGTATC TTCAAGAAGC CATTCGTCAG
TCCCTCCGGG ATGACACACC GGCTTCAACC TCGCGAGGAG CACCGACCAA CCCAACTAGT
GATCTCCTCG ATTTTAGTAG CCCTCCAGCC GGTGCCCCGA CCCAGACATA TTCTCCGTCC
ACCAACGACC TCTTTTCTCC ACAAGCATCG GCCAGTAACG GTCCCTATCC GTATCAGCAA
GAAAGCAACA ACATGTTTGC CTCGCAAGGT TCTATTACGT CGGATCCTTG GGGAGCTCCG
GCTCCGGCAG CACCACGGAT CCGTGGGGAG CTTCCGCTCC CGCGACACAT TACGGATACG
GAACACCAGC AGCTGGTCCA TTGCCAGCCC TTACCGGGCC TGCGCCCGCC GCTACCGGAT
ACGGTGGTTA CAATACGGAT CCCGTTCAGG CCCCGTACGG AGCTCCCGCA CTCTATCCTC
CGCCAGCCCA GTCCTACGCC CCCGCACCAG CACAGTACGG TGGGCCGGAA CAAACCCACG
CTCCGGATCC GGTGCAAGCA CCGTACCAAG CTCCTGTTCA ATCGCCATAC CAAGCTCCTC
CTCCCGCATC GGCGTGGCAA GTCCCGGCTG CGATTGCCAC GCAGCCGCCG TACGGGCAAG
ATCCGAGCAT TCCGCCGTCA GTGACACCGC AAGCCCAAGC GACGCCGTCA ACAATAGGAT
TTTCTTCACC CCCACCCGAC TTTTCTGGAT TCTCCTCGGC TCCGCAAGCA TCCGAGCCGG
CGCAGGCGCC AAGCTCGGAC CCTGTCGTGT TCTCCATGAA CGCTCTTAGC GGCGAACAAA
ATGGACTGGT TGACAGCAAC TCGACGGCCC AGTCCGCGTC ACTGGTAGAT CAGGCTTATT
CCAAATTGGT CAATATGGAT ACCTTTTCGT TGGTTTCGAA GAATGACGAA GCTCGGTCCA
ATCCTTTTGA CATGGGTAGT ACTACGGTGG GTGGAAACGT ACCATTGGCC CAAATGAGCA
AGCATAAGAG TCAAACCGCA CCAAAGAAAG AAGTCATGAG ATCGCCTGCG CCGCCTCCAG
GATCAATGAT AGTTGCCAGC AATCACAACG GAAACTGGGG TGGCCAATAC GGACAGCCGC
AGCAGCCGGA TATGCAGCAA GCGTATGGTC AGCAACAGTC TCCGATGCAA CAGCAACCTC
AATACGGACA GCAGCAGCCT CCGATGCAGC CGCAACCACA GTACGGACAG CAGCAGCCTC
CGATGCAGCC GCAACCACAG TACGGACAGC AGCAGCCTCC AATGCAGCAG CCTGGACAAC
TCGGGCAACA TGGACAGCAG CACTTTGGGC AAACTAATCA AATGCAGTAT GGTCAAGCGC
AGCAACCTCC AGCTCAACCA GGATACAACT ATTTTTAA
 
Protein sequence
MADYGSPGSP RTPQTYQIDF SAVASGKRIA SSKRRVRWRF GFANSEALAS GETGTACRGE 
EHDVTLVWSI ASGKRLILVD GHEVHYSSSR NAIFDFSWTM RGNHVLKMVA HVSPPLSPTP
GFRQYDFFID GQSFFTFPKV FRLGLAPGQA PASPSGASTS YAGMAPTSAR RSASGEIVSM
EAPHNPDEEE AYLQEAIRQS LRDDTPASTS RGAPTNPTSD LLDFSSPPAG APTQTYSPST
NDLFSPQASA TRKQQHVCLA RFYYVGSLGS SGSGSTTDPW GASAPATHYG YGTPAAGPLP
ALTGPAPAAT GYGGYNTDPV QAPYGAPALY PPPAQSYAPA PAQYGGPEQT HAPDPVQAPY
QAPVQSPYQA PPPASAWQVP AAIATQPPYG QDPSIPPSVT PQAQATPSTI GFSSPPPDFS
GFSSAPQASE PAQAPSSDPV VFSMNALSGE QNGLVDSNST AQSASLVDQA YSKLVNMDTF
SLVSKNDEAR SNPFDMGSTT VGGNVPLAQM SKHKSQTAPK KEVMRSPAPP PGSMIVASNH
NGNWGGQYGQ PQQPDMQQAY GQQQSPMQQQ PQYGQQQPPM QPQPQYGQQQ PPMQPQPQYG
QQQPPMQQPG QLGQHGQQHF GQTNQMQYGQ AQQPPAQPGY NYF