Gene PHATRDRAFT_42557 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42557 
Symbol 
ID7196096 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp413686 
End bp415771 
Gene Length2086 bp 
Protein Length692 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002176580 
Protein GI219109652 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGGTGG ATTCTTCCGT CGCTTCTTTC GACGACACGA CCCGAACCGA TGGTTCCTCT 
TTTGAAGCCG ACGATGTCTT GGCTTTTCTG GAAAACAATT TACAAAACGT TCGTGACCTG
TGTGTCTCGA AACTTTGGAA CGACGACCGC CTCGCTGATT TAGACGATTT CTTACGGAGC
CATCACGCTG CTCTGCACCT ACAAGCCTTG CGCTTACCCA ACAACGGACT TACATCCCGT
TCCTCGGTTA ACCTTGCCAA CATTCTTTCC ACAACGCAAA CCCTGCGCGA ACTCGATTTG
TCCGACAATC AAGTGGAGTC CCAGGGACTG CTGGCGCTGT TGCCGGCGTT GACGCACGAA
ACCTGCGCAC TCCGGCGCCT CGACTTGTAC AATAATAAAC TCGGCGCCAC CGGGGCCACA
CAAATTGCCG CCATTCTACG GGACAACCGA TCTCTTCGCG AACTCCGCAT TGGCAAAAAC
AATCTCGGCC GGAAAAAGTC CCTCAAAGTA ATTTCCACGG CGCTGCAACG GAACGCAACC
CTGCGAACGC TTGATCTGTC GCACAACCAA ATTGATGACG GCGGCGCCAT TTTATTGGCG
CCCGTTTTGG ATCCGGAAGT CTCACAGTCA CGTTTGCGCC GCTTGGACTT GACCTACAAC
AAAATTTGGC CAGAGGGTGT CCGAAACCTT ACCGGAGCCC TGCTGGAAGG CAACCGGACC
TTGCGATGTT TGAACTTGAG TATGAATCAC GTTGGACCCG AGGGGGCGGA GTCATTGGCG
GTCCTATTGA AGTTCTCCTT CACGTTGCAG GAACTTTTAC TGTCGCGCAA CGCTTTGGGA
GACCATGGCG TCAAATTATT GTGCCAAGGG CTAGACGAGA GTAAATTGTT GAGTGGGACA
GGCTTGCAAA GATTGGATTT GGACTGGAAC GAGATACACG ACGACGGAGC CAAGGAATTG
GCGACAATGC TGCTAGACAA CGCTATACTG GAGTCCCTCA ACTTGGCGAG TAACGCTATC
GGTAGCGATG GAGCCAAGGC TCTAGCGAAT GCTCTGCACT CCAATCAAGC TTTGACATTT
TTGAATCTAA TGGGAAACCA AATTCGAGAT CCTGGTGCGT TCTCTCTAGC CGAGAACCTT
TGCCGCCCGT CGTGTCGAGT GGAAACGTTG CTGTGGGAAA AGAACAATTG TTTGACGCCT
TTGGGAGAAG AGCGACTCAT CGCGGCGTTT GACTTTCGGA AGAACCGGAG AACGTGGCTA
GGTCAGATAC TTCGTGAAAT AGAAACATGC CAAAGTGTCA ATTTCAATTT GTTGTCGTGC
AAACTCAGCG ACGAGGAAAT TATGGCGTTA GCGAAACATC TCGCTCAGTA CCGGCCTCGA
GTTTCGACCG CGTATCTGGG TGGACACGGC GTAACAGTTC GAAGCATGAA AGTTTTGGCC
AAGGACGTGC TTGCCAACAA CCACGTCAAT CTTCAACGGT TACACTTACA GCATACTCGT
GTTGGGGATG AAGGGGCAGG AGCATTGGCG GAAGCACTGC TGTCTAACTC CAATTTGCGA
ACTTTGACAT TATTCGATTG TAGTATAAGC CCAGAAGGAG CAAAGTTGTT GGCGCATACG
TTGGCTCAGA ACAAGTCTTT GACACAACTA AATCTTCACA AGAATGCAAT CGGGAACCGA
GGGGCACAGG AGCTTTTTAC GGCTTTAGTT GACCCACCGC ATCCCTCACT GGTTGTGTTG
AATTTAGAAC AGAACGAAAT TAGCGACGGT GCACTCTTGC AGTTCCAATC GTTTGGCAGA
CTGCAGCAGC TGAACATTGC TTCCAACAAT TTCACAGACC GTGCCGCCTT GGATTTGGCC
AAGGCATGTT TCAACTCTTT GGCCAACGGC ACCCTTCAGC TAAGCTGGCT GACGGTGTCG
AACAATTTTA TCTCAAAGAA AGGCTTGAAG GCCTTGGCAT TATTTCTTCC GGACGGGTTA
GTCCTCGAAA ATGATGGCCA ATTAGAAGCG CAAACGGTAT TACCAATTAG AAGCGCAAAC
GGTATTGACG CGTTGTACAA CAAAGCTTCT CTTAGCTAGC AATAAG
 
Protein sequence
MAVDSSVASF DDTTRTDGSS FEADDVLAFL ENNLQNVRDL CVSKLWNDDR LADLDDFLRS 
HHAALHLQAL RLPNNGLTSR SSVNLANILS TTQTLRELDL SDNQVESQGL LALLPALTHE
TCALRRLDLY NNKLGATGAT QIAAILRDNR SLRELRIGKN NLGRKKSLKV ISTALQRNAT
LRTLDLSHNQ IDDGGAILLA PVLDPEVSQS RLRRLDLTYN KIWPEGVRNL TGALLEGNRT
LRCLNLSMNH VGPEGAESLA VLLKFSFTLQ ELLLSRNALG DHGVKLLCQG LDESKLLSGT
GLQRLDLDWN EIHDDGAKEL ATMLLDNAIL ESLNLASNAI GSDGAKALAN ALHSNQALTF
LNLMGNQIRD PGAFSLAENL CRPSCRVETL LWEKNNCLTP LGEERLIAAF DFRKNRRTWL
GQILREIETC QSVNFNLLSC KLSDEEIMAL AKHLAQYRPR VSTAYLGGHG VTVRSMKVLA
KDVLANNHVN LQRLHLQHTR VGDEGAGALA EALLSNSNLR TLTLFDCSIS PEGAKLLAHT
LAQNKSLTQL NLHKNAIGNR GAQELFTALV DPPHPSLVVL NLEQNEISDG ALLQFQSFGR
LQQLNIASNN FTDRAALDLA KACFNSLANG TLQLSWLTVS NNFISKKGLK ALALFLPDGL
VLENDGQLEA QTVLPIRSAN GIDALYNKAS LS