Gene PHATRDRAFT_31533 details


Gene Information

Locus tag: PHATRDRAFT_31533
Symbol:
ID: 7196074
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 304048
End bp: 305768
Gene Length: 1721 bp
Protein Length: 516 aa
Translation table:
GC content: 54%
IMG OID:
Product: predicted protein
Protein accession: XP_002177063
Protein GI: 219110623
COG category:
COG ID:
TIGRFAM ID:
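
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene span runs from the start codon through the stop codon. The function name span_length is hypothetical and not part of the record.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end_bp - start_bp + 1

# Values copied from the record above.
gene_len = span_length(304048, 305768)

# 516 codons for the protein plus one stop codon, three bases each.
coding_len = (516 + 1) * 3

# Bases in the gene span that are not accounted for by coding sequence.
# For a spliced gene these would be intronic, under the assumptions above.
noncoding_len = gene_len - coding_len

print(gene_len, coding_len, noncoding_len)
```

Under those assumptions the span length matches the listed gene length, and the gap between the span and the coding length is consistent with the gene being marked as spliced.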


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0011992
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTCCTGC TGGACGGACG CACCATCAAG TCCACTCACG TCACGGAACT CGACGTTCCC 
GATCTCCCGT TAGCCGCCAG AACCGCTCAC ATTTTCCCAG GGCTCACAAA CGGTTCACTA
ATTTCAATCG GACAGCTTTG CGACCATGGC TGCATAGCCA CATTCACTTC TGACGCCGTC
ACCATAACTC TCGACAAAAA AGTCATCCTC CGGGGCGATC GCTCGGCCCC TAATCGACTG
TGGACTCTCC ATGCACCCAG CTCGACCCCT CCTACCAAGC CGCTTCCTTC ACCAATTTTT
CCTGTCGCCA ACAACGTCAA ACACTCTTCT TTGCTTGCCG ATCGGATTGC CTTCCTACAT
GCATCCCTAT TCTCGCCTCA GTTGTCAACA TGGTGTAAGG CCATCAACGA AGGCCGCCTC
ACCACTTTCC CTCAAATTTC TTCGGCCCAG ATAAAACGCC ACCCCCCTTG ATCCGCTCCC
ATGCACAAAG GCCACTTGGA CCAACAGCGA GCCAACCTTC GCTCTACTCA ACCTACAGCT
GTCGCTTTCT CGGCCACAGA CTCAATCGTC GACAACCTCG ACGAAAATCC AGTTCCAGAC
GACCCTCCAG CTCTCAAATC CAATTTTTTG TACGCCGATT GCTACGAAAC AACAGGAAAA
ATCTTTTCGG ACCTCACAGG CCGTTTCGTC ACCTCTTCCA GCACCGGCAA CGCGTACATG
CTGGTCGTAT ATGATTACGA TAGCAATTTC ATCCATGTCG AACCGATGAA AAATCGTACC
GGAACCGAAA TCCTCGCGGC TTATCGGCGC GCTTTCGACC TCTTTTCATC TCGAGGCCTG
CGACCCCAGC TCCAACGGTT GGACAACGAG GCATCTACTG CCCTTCAGCA ATTCATGGAT
GACTCTAGAG TTGACTTTCA GTTAGTACCT CCTCATTTGC ACCGTCGCAA CGTTGCCGAA
CGAGCGATCC GCACATTCAA AAATCATTTC ATTGCCGGGC TTTGCAGCAC TGACAAAGAT
TTTCCTCTTC ACCTTTGGGA CAGGCTACTC CCCCAAGCAA TCATGACCTT GAACCTTCTG
CGTGGCTCGC AAATTAATCC TAGACTCTCG GCGTGGGCAC AGGTCCACGG CGCCTTTGAC
TTTAATCGCA CTCCGTTGGC GCCACCCGGC GTAAAAGTTC TCGTTCACGA GAAGCCTACT
GTACGCAAAT CGTGGTCTCC CCATGCCGTC GACGGCTGGT ACATCGGACC TGCCATGCAT
CACTACCGAT GCTACCGTGT TTGGATCAAT AGCACCACCA GCGAACGCAT CGCCGACACT
TTAACTTGGT TTCCAAGCAA AGTACAGATG CCGACCACTT CGTCACGAGA CACCGTGGTA
GCTGCCGCCC GCAACCTCGC CACAGCCCTA TCAAATCCGA CTCCAGCTTC TCCACTTGCA
CCGCTTGCCA CTCAGGAACG CGTTGCTTTG CAGCAATTGT CGACTATTTT TTCGAATTTT
TCTGATCCCA CAAGCCCACC GGCAGCAATT TCCCCGTCTG TTACCTCCGT ACCCCGCGCA
GCCCCTGCTG CCGTGCCACC GCGAGTCCAA TTCAAGGATT TGCCCACCGC GCCACTTCCG
AGGGTGCCAC CAAGGTCCAC CGGCCCAGCC AATTCCCAAT CACTTCCGAG GGTGCCCGTT
TTTGCTCCCG CGACGGAAAC ATACAAGTTA GTCACCTGTA A
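
The gene sequence is laid out in space-separated blocks of 10 bases, so it needs to be joined before any per-base calculation such as GC content. The following is a minimal sketch of one way to do that; the helper name gc_content is hypothetical, and the example input is only the first line of the sequence, not the full gene.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)

# First line of the gene sequence, copied as displayed (six blocks of 10 bases).
first_line = "ATGGTCCTGC TGGACGGACG CACCATCAAG TCCACTCACG TCACGGAACT CGACGTTCCC"
first_line_length = len("".join(first_line.split()))

print(first_line_length, f"{gc_content(first_line):.1%}")
```

Passing the full sequence text in the same way would give a value to compare with the GC content field in the gene information table.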
 
Protein sequence
MVLLDGRTIK STHVTELDVP DLPLAARTAH IFPGLTNGSL ISIGQLCDHG CIATFTSDAV 
TITLDKKVIL RGDRSAPNRL WTLHAPSSTP PTKPLPSPIF PVANNVKHSS LLADRIAFLH
ASLFSPQLST WSVAFSATDS IVDNLDENPV PDDPPALKSN FLYADCYETT GKIFSDLTGR
FVTSSSTGNA YMLVVYDYDS NFIHVEPMKN RTGTEILAAY RRAFDLFSSR GLRPQLQRLD
NEASTALQQF MDDSRVDFQL VPPHLHRRNV AERAIRTFKN HFIAGLCSTD KDFPLHLWDR
LLPQAIMTLN LLRGSQINPR LSAWAQVHGA FDFNRTPLAP PGVKVLVHEK PTVRKSWSPH
AVDGWYIGPA MHHYRCYRVW INSTTSERIA DTLTWFPSKV QMPTTSSRDT VVAAARNLAT
ALSNPTPASP LAPLATQERV ALQQLSTIFS NFSDPTSPPA AISPPCCRAT ASPIQGFAHR
ATSEGATKVH RPSQFPITSE GARFCSRDGN IQVSHL