Gene PHATRDRAFT_53961 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_53961 
Symbol 
ID7196307 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp675017 
End bp677388 
Gene Length2372 bp 
Protein Length668 aa 
Translation table 
GC content49% 
IMG OID 
ProductRTX toxins and related Ca2+-binding protein 
Protein accessionXP_002176627 
Protein GI219109747 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGTAT CGATGAACGC AACTCCCGAA GAAGAGGAGA CGCCGCCGCC AAAGTGGAGG 
AGAAAGGAAG AGGAGGAAGA GGACGTTGGA TGCCAGCCTG TTGGCTCATG CCCTCATGGA
TCCGTCGGGA TCATGGCGAT CCTTGTGTCA ACTGGTCTTT TCTGTTTGTC CGCTGCCGCA
GCTGGTAGTT GTACCTTTGT TCTCGTCGAC ACCGTCGAAA GAAATGGTTT AAGCTTCGAA
GATCGTAGAA TAGGGCTGTA TCGGTTTGAA GATAAGCGTA CAGACTCCTC CTGTTTATTT
TGGACGTCTG GTGAGAACGC CGATCGTGTG TACAATAGTC ACTGGAGTGC TGCTCGTGCC
CTTGTCTGGG CCGCCTTGAT CCTAACGCTC GTGAGCGCAG TCTTTCTATG CAGTGCCAGC
TGCTACGCGC ATCCCAGAAA GCTCTTCCAA TCGCTCTCTT TTTTATTTAT TTTCAACTCT
ACGCTGTTTG GGATGTCCTT CATCGTCTTT GCGTCAGACA TTTGTCAAGA AGCTGGATCT
TGTGCCATGA GTACTGGAAG TATAATGATG GTTGCAGTTG CGGTTCTCTG GTTATTGACG
TCCTTACTGC TGCTCTATAT TCCTTCTTAT TATAAGAACC CGCGCTCTCC AAGGGAGAAG
CCTCGGATTC AATTTAATTC GACACAAAAA ATCTGGTGTA TGGCTGCCTT GATTGCTGTC
TTACTTATCG GACTAGTCAA CGGTTTAATA ATTGGACAAA GCGATGGTTT CTCTGAGAGG
AATGGAACTG ATGTTGTCTC ATCACCAGTC CCGGAGATCT CCAGCCCTCC GCCAAAAAGT
TTTCCAGGCT CCTGGGATAC TATTGCCATT GTTCCTGGCG GTTACCAAGG GAACGCCATT
TCACTGGCTC CTGGTGGGCG GGCAATCGCG GTGTCTACTT CGTACAATCC AGGCCGTCTC
TCGTTTTTCT ATCAACCTGA AAATAGTCTT ACGTGGACAG TCCTGGGTGA GACGGGAGGA
CACATCGGAC CGCCGAGTTT CGGTAGGTCC CTCGCGCTTT CCGATGTTGA CGTTTATGCT
GTCGGTACTC CAGATTTTGC TGTTGATGGA GTAGCATTTG GACGAGTGGA TGTTTGGTTC
TATGATCGTG ACGCAGAAGC TTGGTTGATA GACGGTTTAC TGATTGGCAA CAAGCCAAAC
TTTCATTTCG GCAGCGATGT GGCGATTACG GCTGGTGCCG ACTATGCGGC AATTACTTCA
ACTTCTATGG AAGACGGCGC ATCGGCGGTC CAAGGCTATG CATATAGCGA AGAGTTGTCG
TGGGTTCCTA TCGGCCAGGA AATTTCCCTT CTCCAGTCCA GTCCGGCAAC AAGTGGTTGG
AATGCGACTT CGTCATTGAT TATTTCTCCA TCGACTGGGG TGGTCACATT GTCGATCGGA
ATTCCTATTC AGGATATTGG AGGGATGGTT ATCGTGTGGG ATTACCAGCC TGCAAATGAT
GTATGGGTAC AGAGGGGCTC GACTATTGAT GCGAATGCTG TCGCCAGCTC AGACGACGGC
GACGACTTTG GATATTCTGT TGCTTTAAGT GAAGATGGCA ACGTTCTCGC CGTGGGTGCC
CCCCAGGGCG GAAACAAATC GACGGGTAGC GGTGGTCACG TACGAATCTT CTCATTCCAA
CCAGGGACAT GGCAGCAAGT TGGGCAAGAT TTGACTTGTG GATTGAACGC AAGACGTTGT
GGGGAATCCG TTAAACTGAC ATTTGATGGA AAGATGGTTG TGATTGGAGA CTCTGGGTTT
GATGGTGGTC GTGGACGCGT TATTGTTTAT CAGATTGACG ATTTTGCGGG CGAGTGGTAT
CAGTTCGGAC CCGTCATTAA CGGGGAATCT TCTGGTGGTG TTGGCGCAAA AGTTTCGATT
TCTCGTTTTG GAGAACTTGT GGCCTACACT GATGCTACCG AGGGCTCCAG AAAGGGCGTG
TTGGTAGAAT ACAACCCAGA GGAATAGCTT CAATTAGTCA TTGTCCATCG ATGTGACGGT
CTGCAGAATT TATGTGCGTA CTTTTTTGTC GGCGCTGCTA CTTTCTGATC TTATTGCGCA
CGCAATTGCT TTACGGTATC TGTATCGTTG GTGAGTTGGA ACTCTGCTGT GGTCCTTCAC
AGATGCATAC CGATCAACTG TATAGACATG ATTGTTCCCC CAGAAAGCGA GGCAATGCTA
GTATCGAGTG GCTAGCCATG TGGGTCCGAC ATTCCCGAAG ACTCGACAAA TGACGCACTA
TTTGCCTTAA TGAAGTTCGA GCTGCAAGAT TATGTGTGTG TTTTCCATTG GATTGACACA
TATGTGATGC CGATTAAGGC CTTCACTGAT GC
 
Protein sequence
MAVSMNATPE EEETPPPKWR RKEEEEEDVG CQPVGSCPHG SVGIMAILVS TGLFCLSAAA 
AGSCTFVLVD TVERNGLSFE DRRIGLYRFE DKRTDSSCLF WTSGENADRV YNSHWSAARA
LVWAALILTL VSAVFLCSAS CYAHPRKLFQ SLSFLFIFNS TLFGMSFIVF ASDICQEAGS
CAMSTGSIMM VAVAVLWLLT SLLLLYIPSY YKNPRSPREK PRIQFNSTQK IWCMAALIAV
LLIGLVNGLI IGQSDGFSER NGTDVVSSPV PEISSPPPKS FPGSWDTIAI VPGGYQGNAI
SLAPGGRAIA VSTSYNPGRL SFFYQPENSL TWTVLGETGG HIGPPSFGRS LALSDVDVYA
VGTPDFAVDG VAFGRVDVWF YDRDAEAWLI DGLLIGNKPN FHFGSDVAIT AGADYAAITS
TSMEDGASAV QGYAYSEELS WVPIGQEISL LQSSPATSGW NATSSLIISP STGVVTLSIG
IPIQDIGGMV IVWDYQPAND VWVQRGSTID ANAVASSDDG DDFGYSVALS EDGNVLAVGA
PQGGNKSTGS GGHVRIFSFQ PGTWQQVGQD LTCGLNARRC GESVKLTFDG KMVVIGDSGF
DGGRGRVIVY QIDDFAGEWY QFGPVINGES SGGVGAKVSI SRFGELVAYT DATEGSRKGV
LVEYNPEE