Gene PHATRDRAFT_49763 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49763 
Symbol 
ID7198346 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011692 
Strand
Start bp201211 
End bp203325 
Gene Length2115 bp 
Protein Length594 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184505 
Protein GI219128617 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCACGA TTCCCACATA TATTCCATCG GAGAGGAATT TTGAATTGAC TCGATGCCCC 
AGAAACGCTG TCTTTCTTGG CTTGGTTTGA ATCCCCGGAA AACATTCATG GAACCGCTTG
CGTAACGAAA ACCTTCTTGT CATTTGTGCT GGCATTCCTG ACGCTGATCA TCGGCCCCAC
AATGCAGACT GGTCTTTCAG CTATAACTCC TTTCCGGTGT GATCATGGAG CGTTGGTGCG
ACCATTCCAC TTACGGTTTG GAGGAGGACA ATGGAAATCG ATCCAGTACA CTAGAAGGTG
GTGAATCGGA ACGACCCAGC ATATCAGAGG GTGCAGAATC GGAACATTTT CCGTTCTTTC
ACGATACCTT GACTTTTGCA AGAATCACGG AGGGGATGGG ACATATATTA CCTTACGCTT
CCCAATCTGG ACAGGTAGCA GGAGAAGAAC AAAGCGCGTA TTCTCTATCC CCATACGTCC
CGTATCCTCC ACGTAATTAC GTACGGAGAG ATTGCGAGGA AGACGACGTC GGGAGAAGAC
CTAGGGTTCG ACGATGCCTG GATTTAAACA AAGGAGTGAA CCTCACTAGT CCCAAGCGCT
TCAGTATTTC GCAAGACCCA TTTCTATCAA AAAATCATCC ATCGTCTCTT TCGGACGGCG
AAGAGACAAG TCAGCATTCG AGAAAAGCAG CAGGCATTGC TTTGGAAGAG GACGCGGTAG
GAGATATCTT TTCGAGCCGC AGCATTCGTG CTCAATACGG GTCTGATCCG ATGCAAATGA
AAGTTCGATC CCCCCGATAT CTTCCAGAGC CGAAATCTCC ACTGGTACAC TCACAGCATT
CAACTCCGTC ATCACATCGT CATCCTGGAA GTCACCTCTA CCAAAGCCCA GGTTTTTCAG
GGAACTTTGC ATACTCTTCC TATCAAATGG CCTCTCCCCA CAGCTATTTC CAGAGACGAT
ATCCCATGGT AGCTTCGCAT CCTTTCAGTG ACAACACAGG CACGCCAGGT ATTTTCATGT
CCCAATTGCC ATGTCCTGTT TACTCGCCAC CGCAATACCC TCCACAACCA TTTATACCCC
AACAATCTGT TCACAGCGAC ACTCCTGTAG ATGCCAATAG GATGGATAAT TCTGGTTGCA
CGGCCACGGG ATCATCTCCA AATAGGACTC TTCCTTTCCC AAACCCGAGC TTGAGCCTAG
CTATTCCACC TTCCCCTATA CGATTGCGCC AGAAACCGCC CGGTAAATTG AATCCGGTGC
GTCGATCCGC CCGTTCAGAA TCCAAAATCC GGGAAGTGCG GACCCGGACG AGCTCTGGCG
AATCGACGTC GCAAGCTGCG GGCTTGTGCG CAGCGGAAGT GGCGACGGCA GGTAGTGTGC
GGGCGAAGGC AGCTATTGTG ACATGGTACG ATCGACTGGA CGATTTACGT CGGTTCCGGA
AAGAGTTTGG CGATTGCAAC GTGCCTCAAA AATATGAACC AAATCGAGCT CTGGGTATTT
GGTAAGTCTT TCGCAATAAG GACAGATGCA TGGTTGTTTT CCTTGTGCTT TCAGCTACGG
CCTTTAACAT CTCTATTGAT GCTGCGTACG TTTCTGTGTG TTTACAGGGT CAACAAGCAG
CGGATGGAAA AGAAGAAGCT CGACAGGGGC GAACGATCAT CCATGACTAC GGAACGACTG
CAGGCGCTGC AGAGCGTCGG GTTTCAGTGG GCCAAGCTTA AGGGCGATGT TTCTTGGAAC
CAGAAGTACA CAGAATTGCT GGAATACAGA TCCGTGTTCG GTGACTGTAA CGTGCCTACC
AAGTACCGCA CCAATCCGGC GCTGGGACGC TGGGTTTCGA CTCAGCGATC GCAATTCAAA
GAATTCCAGG CCGGTCTGGT AACGCATATA ACTGATCAGC GAATTTCCCA CCTGGAGAAG
ATAGGCTTTC GGTGGAGCAT GATGGAGGAG GAGGAAGAGA ACAACTGCAC AAATGAAAAT
TCGCTGAGGG ATGGTAGCGA AGCAGATGCC ATCTTTTCAA GGTCAATGCG AGTGGAGAAA
GTAAAGCGTT GGCAATACGA CAAATCCAGA ACAAGTACCC GCCACTCTTC CATCAATCGG
GTTACTAGTG TGTGA
 
Protein sequence
MERWCDHSTY GLEEDNGNRS STLEGGESER PSISEGAESE HFPFFHDTLT FARITEGMGH 
ILPYASQSGQ VAGEEQSAYS LSPYVPYPPR NYVRRDCEED DVGRRPRVRR CLDLNKGVNL
TSPKRFSISQ DPFLSKNHPS SLSDGEETSQ HSRKAAGIAL EEDAVGDIFS SRSIRAQYGS
DPMQMKVRSP RYLPEPKSPL VHSQHSTPSS HRHPGSHLYQ SPGFSGNFAY SSYQMASPHS
YFQRRYPMVA SHPFSDNTGT PGIFMSQLPC PVYSPPQYPP QPFIPQQSVH SDTPVDANRM
DNSGCTATGS SPNRTLPFPN PSLSLAIPPS PIRLRQKPPG KLNPVRRSAR SESKIREVRT
RTSSGESTSQ AAGLCAAEVA TAGSVRAKAA IVTWYDRLDD LRRFRKEFGD CNVPQKYEPN
RALGIWVNKQ RMEKKKLDRG ERSSMTTERL QALQSVGFQW AKLKGDVSWN QKYTELLEYR
SVFGDCNVPT KYRTNPALGR WVSTQRSQFK EFQAGLVTHI TDQRISHLEK IGFRWSMMEE
EEENNCTNEN SLRDGSEADA IFSRSMRVEK VKRWQYDKSR TSTRHSSINR VTSV