Gene PHATRDRAFT_54952 details

Gene Information

Locus tag: PHATRDRAFT_54952
Symbol:
ID: 7195109
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011687
Strand:
Start bp: 278616
End bp: 280186
Gene Length: 1571 bp
Protein Length: 415 aa
Translation table:
GC content: 55%
IMG OID:
Product: predicted protein
Protein accession: XP_002183457
Protein GI: 219126423
COG category:
COG ID:
TIGRFAM ID:
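
The Start bp, End bp, and Gene Length fields above agree with each other if the coordinates are read as 1-based and inclusive. The record does not state its coordinate convention, so that reading is an assumption. A minimal Python check under that assumption (the function name `span_length` is hypothetical):

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Start and end values copied from the record above.
length = span_length(278616, 280186)
print(length)  # 1571, matching the Gene Length field
```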


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.417126
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
AACAAAACAA ACATAACGGT GAGACCGAAC CGAACGACGG GTGTTGTCTA CCTACCTACT 
TACCTACCAC AAGGCTGTTC CCCGTAAACG TTGTCGACCC GTCCAGAGTG TTTCATCGAC
AACTCCTTAC GTATGAGTCG TTCCTGTTTG GTAGTTACGA CGCTCTTGCT GGCGGTGCTC
GTTGCTGTGA GCCTGCCAGG CTCGACGGAT GCCTTTGGCT CGGTCCGGAT CGTCGTTCCA
AGGACCAGTG TTCCCATGCA ATCAACAATA CCGCCGTTAC CTTACCGGAG CAGTTTATCG
TCATCCATGG CCACGGCTGT GAATGGAGGA GACTCGTCCA CTGCTTCGAC GGCGAACGGT
GCACACGATG ATGCGAACAA CAACGAGACG TTCACTCCCA ACCTCAACAT TCGTCTGAAC
GTATCGGAAA AGGCCCGGAC CGTGACGAGT GTGTGCGTCT CCGGGACCCT GTGTACCGTA
TCAGTACACG AGGGTATTGC GGGAGCTCCC TTTGGCTCGT TCGTCGATTA CGTACTGGAC
GATCAAGGTA ATCCGGTCTT GCTCATGAAC GAAATGAGTA TGCACACTAT CAATATTCAA
AACGCGGCAC AAACCCTCCT CGATGCCAGT GGCACAGCCA TTGGACCGGG CCCGTCCATG
GTCACGCTAT TTACCCAGCT CGGTTCCGGG ACGACGTCTC TCAGTCCGCC GCGGACCGCG
GCCGGCGGCG CCAGCGGTAC CGCCAAATCC AACAATCTAC AGGACGTTTC ACGTTGTTCG
TTGACGGGAA CCCTGTACAA AATCGACCCC GCAGTGGATT CGGACGTCGA TGCCATCCGT
ATGCGGTACT CACTGACCCA CACCTACGCC GACCAAGTCA TGGACAGTCC CAAATTTGCC
TTTTACCGAT TGGTACCGGA AAAAATATAC TTTGTGGGCG GCTTTGGCGT CATGGCCAAG
TGGGTGGATC CGGAAGACTA CGCCGCGGCC GCGCCGGATA TTCTGGCCAA GGAAGCCTCC
GCGATCGTGG CCAAGCTCAA CCGTGAACAC GGGGAAGACT TGCAAAACAC CGCCCGGCAT
TTGTTGCGGG TGGAAACCCC GTTGGAAGAC ATCCGCGTCA CCAACGTGGA TCGACTCGGC
GTTGATCTAC GGGTCACGTC CCAAAAGGGA TCCCGACGCA ACAAACTGCA AACGGACGAA
TTCCGTATCG GCTTTCGCAT TCCCGTTATT AGTGTCGAAG ACGCCAAATC AGAAATCCTC
AAGACCTTTC AAGAAGCCTG GGAGATTGGT AACGGTATGG ATTGGGGCGA AGCGAACGGG
AGCGACGGTG CCGCTACCTC GGTGCCCATT CTTAAAATTG CGGCCGACGG TTTGGAATAA
TGCCAGCGCG CAGCCGCCGG CAAGGTAGGG TTATTGCCCT GGTGAGCGCC ATCTAACGAA
AGACGTTCTA CTTCTTTTTT TGGAATGAAC CAACACGGGA TATTGTACTT CGTAGGAGAG
CCATTCGGCA TTGGGACCGT TTGGAATAGG TGGAAACTAG CGTATAGAAT AGGGCCAGGG
TTTGTCGTAG G
 
Protein sequence
MSRSCLVVTT LLLAVLVAVS LPGSTDAFGS VRIVVPRTSV PMQSTIPPLP YRSSLSSSMA 
TAVNGGDSST ASTANGAHDD ANNNETFTPN LNIRLNVSEK ARTVTSVCVS GTLCTVSVHE
GIAGAPFGSF VDYVLDDQGN PVLLMNEMSM HTINIQNAAQ TLLDASGTAI GPGPSMVTLF
TQLGSGTTSL SPPRTAAGGA SGTAKSNNLQ DVSRCSLTGT LYKIDPAVDS DVDAIRMRYS
LTHTYADQVM DSPKFAFYRL VPEKIYFVGG FGVMAKWVDP EDYAAAAPDI LAKEASAIVA
KLNREHGEDL QNTARHLLRV ETPLEDIRVT NVDRLGVDLR VTSQKGSRRN KLQTDEFRIG
FRIPVISVED AKSEILKTFQ EAWEIGNGMD WGEANGSDGA ATSVPILKIA ADGLE
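
Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal Python sketch, not the database's own method. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and it assumes that GC content means the plain fraction of G and C bases in the gene sequence.

```python
def clean_sequence(blocks: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no argument drops spaces and line breaks alike.
    return "".join(blocks.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases among all bases (assumed definition)."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, copied from the record above.
first_line = "AACAAAACAA ACATAACGGT GAGACCGAAC CGAACGACGG GTGTTGTCTA CCTACCTACT"
dna = clean_sequence(first_line)
print(len(dna))  # 60: six blocks of ten bases
print(round(gc_fraction(dna), 2))
```

Passing the full gene and protein sequences through `clean_sequence` gives lengths that can be compared with the Gene Length and Protein Length fields, and `gc_fraction` on the full gene sequence can be compared with the GC content field.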