Gene PHATRDRAFT_48747 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_48747 
Symbol 
ID7195005 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011687 
Strand
Start bp133228 
End bp135692 
Gene Length2465 bp 
Protein Length667 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183428 
Protein GI219126362 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTAAAAAAAT TGTGGAAAAA CTACACACAC TCCCTTTTTC CTTCGACTAG TTTGGCCTAT 
TTACATTAGT ACTCAAACTA CTTAGCAGGC AAAATACCCT TCCTGGATTC TACGACACTT
TCATTAATCC CAATGAGTGC AGCCTTTAGT CACCGTGCCA ACAAACGCCG CCGCAAGCGG
ACGCATCCGT TCCACCACGC CACGGCGGCG GAGGAACGCT TTCTCCAACA GGCCATCCAA
AATTCCAAGC TCGATCAGGG ACGGGACGGA ACGCTCGAAG TGCCCTGGGC ACCCACCTTT
TATCCCACCG TACAAGATTT CGAAGGCAAC ATGATTCATT TCGTCGAAAA GATTCGGCCT
GTGGCGGAAC GCTACGGGAT CTGCAAGATC GTGCCTCCGG ACGGATGGAA TCCTCCCTGT
CGTAAGTAAT CAATCCCGTT TGGAGCGTGT CCAGGTTTGG CTGGGAACGG CATACTCGAT
CGACTGGAAC GTGACCATGG ACGGTACTCG CGCTTGCCTG AACACTTATT TACAATATGT
ACACACACAC ACACACACAC TTGACCTTGA TACCTGTTTT GATTTTGGTC CCGCGAATTA
CGGCTAATGC CTCGATAGGA ACTAGTTTAC TGCTAAACCG CGAAATTAGC GGGTAGCTCC
CAACGGATTT TTTTTTCTCA CGGCCTTTTT TTCTTTGCGT ACAGAGGTGG ATCGCAATAC
GAGAAAGAAA TTTCAAACCA AGCGACAGTT GCTACATCGT CTCCAAGAAG GAATCAGTTT
CGACGATGGC GTCGACTATA CACCAAAGGA GTACCAACGC ATGGCCAGTG AACGGACCCA
GGAATGGAAG GCTCTCAACT ACCCTGATCA CGATCTACTC TCCCGGCACG CGGACGTAGT
CCAGGAGGAT GCCCAGCGCG CCAAGCTCTT TCGTCCGGAA AACCTGGAGC GGGATTACTG
GGACATTGTG GAAACCCATA CACGCCCGGT TACGGTGGAC TACGGGAACG ATGTCGACAC
GGAAGAGTTC GGGTCGGGCT TTCCTCTTTC GCAGCGCGGA CGGTCCGTGT ACGGCACCAA
GAAACTGGAA AAGATGGATC TACCGGAACC TACATTTGGT AGCGAAGATT ACTACAAAGA
AACGTGGTGG AATCTCAATA ACATTCCCTG CGCACCGGAT AGCGTGCTAC GCCACGTCAA
GGTTGGTATC AACGGAATCA ATGTTCCCTG GATGTATTAC GGATCACTAT TTACCACGTT
TTGCTGGCAC AACGAGGACA ATTACCTCTA CAGCATAAAT TACAATCACC GTGGCGCCCC
CAAACTGTGG TACGGCGTAC CTGGACAGAG TAAACAAACT GCGGATGGTT TGGAGAAAGT
GTTCAAGAGC TTTTTGTCCA TGAAGATGCG TGATGTACCG GATTTGCTCC ACCACATCAC
TACCATGTTC AGTCCTAGAC TCTTGCAGAA TGCGCTGGTC CCCGTCTACA AGCTTCTACA
GCACGAAGGT GAATTCATCA TCACCTTTCC CCGGGCTTTT CATGGGGGGT TTAGCCTAGG
TCCGAACGTG GGCGAAGCGG TCAACTTTGC CACTCACGAC TGGATTGCCT ACGGTTCGGA
TGCGAACGAG CGGTACCGTT CCTTCGCTCG TCCGGCCGTC TTTTCACACG ATCGCCTGAC
CTTTACTATG GCCAATCATC TACAAGAACA AAAAGCATAT TCCACTTGCA AGCTGCTCTT
GATTGAACTG AAACGTGTGG TCGAGGAAGA GTTGCGTTTG CGGGCCAAGC TACTGGGGGA
GGGTGTCCGG GATGTGTCCA AGATTATATC TTTGCCGAAG AATCGTCTCG ATCAGTTGGA
CGAAAATAGC GCCAACTACG ACGACAAACG TTTGTGTCAC GGCTGCAAAC ATGTATGCTT
CTTCTCGGCC GTTGCCTGCG AGTGCAGTCA ATCAAAAGTG AGCTGTCTGC GACACAGTCA
CTACATGTGT CGGTGTGCGA CGGAGCGCAA ATACTTCATG ATTTGGAGCG ATGACGAGGA
GCTCAAATCG ACGATGGAGC GGGTACGCAA TCACTGCGAG GTACTCAAGA TCAAGGAAGG
ATGCACTGAC GAAGCGTTAG CGCAGTGCAA AGATCTTTCC GCCAGTCAAG AACCTCTTCC
CACAATGGCC CCTGGCGCTG AGCGGGACTT GGCGATTCAC AAAAACCATG AAATTTCAAC
TGCGGAGTTC TTGACCGAGA CGTACCGTTT CAACCCTCCA ATGAGTGCCA GCTTCAAGGA
AGAATCAAGG TCGGTGGCGA CGACTGTTGA TTCCGATGCA TCTTCAGGTT GCATGATTGA
TGAAGTGGCC TTTGCCGAAG CGGACGAAAA CGAGATTGAA GTTGTGGGCG TTCGCGGGGG
CGTAGGTCCG ACTGTCTAGT TGTCTCGATC AGTAAAAAAT AGGTCGTGTA TATCACGCTG
GGGAC
 
Protein sequence
MSAAFSHRAN KRRRKRTHPF HHATAAEERF LQQAIQNSKL DQGRDGTLEV PWAPTFYPTV 
QDFEGNMIHF VEKIRPVAER YGICKIVPPD GWNPPCQVDR NTRKKFQTKR QLLHRLQEGI
SFDDGVDYTP KEYQRMASER TQEWKALNYP DHDLLSRHAD VVQEDAQRAK LFRPENLERD
YWDIVETHTR PVTVDYGNDV DTEEFGSGFP LSQRGRSVYG TKKLEKMDLP EPTFGSEDYY
KETWWNLNNI PCAPDSVLRH VKVGINGINV PWMYYGSLFT TFCWHNEDNY LYSINYNHRG
APKLWYGVPG QSKQTADGLE KVFKSFLSMK MRDVPDLLHH ITTMFSPRLL QNALVPVYKL
LQHEGEFIIT FPRAFHGGFS LGPNVGEAVN FATHDWIAYG SDANERYRSF ARPAVFSHDR
LTFTMANHLQ EQKAYSTCKL LLIELKRVVE EELRLRAKLL GEGVRDVSKI ISLPKNRLDQ
LDENSANYDD KRLCHGCKHV CFFSAVACEC SQSKVSCLRH SHYMCRCATE RKYFMIWSDD
EELKSTMERV RNHCEVLKIK EGCTDEALAQ CKDLSASQEP LPTMAPGAER DLAIHKNHEI
STAEFLTETY RFNPPMSASF KEESRSVATT VDSDASSGCM IDEVAFAEAD ENEIEVVGVR
GGVGPTV