Gene PHATRDRAFT_47347 details


Gene Information

Locus tag: PHATRDRAFT_47347
Symbol:
ID: 7202507
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011681
Strand:
Start bp: 363118
End bp: 364617
Gene Length: 1500 bp
Protein Length: 499 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002181538
Protein GI: 219122409
COG category:
COG ID:
TIGRFAM ID:
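The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a stop codon (the gene sequence in this record ends in TAG). The function names are hypothetical and not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length_bp = feature_length(363118, 364617)
print(length_bp, expected_protein_length(length_bp))
```

With the values from this record the sketch prints `1500 499`, which agrees with the Gene Length and Protein Length fields.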


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAAGAC AAAAACTCAC AGACAGAGAA ACCTCTCTAG CCCGGCAACA CGAGAAGCGA 
ATTGGAACGT CAGGTGCAAA TGAAAGTGGA ATGAATAGTG CCTTGGAAGG GGCCATTATC
TCGCCACAGA GCCGGGTGAT CGGCCATTTT ACGTCGTTGC TCTCTAGTGC CTTGAATGTA
TCACGGAGCA CCGGCACCGT AAAGCAGCCG CTTCAGTCGG TATCTGAAAA GCTCCCTTTC
ATTTTCTCCC GATCGATCGA CGCTGCTGCC CTATGTTGTA GTGTGCCATC GCTGCAACTG
AAACAAAAAG ACCCGACCTT TTTCCCGCGT TCTTCAAAAC CGCTGGTAGA AGACGCCGCT
GCTTTGAATC TTGGAACACC CTTACGTGTT CACCGATCCG ATTTGAAAAG TTTGCCAATC
ACATTATTGT GGAATCTCAG TCGATCTTTC CTATCCTTGG TAGATTCACG ATTGCGGTCT
TCACAAACCG CTTTGGTAAG GCAAAGTCGA AGTAGGCATC GGGAAGATGA CGCACATTCC
CGCGTCCTCG TTGGTCTTTT GGCTGCGTCT TCTACTCCAA TCAATCCCAC AGCTGTCGTC
ACAACTTTTC GTGCTCTGGC TTTCTCCGAG CGTGTCGACG AAGGTGACTA CATTTTACCT
ATTGTTATGG AAGCAGTATT TGATCTCGAT GTTTTGGGCC ATTTCATGAC CGTCACGATT
GAAGCTCCAG GAACCATTCA AGGAAGCTTT GTCGGCAACA ATCACATAGC AGGTCCGGTA
GAGCTGTTAA AAATCGAAGT TCAACTAGAC ACGTCAGCCA TGCTCAAGTC GATGATGACT
GAAGCGCGTT CTGTGGTGCG AAAAGCCCTT GTCGTAGCCA CCGAAATAGC CACTAACCTT
CTTCACTCGA CCCCGTCGAG AGTCTCATAC CACGATACGA CGGATCTGTT GGTGCTGCAA
GGTTCAGGCG AAAGATCTGT GAGAGAGATG CTTCCTAATA GTACCTCTTC AGATACTTCA
GCGAACGGAT CTAGCGCAGC TAACAACAGC GACACCTGTT CCGAGCATCC ACAGATGCTT
CCGCCTCCTG CGCGGGGAAA GAGTGAAGAG CTGTCTCACA AAAGAAGTAC TGAAACTTCG
GACAAAACCA GTGATGCATT CTCCTTGGAG ACAAAGAACG CACCATGGGG CAAGAATAAC
ACCTGTGGCG ACAAAGAAGA CATCGACTTT TTGGAAAATG AGAGTTTTGT GTCTGATTTG
TCCGGAGTGA AACGGGAATG CCTTGACCAA CAATGCCTGG GCGTTGATGA TGACAGTTTA
TCCATAGGAT CTCGAAAGCG GCTTAAGACG TCTCCGTCTG AAAAGAGTCG TACCACGGAA
AATGAAGAGC AGCCGTCAAC AGTAGACCTT CCAGCTGAGA GCTCACGCGA TCAGGACGAT
CGACCATCTC GTCCTAGCAT CGATGTGAAT GAGCTCTGTG TCCGGATTGC AAAGGTATAG
 
Protein sequence
MRRQKLTDRE TSLARQHEKR IGTSGANESG MNSALEGAII SPQSRVIGHF TSLLSSALNV 
SRSTGTVKQP LQSVSEKLPF IFSRSIDAAA LCCSVPSLQL KQKDPTFFPR SSKPLVEDAA
ALNLGTPLRV HRSDLKSLPI TLLWNLSRSF LSLVDSRLRS SQTALVRQSR SRHREDDAHS
RVLVGLLAAS STPINPTAVV TTFRALAFSE RVDEGDYILP IVMEAVFDLD VLGHFMTVTI
EAPGTIQGSF VGNNHIAGPV ELLKIEVQLD TSAMLKSMMT EARSVVRKAL VVATEIATNL
LHSTPSRVSY HDTTDLLVLQ GSGERSVREM LPNSTSSDTS ANGSSAANNS DTCSEHPQML
PPPARGKSEE LSHKRSTETS DKTSDAFSLE TKNAPWGKNN TCGDKEDIDF LENESFVSDL
SGVKRECLDQ QCLGVDDDSL SIGSRKRLKT SPSEKSRTTE NEEQPSTVDL PAESSRDQDD
RPSRPSIDVN ELCVRIAKV
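
The sequences above are printed in blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of how the blocks might be joined, the GC content recomputed, and the gene sequence translated. The Translation table field of this record is blank, so the sketch assumes the standard genetic code (NCBI translation table 1). All function and variable names are hypothetical.

```python
# Standard genetic code (NCBI table 1), assumed because the record leaves
# the translation table blank. Codons are ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X for unknown codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the listing above.
cds_start = join_blocks("ATGAGAAGAC AAAAACTCAC AGACAGAGAA")
print(translate(cds_start))
```

For the first 30 bases the sketch prints `MRRQKLTDRE`, which matches the first block of the protein sequence. Passing the full joined gene sequence to `gc_percent` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.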