Gene PHATRDRAFT_44818 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_44818 
Symbol 
ID7199542 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011673 
Strand
Start bp326096 
End bp329251 
Gene Length3156 bp 
Protein Length762 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178978 
Protein GI219116366 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0143602 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGAGTTCTTC TTGCTTGGTG GGTTAACAAT GACCGTATTG GAACGCTGGC GTCGTCGAAA 
AGATCAGAGG AAAACATCAC TTGGACGTCA GCCGTCGTTG CGTAAAGGCC GCAACGGCTG
GCCGCTTTTG AAGCGTAAAG AAGCCAGTTT TAGATTGACG GACTTCCGTG TGCGCAACCC
AATTGAACGT CACCTGCAAC AAGAATCAAT GGGGCGTTTC AATGTGGAAG CACTCGTTTT
GCGAGCAATC AAAGTGGAGG TGATTGATTT ACTTCGGGAT GCGGGTGGAT CTTGTACCGT
GCTGGCCATG GCGAACGCCG TTTCGGGAAG TCCGCGAGTC CAGGAATGCG TGGGGGATGC
AGCAGATTCC GAGAACGATG CTTGGGGATT GGCATTTTTG GACCTGATAC CGGGACGAGC
CGCGCGAAAG AAAAGCAACG CACGGTTGGC AAAGCGCTAC GAGTTTATCA ATGCCTTGCA
CAAATTAGAG GCTTCGCCGA GTTCCAGTGA AGCACGCATT TCAAGCTGGG AAGAGGCTTT
GGAAAGACTG CGAACGGTCT TAAATACTCA ATTTGCGGAG GCAGATGGAA GTGATGGCTT
GTCGATTCCC GACGATCTAT TAGAAGAGAG TATTGTCGAC AAATACTTGC TCCGCGCGCA
AGCTATTAAA CTGCAGCAAA TAGAGGAGCT AGAGCATTCC CAAAACCAGT GGTTACAAAT
TTCAAATGCT GCCCTTTCAC AGCGAATATC ACAAGTCGAC GCCAATCAGT TGGATGAAGC
GGACCGATTA ATGAGAGCTC GCTTGGACGC CGAGTATGAA AAGCGGGAGA AACGTCTACA
AAGTACGTCT CTGGCAGAAT TGGAGGCCAA GCTTATTCAA GAAGAGAGAA ATCGTGAAGC
GAGAGAACGA GCATTATCTC TGATGCGACC TTTGGAGGAT GCGGAACTTA AAATTGTACA
TGAGGCGATG AGTCATGTCG GGAATCCGAA CGAAATAGTT GCCCAAGCGG GCGTCGACTC
GGTTCAAAGG GAATCGTTTC AACGACTTGC ACCGGCACAA TGGCTCAATG ACGAGGTCAT
TCATTACTTT TATGTCATGC TAGCCAACCG GGACGAGGAA TTATGCAAGG CAGATCCCAA
CCGCAAGCGA TGTCATTTCT TCAAGTCGTT TTTCATCACG AAGCTTTTAG ACGAGGAACA
TTCCAATCCA TCACTCCGCG GTAAATACAA CTACAACAAC GTGAAACGAT GGTCCAAAAA
GGTTCCAGGT ACGTTATCAG AGAAAAGGGC CTTCTGTTTT GTTGTCTCTA TGTTCTTACA
AGGTATCGAC TGCCAGGCAA GGACATCTTC AATCTTGACA AAATCTTCTT TCCCATCAAT
GTCAGTCGGA TGCACTGGGT ATGCGCAGTT GTCTTTATGC AACAAAAGAA GGTTCAGTTC
TATGACTCCA TGGGGGATGG TGGTATGTAT CATTTAAAGG CAATTTTTCG CTACATCCAA
GACGAGCATC AAGCTAAGGA AGGTGCTCCG TTACCGGACG CCGATGCGTG GACACTTGTG
CCGTGCTTAT CGGATACACC ACGTCAAAAA AACGGTACGT AGGAAGTATA CGATGCGAGA
ACGGTTGGCT GAACGAGAAC TTTGTTTACC ATTGTTTTTG ACGAACCAGG TTACGACTGC
GGAGTGTTTG CGTGTATGTT TGCCGACTTT TTATCTAAAG ATTGCCCTTT GGTGTTCGAT
CAAAGTATGG TCAACCAATG CAGAAATCGT ATCGCACTCG CTCTCTTGAA CGGCAAAGCT
ATCCTGTAAA TGTACGATAT ATTCTTTCAC TAACATTTGA CATAGTATCA GATAGATTAC
TGTCTTGCGA TATTCTCGAG AAAGATGGAA TCACTGGGCA GAAATGTTTC AGGCTCCTCT
GTCAATTAGT TGGCACTGAC GCAATCAAGG AATTTCCAGT AGTGACAGCG CGTAGAGCCG
TTCGTGGGGA TAAAGCGTCG CGCTTCGAGA GGTTTAACGA TCTTACTTAT ATGGTAAATA
GCAGACGGAG ATGCCACTAT GAGTGTTTTT ATCAATTTCT ATGACCAGGA CCAATCAAAG
GAGCGTTTTT CTTTTGATAT TTCCCATGCG GACATCGAAG TTGCCAAGTG ATTGTTACTG
TTGGTAAATT GGGCACGCCG CGTCGCTTTC CATTGGCACA CGGACACGCC CGACAAGATG
GCACGGCGAA CCGCACCGTC CGTGAATGCG GTAGTTCACT GTCACCTCTT CTCGCTTATT
TCTGGAAAGT TCTCGCTCGG TACGTTCCGT CGTCACAAAA ACATACACGG CCGCTTACGG
TTGGTTTCAT TGCAAGGGGA AACAAACCTT GACACATCAC TCCAAAGAAG TTCTTTGACC
ACATACACTT TTCATTTGCT TATTAAGATG GTTTCTACTC GACGTTCCCA GCCCGTCAAG
GGACCCGCGT ACACTAACTG TGACAAGGAT GAGGTAAGCT TCAGCATCGG GATCCTGCGG
TCGTGTTGTT ATGCCGCCTC TGTGCGAGCG TCTCCGACAG CTCCCGATAG TACTTGAATA
CAATTCCTGT TGCTCACATA CTATATCCGC GATTCACATT AGCTTGCACG ATCCCAGGGA
ACCGAGGTTG ATACGTTCCC CAGCACAACT GTACGAAAGT CTGACCAGTC CCCCGGAACG
GCGGACTCGA CGATGCACTG GACAGGCGCC TTCCACGATG ACACCGCGAT TATCGCGAAA
CACGAGTTGG GACCGGATGT TGAGGACAGC GTCGTTGTCT CTCCAATGGA AAGCGATAGT
AGTACCACCA CCGCCGTTCA CACCAACACG AAGAAGAAAG TCAAGGCCGC GGCGCGCAAG
GCATCGGCGG CGACCGTCAA GCACGTCGCC ACCACGACGA TGTGTGCTCC AGCGAGTCCG
CCTGGGACCA CCAATCAGAC CAGAGCCGTC ACCCGCAGTA AAGGAGGCAG CGTGGATCTA
TTCAAGGGTA TTGAAAGTGT TCCCGTGGTA AAAAAGCCTT CTCCGAAAAA GCGAGGCGAC
GAGGGGGGTA ACATGCAGAA AATCAAACTT CTCACGGGGA CCCTGTACCT GTACCGGGGG
CGGTATCCAC GAGCCGAATT TGTGCGCACC AAGTGA
 
Protein sequence
MTVLERWRRR KDQRKTSLGR QPSLRKGRNG WPLLKRKEAS FRLTDFRVRN PIERHLQQES 
MGRFNVEALV LRAIKVEVID LLRDAGGSCT VLAMANAVSG SPRVQECVGD AADSENDAWG
LAFLDLIPGR AARKKSNARL AKRYEFINAL HKLEASPSSS EARISSWEEA LERLRTVLNT
QFAEADGSDG LSIPDDLLEE SIVDKYLLRA QAIKLQQIEE LEHSQNQWLQ ISNAALSQRI
SQVDANQLDE ADRLMRARLD AEYEKREKRL QSTSLAELEA KLIQEERNRE ARERALSLMR
PLEDAELKIV HEAMSHVGNP NEIVAQAGVD SVQRESFQRL APAQWLNDEV IHYFYVMLAN
RDEELCKADP NRKRCHFFKS FFITKLLDEE HSNPSLRGKY NYNNVKRWSK KVPGKDIFNL
DKIFFPINVS RMHWVCAVVF MQQKKVQFYD SMGDGGMYHL KAIFRYIQDE HQAKEGAPLP
DADAWTLVPC LSDTPRQKNV IVTVGKLGTP RRFPLAHGHA RQDGTANRTV RECGSSLSPL
LAYFWKVLAR SSLTTYTFHL LIKMVSTRRS QPVKGPAYTN CDKDELARSQ GTEVDTFPST
TVRKSDQSPG TADSTMHWTG AFHDDTAIIA KHELGPDVED SVVVSPMESD SSTTTAVHTN
TKKKVKAAAR KASAATVKHV ATTTMCAPAS PPGTTNQTRA VTRSKGGSVD LFKGIESVPV
VKKPSPKKRG DEGGNMQKIK LLTGTLYLYR GRYPRAEFVR TK