Gene PHATRDRAFT_45022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_45022 
Symbol 
ID7199531 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011673 
Strand
Start bp1029212 
End bp1031163 
Gene Length1952 bp 
Protein Length513 aa 
Translation table 
GC content58% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178898 
Protein GI219116206 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000948434 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTTTGTTTCC GGACAATCGA AACGATCGTA TCCACTTGGA ACCAACACAC ACACACAAAG 
TAGGTCCCGT GAGTCCTTCC TTCGAACAAC AAACAAACAA ACAAACAGCT ACGTGAACGC
AACTTCTAGA GGCAATCTTG TCTTCGAGAA GCAGAATTTC TCCCGAGAAC AAACCCGGCA
CTTTATTCCA TCATGTCGGG TGAAAGCTAT CCTTCGGAAT CTCCGGAAAT TCCGACATCG
CGTCCAGCAG ATTCCACCCA CAAAGCGAGA AAATCCGACG GCCGTAAGGC TGATGCGGAC
CGGAATCCTG TTCATTCCAA GCAAAGCGAT CCCGCACCGC ACGAAACGGA CATGCTCGCG
GACCCCGCAA CGACCCTCTC CCCCTTCCCC ACGGATCAAA CCTCGTCTCC GGCTTGGCAC
TACCAGCATC ATCCGCCACC ACCACACGCC TACTACTATC CGCCCTCGCC GTACGGCTAC
ACCGGGGTAC CGCCGTACCC GGGAGGACCT CCACCGCCCT ACGGACCATC CTATCCACCC
CCCTTCTTTC CAACCGCACC GGGAGCCTAT TACCCACCTC CCCCGGGATA CGCACCTCCC
GGCTCCCCGT CACCGTCCCG CAGTGATGGT CACATGAACG AGCACGGGCA TGCGTCTTCG
CCGGGACGCG GCGAAGCCGG AGGCTACGGG ATGCCCTACC CCCCACCGCC CTACGATCCC
TACCAACCTC CGCCCTACGG CTCGGCCTAC CCGATCGGGG CGCCGCCCCT TAGTCCACGC
CATCCCGTCT ATTCCAATGG ACTACCGCCA CACAGTCCGG ACGCCAACGG GGTTTCCTCG
AAGCATCCGC ATTCGTCGAC GAATCCCCCG CCCTTGGAAG ACCTCGACAC GGCCTCCGAT
TCCGGCAAAA CATCGGCCGT CGAACCCGCG GTGTCAGCAA ATGAACTCCG GAAACTCAAG
ACCTACATTC GGCCACCGGC ACCGTCCAAT CCCGAAGTGG TCGCCCGTCG CCAACACAAG
AACTCTCAGA GTCGTCGCCG CGCCGCTGTT CTGAGGGACC GGGTTGCCGC CGTCGCCGCC
ATGGATGCTA CCAAACGTAC CGAAGAAGAT CAACACATGT TGCACCTACA CGAGACGCGT
CGGGAACGGA AAAATAATAG GTCACGAGAA CGCGCTCTCG AACGCAAGGA AGAGATGGAT
CGTATCCTTG GCAAGAAAAT ACGACAACGT ACACGACTCG AAGTGCAGTT TTTGAATAAC
ACAATGTCCA AGAAGCAGCG AAAGAATGAA GGCGATCGTT TACGTCGGGC TCGACTCAAG
GCGCTCGGAT TGGACGCACG TAACGGGGCC GCCAAAAAGC CCGGTGTTCC GGCGCGTGGA
CCACTCCCTC CGCATCTCTT GGATCCACAA CGGCAGCATT TACCCTTGGA TCCGCAACAA
CAGCATCATC ATCACAACCG ACCGCCACCG TCGCCGCGTT TCATGCCTCC CGCACTAGCG
TATCACCAAC ACGCTTCACC GCAGCCGCAA CCGTACTACA CACACCGGTT CACCATTCCC
GCACAGCACC CAAGTGTTGG AGCTCCCTGG CACCCGACCG CACCCACAAC GTACGCGCAC
CAGGAATTTG CCAAACACGC CGAAGGCAAC AGCGTTGCCA AATCCGAGGC GGTTGTGGAC
GCCAATACCG TAACGCCAGC GTCGCCAAAC GAAGTCGACG GTGTTTCCGT TTGAATAGAC
GGTGAGAAGC CGACGACGAT CTGCATAGTG AGGAATGCAA CTACGGCCGT AGGAAGGAAT
CAGGAATGGT ATATTTCGTA GTTTTATGTA AGAAAAATGG CTCGCGGTGG AGCCTGGGCA
ATTGCACATG TGAAAGTCTA GTCATGGACC TGTGGGAGAG CTGTGTTGAT GGGGTGGGAA
AGGAGTCTCG GAAAGAGAAG ATGACCAAGG CT
 
Protein sequence
MSGESYPSES PEIPTSRPAD STHKARKSDG RKADADRNPV HSKQSDPAPH ETDMLADPAT 
TLSPFPTDQT SSPAWHYQHH PPPPHAYYYP PSPYGYTGVP PYPGGPPPPY GPSYPPPFFP
TAPGAYYPPP PGYAPPGSPS PSRSDGHMNE HGHASSPGRG EAGGYGMPYP PPPYDPYQPP
PYGSAYPIGA PPLSPRHPVY SNGLPPHSPD ANGVSSKHPH SSTNPPPLED LDTASDSGKT
SAVEPAVSAN ELRKLKTYIR PPAPSNPEVV ARRQHKNSQS RRRAAVLRDR VAAVAAMDAT
KRTEEDQHML HLHETRRERK NNRSRERALE RKEEMDRILG KKIRQRTRLE VQFLNNTMSK
KQRKNEGDRL RRARLKALGL DARNGAAKKP GVPARGPLPP HLLDPQRQHL PLDPQQQHHH
HNRPPPSPRF MPPALAYHQH ASPQPQPYYT HRFTIPAQHP SVGAPWHPTA PTTYAHQEFA
KHAEGNSVAK SEAVVDANTV TPASPNEVDG VSV