Gene PHATRDRAFT_44889 details

Gene Information

Locus tag: PHATRDRAFT_44889
Symbol:
ID: 7199811
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011673
Strand:
Start bp: 551669
End bp: 554786
Gene Length: 3118 bp
Protein Length: 892 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002178801
Protein GI: 219116012
COG category:
COG ID:
TIGRFAM ID:
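
The gene length listed above matches the start and end coordinates if both endpoints are counted. The following is a minimal sketch of that arithmetic, assuming 1-based inclusive coordinates (an assumption that fits the listed values but is not stated in the record); the function name `inclusive_length` is hypothetical and used only for illustration.

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end positions are both included.

    Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must be greater than or equal to start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(inclusive_length(551669, 554786))  # 3118
```

The 892 aa protein accounts for fewer bases than the 3118 bp gene span, which is what one would expect for a gene marked as spliced.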


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.423509
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGACGTAGCA CTAGCATCCA CTCTGCCCTG CGTCGACTCT ATTCCTATCG TGATAACACA 
CGGAATAGCA CTTTTGTTCT TTGCATTTCG TTCTCAAACA ACCATGGCCG CAAAAGACGA
CACGGGAATT CCACGCTTTT CTCCTTCTTC TTGGAGCATG CGCCGGCACT TTGGAGTTTC
CGTAGTGTTA GCGCTGGGCT TTGGGATACT ATTGTTGGTA TTATCTAGTG ACCCACACTC
CTCGACCAGC TACACACAAC ACACCAACTC TCCGGTCTTT TCCTCCCGCT TGCTGCGCCT
CGAGCCCACC TGGGAGCTGC AAACACGAGA CAATGTATCC TCCGTATGGA AAGCACTGGG
AGTGTCCAAC GAGCAAAGGG TCCTCCTGAG GGTACATTAC TCGAAACACC AGCCCAAAAA
CAAGCGTACA GTCAGCGATG CCAGTGGTGA TTGGTTTGAT GATTCCACAA TTCGTAGTCT
TCAACAATTG ACATCCGAAG ACCGCATGGC AGCGGAAGCG GAAGAGCTGG CCGTTAACGT
ATCGTTTGAA GACATTTGTA CGTAGAACGT TGAATATTCT GCAAATTGAT TCCGTATTAC
TTTCTTACAG ATGCCAGCGT TTTTCTCTCT CAGATAAAAC AATTGTATTT ATCACCGCCA
CTTGGGCTTT GGGAAATTTG AGCACCTGGC TGCATATGCC GAGCTTGGTG GGTGAAATCA
TGTCGGGATT TTTGCTAGGT CCGCCACTCG CTGACTTTTG TCCCTTCCCT GAGGCCATGG
TTCTCATTGG GAGTTTTGGA TTGATTGGGC TCATTTTGGA ATCCGGTATT GATTTGGATG
TCGGCCAGCT TAAAGAAACA GGGAAACGCG CCATTCTGAT GGCGTTTACC GGAACGGCCC
TACCACTATT AGTGGGAATG GGTCTTGGTC GGGCTGCTGG GCAAGAATTG CAATCCAGTA
TCGCAATTGG CGCAACGTTT TCGCCTTCCT CGCTTGGTGT CTCGGCCAAT GTACTGTCGG
CCGGAGAAGT TCTCAACACA CCGACGGGTC AAATGATTGT CGCCAGTAGC GTTGTCGATG
ATGTCCTTGG GCTTATCATG TTGAGTATAC TGGATGTTTT TGTCAGGGAA AACACCACGG
CGTTCGATTA TTGCATACCC TTCATTTCGT CGTTTGGGTA TTTGATTGTA CTTGGATACT
CCGGGATCAC TTGGATGCCG TACATAATTG AACACAAAAT TATGGCTAGA TTTCCGGAGG
GGTACCGCGA GTTGGTCGCC TTTTGTCTCA TGTTTTTACT CCTCCTCGTA TACCTTCCGC
TGCTGAATTA CTCCCGTGCC TCGTACCTGA CTGGTACTTT TTTGGCGGGT CTGACATTTT
CCCAGATAAA CTCGGTCCAT GCTGCTTTTG CTCAGCACGG ACGAGGAATT CTCGATTGGC
TGTTGCGCAT ATTTTTTGCA GCCACTATCG GGTTCCAAGT GCCCATTACA CGCTTTCAAG
ACGGTTACGT CCTCAAGTGG GGCGCGAAAT TTTGTAAGTC CTTGCGCCCG AGCAGCGACT
TTATAGTCAT TGCTATGGAT TGATCCCATT CGCTCACGAT TCTGTTTTTA CTCAATGTAG
TGGTGCCGAT TTTGGCCAAA ATGCCTTTAG GCCTTTACGT CCCGCGCTCC AACTGTAAAA
CGCTCCCGGA GGACTTTCCG TATGACCCGT ATTGGCGCGA TGTATGGATT ACTTCGCTGG
CACTGGTCTG CCGCGGCGAA TTCAACTTTA TTATTGCGAG TTTTGCATTA AGTGAAGGAC
TTGTCAACCC TGACATCTAT TCGGCGATTG TCTTTTCAGT ACTATGTGCC AGCATCTTTG
GACCGCTGAT ATTGGCACGT GTCATTGCCT ACTACAATGC TAAATCACAG GCCTATCTTT
CTGGAAGTCA TCCCATCGAG AGAGACGGCG ATACGTCTGA CGGCTTCCGT CCGCTTTATC
TAGCGATTCA AGCGAGAACT CCGATCCACT GGGGTTTGCA GGACAGTTTC AAGAAAGCTT
TGGACGATGC TGGCCTCATC ATCATTGATC ATCGTTCCTG GCATACTTTG GGCCTCAACG
CGATCGATAT TACGGAGCTC TTTGTTCAAG ACACGAGAGT CAAGGTTCGT GTTTGTGCAT
GCTTCGAAAC TCGCAAAGCA GCAGCTGCGG CGGCTGCCAA TTCCGGTGAG TCGGTTGCGA
TTCTTTTGCC GATTCAAAAA GGCCAAGTAC CTGAGTCAGC TACCATTAAC GGCTCAGCAA
CAAGTCAAGA CACAGATGCC AAAGACTCGG AGACTAGTAG TTCGCAAATG GAAAAAGGCA
AGACAGAAGA CGAGATCATT CGTGCACGGT GCGATGAAAT AAAACAACGT ACGTAGAACG
ATTCTGTCGA GTATGATAAA AGATGCGTGA TGATATCCTA ACCCTTTCCT CCGACGTAGT
TCTCTCCAAT TGCCTCCTTC CACACGATAC CGAGGACTAT GTGATTCAGG TGTCGCAGTG
GCAGACATAT ACGTTCGACA ATCAAGACTT GAAGGGATCG GATGATGACA AAAAGTTCTA
TCGATTCAAC TTGCATCAGC CAACGGAATT AATAGTAGCT CCGAGCGAAG TATCTGCAGA
AGATTCTCTT CCAACTGCTT CAGAAGTCGA GCCGCTGCGC CGCCCGGCAT TATATCGGCG
GTCGTCAACA ATCACCGTAA CCGATGATCC TACACCTGCT GATGAGCCCA TGCTTTCGGG
TCCTGATCTG TGGGAATCAG ATGAAATTTC TCACGCAATG ACTCGCGACG GCTATGTCAT
GTCTCCGGTT CCCGGCGGTA TTCACCGTTC GGTGACCGCA GGTCTAGTTG GAGAAGCTGA
GCACGGACAT CATCCGGAAC CGTATCACCG TCGTCGCATA ACTTTCGACG CAGCCCTGTT
GACAACTCAT GGGGACGAGC TTGAAACAAG CATGATCAAA GAGCGTCTGC ATGGATATGT
ACGACCGCAT TTGTAGCTTT TGTGGTCATA CACGCGTCTT CTTAGCCCTG TCATCTCGAG
ATACATTTTT GAATAGTATG TAGATTATAG TTTTTGTACT TGACAGAGAG TTATGCTT
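
The gene sequence above is printed in space-separated blocks of ten bases, so the whitespace has to be removed before the length or GC content can be checked against the record. The following is a minimal sketch of one way to do that; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first two blocks of the sequence, not the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence above, used as a small example.
example = "CGACGTAGCA CTAGCATCCA"
seq = clean_sequence(example)
print(len(seq), round(gc_fraction(seq), 2))  # 20 0.55
```

Running the same two helpers on the full blocked sequence would give values that can be compared with the Gene Length and GC content fields.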
 
Protein sequence
MAAKDDTGIP RFSPSSWSMR RHFGVSVVLA LGFGILLLVL SSDPHSSTSY TQHTNSPVFS 
SRLLRLEPTW ELQTRDNVSS VWKALGVSNE QRVLLRVHYS KHQPKNKRTV SDASGDWFDD
STIRSLQQLT SEDRMAAEAE ELAVNVSFED IYKTIVFITA TWALGNLSTW LHMPSLVGEI
MSGFLLGPPL ADFCPFPEAM VLIGSFGLIG LILESGIDLD VGQLKETGKR AILMAFTGTA
LPLLVGMGLG RAAGQELQSS IAIGATFSPS SLGVSANVLS AGEVLNTPTG QMIVASSVVD
DVLGLIMLSI LDVFVRENTT AFDYCIPFIS SFGYLIVLGY SGITWMPYII EHKIMARFPE
GYRELVAFCL MFLLLLVYLP LLNYSRASYL TGTFLAGLTF SQINSVHAAF AQHGRGILDW
LLRIFFAATI GFQVPITRFQ DGYVLKWGAK FLVPILAKMP LGLYVPRSNC KTLPEDFPYD
PYWRDVWITS LALVCRGEFN FIIASFALSE GLVNPDIYSA IVFSVLCASI FGPLILARVI
AYYNAKSQAY LSGSHPIERD GDTSDGFRPL YLAIQARTPI HWGLQDSFKK ALDDAGLIII
DHRSWHTLGL NAIDITELFV QDTRVKVRVC ACFETRKAAA AAAANSGESV AILLPIQKGQ
VPESATINGS ATSQDTDAKD SETSSSQMEK GKTEDEIIRA RCDEIKQLLS NCLLPHDTED
YVIQVSQWQT YTFDNQDLKG SDDDKKFYRF NLHQPTELIV APSEVSAEDS LPTASEVEPL
RRPALYRRSS TITVTDDPTP ADEPMLSGPD LWESDEISHA MTRDGYVMSP VPGGIHRSVT
AGLVGEAEHG HHPEPYHRRR ITFDAALLTT HGDELETSMI KERLHGYVRP HL