Gene PHATRDRAFT_41506 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_41506 
Symbol 
ID7199373 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011699 
Strand
Start bp23023 
End bp24381 
Gene Length1359 bp 
Protein Length452 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185473 
Protein GI219130650 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000024093 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTACTG ACGGTGCTAA TGCGTTTCGT GGTGCGCTCG GGTGGATCGG ATGGTCTGTT 
CCTGCGGCGA ACGCGTTTAC GAACGAAGGT TTTGATGCGA TGGAATCCCT TGGCTTGGTT
ACCTGTGACC GTCTTAAGGA TATCTGCAAG ATCATTCGTC GTGGTACCGA TGGCGTGGCC
GCAGTGCCAG CTGCTGGTGG AAATGCTGCG GTGGCGGCGG TGCCTGGCAT CCCTGGGATA
GCGATCCCCA TGATGTGGGA GTACAAGCTA AGCGGAATGC ATCTCTGGGT GTCTGAGCGT
CTCCGACAGG GGACTCCGGT TGTTGCGGCG GACTTTACTG CGGCCATTGG AAACCTGTAC
ACCAGGAAAG TGCGTAAATT GGAAGAAGCG AAGGATGACG AGGATGTCCA GGTCAAGCCC
CCGGCTCCGT TCTCGAAAGA AACGAAGTGG ATTTCGTTCT TCAAGTTGCT GGTCAATTAT
TTGAGCTCCG TGACGGGTGT CAACAAAGTG CCATTGGATT ATGTTGTCCG GAAGGATGAC
AACATTGCTG CCCCCAATGC CGAGTTTGAG ACGGAGCACA AGAAGTTGGT GTTGTTGACT
CCCCATTTGG GGACGGCTTT CGACAAAGAC AACGGCAAAG TTTGGATCCA GGTGAAGCAA
TTGACTGTGA ACGGTCCGGC CTGGACCTAC GTTGCTCCTT TCGAGAAGAA ACGCGACGGT
CATGGAGCGG TCAAGGCTTT GAACAGCCAC TATGAAGGAG ATGCGGTGAT GTCAAAATCG
AAGGCGGCTG CCTTTAATGT GCTTGAGCAC ACCACCTACA CTGGAGAACG TCGAAATTTC
GGTATGGAAC GGTACACGAA CGCCTTGTCC ACGGCATTCC AGACCTTGGA CAAGTACGGA
GAGACCTTGA CGGAGTCAAG AAATGTGGAT GTGTTCCTCC GCAACAATCA CTGTACGGAT
CCCAAGATGC TCTCAGGAAT TGCGGTAATT CAGGGAGACG CGGATTGGAT GTCCAATTTT
GCCAAGGTGG CCGACCATTT GGCCTTGTTT ACTAACACCG ATAACTCTCA AAAGACAGGT
TGTTTGATCT CAAGTGCTCA GCGGACTAGT AACAACAAGA AGAAGCCGGG TATCCGAGCG
GGCAATTATA ACCCAAATGA ATGGCATCAA CTCTCGGACA AGGCAAAGGA CGAAGTTAGA
GCCAAGCGAG TGGCCGCCAA GTCCTCTCGC AATAAAAATA AGCGCTCGGC AGCAGCAATC
ACTTGTTCGA GCAAGAAACC TGACAAAAAT CAGTTTGCTC TCCCGAATAA GAAGAAAAAA
AGGAAGACTG TTGGTTTTGA AGGCGAAACG AGCAATTGA
 
Protein sequence
MATDGANAFR GALGWIGWSV PAANAFTNEG FDAMESLGLV TCDRLKDICK IIRRGTDGVA 
AVPAAGGNAA VAAVPGIPGI AIPMMWEYKL SGMHLWVSER LRQGTPVVAA DFTAAIGNLY
TRKVRKLEEA KDDEDVQVKP PAPFSKETKW ISFFKLLVNY LSSVTGVNKV PLDYVVRKDD
NIAAPNAEFE TEHKKLVLLT PHLGTAFDKD NGKVWIQVKQ LTVNGPAWTY VAPFEKKRDG
HGAVKALNSH YEGDAVMSKS KAAAFNVLEH TTYTGERRNF GMERYTNALS TAFQTLDKYG
ETLTESRNVD VFLRNNHCTD PKMLSGIAVI QGDADWMSNF AKVADHLALF TNTDNSQKTG
CLISSAQRTS NNKKKPGIRA GNYNPNEWHQ LSDKAKDEVR AKRVAAKSSR NKNKRSAAAI
TCSSKKPDKN QFALPNKKKK RKTVGFEGET SN