Gene PHATRDRAFT_50495 details


Gene Information

Locus tag: PHATRDRAFT_50495
Symbol:
ID: 7199329
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011698
Strand:
Start bp: 213053
End bp: 214441
Gene Length: 1389 bp
Protein Length: 440 aa
Translation table:
GC content: 52%
IMG OID:
Product: predicted protein
Protein accession: XP_002185449
Protein GI: 219130599
COG category:
COG ID:
TIGRFAM ID:
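
The coordinate and length fields above can be cross-checked with a few lines of arithmetic. The sketch below assumes 1-based inclusive coordinates and a protein length that excludes the stop codon; the record states neither convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 213053
end_bp = 214441
gene_length_bp = 1389
protein_length_aa = 440

# Assumption: Start bp and End bp are both inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: one stop codon follows the 440 translated codons.
coding_bp = 3 * (protein_length_aa + 1)

# Bases in the listed gene sequence not accounted for by the protein.
extra_bp = gene_length_bp - coding_bp

print(span_bp, coding_bp, extra_bp)
```

Under these assumptions the span equals the stated gene length, and the protein plus its stop codon accounts for 1323 of the 1389 bp.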


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGCCTTTCTA AAGCATATTG TAATCTCTCT CGCCATCACG GATCGACCAC ACTATTCCGT 
GAAGCCATGT CCGAGTCCGG ACAAGAGGCG CACGACGAGA TCCCCATGTC CGATTCCAAC
CACGAAGACT CGGAGCGGAT TTCCCGGGTC GAAATACAGT TGCAGGATTT TGTTACTCAG
TTGCGTTCGT TGGATCTCGA CGGTTCCTTC GATCTACCCA AGCTCGAGAT TCCTAAGAAC
TTGCCTATTC ACGCTCCACA TGCGTACCCG TGCGAATATT CCTTCTCCCG TTTGCCGTCC
CCTCCATCTT CTTGGCCGCA GGCCCCCGTA ATGTTACGAC CCACACCAGG TTCCCACACG
CAAATTCGGG GGATCCGATA CGCTGATTCC AAGACGTACC AGAACTTTTC CGGTTTTTGC
GCCGGCTGCA TCCTGCCGAT TAATACTGGA GCGGAAGAGC CGGGCAAGTC GTTGGTCATT
GATTTCGAAA CGAAACATTT CGTTGGCACC TTGTTGATGC GCATCAAAGA CGTTTCTGCT
CTTCTCGATA AGTCCAGTTA TCAACAAGAG TCTTACTTTG ATGGTAAGAA ACGCAAGTTC
CAAGCGATAA TCAAGGGAAA ATTTAAAACC GCCTTGCCGA TGAGCCAGTG CGTGACGGGG
CAAGCCTTCG AACGGCCGGC CGGCAAGCTG CCGGGTCGCC TCGTAGTCAA TGCCGCCATC
AAATTTATAT CCACACTCGC CCCCCAGCTG GAAGTGACGT TGGACGGCAA TGAACCGCGT
TTCATTACAC CCTTGGTCGC TACGGCGCAC ACCGTTCTCG TCGAAGACTA CATCCGTCCG
GAAACAGAGA TGACTGACAG TGATGCGGCG GGATCCGCCG AATGGACAAT GGAAGAGATG
AAGGAAAGAA CCGATCCGAA ACTAGAAAAG CTCGTCAACT ACCACGTGTA TGCCGGATCG
ACCGATATGG AATGCGATGT TGTCGAACCA CCAACGGACG TCACAACAAG CATTCTGCAG
ACGGTTGAGG GCCAAATTGA GCTGGGAGAA TCGTCCACGG TCAGCGCTCG TATGAAGCAC
CGGAAGAAAA TTTTCAACAA AATTGCGGCA TCTCGCAACG CGCTGCCCTG TTTTCGAACT
GACCAACAAT ACACCTTTGA GTTTTATCAG CATTTGTTGA TCTTTACAGA TACGGATGAT
TTAAAGCTAG ACCTCGGACG TGCGTTGGGA TATGCTGGGT TGGCTCGTCC CTTGAATGGT
CAACCGATCA AATGCATGGC GGCGCACAAA CATTCCGAAA GCACGGCGTT GGATACACTT
TGGTCCTTCG ACATTTGGCA CCACTCCCTG TACGGTCTAG CGCAGCAAGC TTTAAACAGC
GACAAATGA
 
Protein sequence
MSESGQEAHD EIPMSDSNHE DSERISRVEI QLQDFVTQLR SLDLDGSFDL PKLEIPKNLP 
IHAPHAYPCE YSFSRLPSPP SSWPQAPVML RPTPGSHTQI RGIRYADSKT YQNFSGFCAG
CILPINTGAE EPGKSLVIDF ETKHFVGTLL MRIKDVSALL DKSSYQQESY FDGKKRKFQA
IIKGKFKTAL PMSQCVTGQA FERPAGKLPG RLVVNAAIKF ISTLAPQLEV TLDGNEPRFI
TPLVATAHTV LVEDYIRPET EMTDSDAAGS AEWTMEEMKE RTDPKLEKLV NYHVYAGSTD
MECDVVEPPT DVTTSILQTV EGQIELGESS TVSARMKHRK KIFNKIAASR NALPCFRTDQ
QYTFEFYQHL LIFTDTDDLK LDLGRALGYA GLARPLNGQP IKCMAAHKHS ESTALDTLWS
FDIWHHSLYG LAQQALNSDK
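
The relationship between the two listings can be illustrated with a minimal sketch that strips the 10-base grouping, computes GC fraction, and translates from a given offset. It assumes the standard genetic code (the Translation table field is blank in this record), and it assumes the coding region starts at offset 66, where the first ATG of the listed gene sequence occurs. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, not part of any database tool.

```python
from itertools import product

# Standard genetic code (assumed; the record leaves Translation table blank).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces and line breaks used to group the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a non-empty sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq, offset=0):
    """Translate codon by codon from offset, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(offset, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two lines of the gene sequence listed above.
head = """
CGCCTTTCTA AAGCATATTG TAATCTCTCT CGCCATCACG GATCGACCAC ACTATTCCGT
GAAGCCATGT CCGAGTCCGG ACAAGAGGCG CACGACGAGA TCCCCATGTC CGATTCCAAC
"""

print(clean(head).find("ATG"))     # position of the first ATG
print(translate(head, offset=66))  # MSESGQEAHDEIPMSDSN
```

The translated fragment matches the first 18 residues of the protein sequence listed above. Applying `gc_fraction` to the full gene sequence would give a value to compare against the GC content field.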