Gene PHATRDRAFT_48898 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_48898 
Symbol 
ID7194973 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011687 
Strand
Start bp623404 
End bp624633 
Gene Length1230 bp 
Protein Length404 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183395 
Protein GI219126293 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CCAGCGCTAT CTGAAATGTG GCTTAAAAAG CATTTACTGG CAGTAGCATT GCTTTTGGCC 
TGTGTGATTG CTACGACCCT GTTCCAGTTT CGTGCCTTTC CCGGCCTACC ACTGCATCAG
GCACCAATTG CAGTGCGCCT TTCCAAAGTC AAAGAGAAGC TCAAACAAGA AAATTCCAAG
TCGCTACGAC CAGACCCGGA AGCCCTTCAC CAAGAGAGTG GAACATCGAC CGCGCGAGGT
AGATGCGCCA TCAATCTATT TGGGTTACCC AGAGCTTTCC AATCACTAGT ATTGCCTTCG
CTTGTTCAAA ACGTGCTGTC TCCTAACTCT CTTTACCAAT GTGACTATTT TGTGCATTAC
TACTTCTTGA CATATGAAGA AGCGGGACGA TCGGGCCTTG GTGGCCGCAT TGATCCGGAC
GAAATCTTGC TACTGGAACA AGCCGTTAGA GATGTCTCGC CAAATTCGGT CATCTCGTTC
CGATTTGATC ACGAACAGGC CTTTTGGGAT AAATACCAAC CATTCATTGA CAAGATACGG
ACGGCCAAAG ATACAGATGG ACGCTTTTTG TATTTTCCTT GGCGTGATAC ATCGTACGTT
TATCCAGAAA CGCTAGATAA TATTGTCAAA ATGTGGCACA GCATTGAGTC GGCTTGGGAA
GTAATGACGA AGCATGAACT TGAGACGTCT TTGCGGTACG ATCGCGTCGC CGTACTGCGC
TCCGACGTTG TCTACGTAAC GCCAATCGAC GTTTTTCAAG ACAATTGGCG ACTAATCAAT
GATAGCGACC GAGTAGCTGT GGTTCCTGCC TTTGGTAGGT ACCCAGTCAA TGATCGCATG
ATTGTGGGAC CGCGAGAGGC CGTGGAAATA TGGGCTGCAC AGCGATTTAA CCGGCTGGAG
ACGCATATAA AGTTTGTGCA GGAGAATCAT CCGGGATGGG GTATGCATTC AGAAAGATTT
ATCAAATGGA CGATAAATCC GGCCATTCGA GATAGCAACA CAACCATAGT CGAAGACGGC
AATATATGCT TTTTTCGCGT CCGTGCGGAC GAGACGGTGA AGATCAATGA TTGCGAGGAC
GGCAAGAGTG TGGTGGCTGC TCCATCAATT GTTGAGAATA CAGGTGAAGG CAAGGCCAAA
CTGTTGGAGT CGATCCTGGG TCGCAAATGT TTGGTCCGGC CTCCAGATTC AGCGTCTACC
AGTTTGCAAT GTCCAAAAAA CATGACATGA
 
Protein sequence
MWLKKHLLAV ALLLACVIAT TLFQFRAFPG LPLHQAPIAV RLSKVKEKLK QENSKSLRPD 
PEALHQESGT STARGRCAIN LFGLPRAFQS LVLPSLVQNV LSPNSLYQCD YFVHYYFLTY
EEAGRSGLGG RIDPDEILLL EQAVRDVSPN SVISFRFDHE QAFWDKYQPF IDKIRTAKDT
DGRFLYFPWR DTSYVYPETL DNIVKMWHSI ESAWEVMTKH ELETSLRYDR VAVLRSDVVY
VTPIDVFQDN WRLINDSDRV AVVPAFGRYP VNDRMIVGPR EAVEIWAAQR FNRLETHIKF
VQENHPGWGM HSERFIKWTI NPAIRDSNTT IVEDGNICFF RVRADETVKI NDCEDGKSVV
AAPSIVENTG EGKAKLLESI LGRKCLVRPP DSASTSLQCP KNMT