Gene PHATRDRAFT_49237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49237 
Symbol 
ID7195700 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011689 
Strand
Start bp323854 
End bp325352 
Gene Length1499 bp 
Protein Length414 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183855 
Protein GI219127256 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.178302 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATCGAAACAG CTCCGTATAC AACTAAAAAC GTGCGAAATT CATCCAAAGG ACGGCAGCGC 
AGCCCATCCC AAGCTCCGCA AAAAGGAAGA ACGCCCTCTT TTAACTTACT ATGCACCACG
ATTCCTTCGG CAGCTCTACC GGCTTTCGTG GCGTCGATGT TGCCGTTCTT GCACGGCCAC
ACTTGCTGAT TGCAAACATC AGAGCCCACG AAAACGACAT TCTGCTCGGG CGAGGTACGT
GTCGGAGACG CTTCAAGTGA GCAACGACGA CTGTGGAATT CTCGGATCTA ATAAAGATGC
GCCCATGTTC TCTTTTTCCA TTGCGAATCA GGTGGAAAGA ATAATCAACA CGCAGGGAAC
CAAAAGCTTC GACGGATGGC TCGCCAGTAT TGCGCGCGGT ATCATGCCGC GGCGAAGAAG
CAGAAGCCTG TGATTGCACT CGAACTCGTC AGGCAGGTCC ATTCCCTGTC CCCGCCTGGA
CGTTTTCTGA AGCACAAAAT GGGAGGTTGG GAAGAAGCCA CGGAAGATAT TGCGAAGGAA
AAAGTCAGTC AATGTCTACG AGATATCGTA GCGTCTCAGC TCAAAGCTGG AAATTCTCTA
TCTACACTGG AGGAAGCGAC GATACATGTC AATTGTCGAA CTTTTATGGA AGCGCCGCCT
TCCAATACCT TCCAACAGTA TACCAATCAT GTTACTCCTT CTGCTTCACC AAGCGCAACA
AGCCACTCGC AAAAGCAAAT GTTTGACTGG CAGCAACCGA TGATGCAACC GTACCGGAAC
GATAGTTCCG GAGTCCGGTA TATCTCCCCA CCGGATGTAT CAGCAAAAAG GACGATGCAC
TGCATACAGA GTCGCTCCTC TTTTGAAGGA ATGAGAGAAT TCCGACATCC GTATCCGCAT
GGCGTTTGCT CAACAATGAA TGTTGGCGTG TCCCGGGAAC GTCACATACA ACACCAAAAG
AGACTAAGTT CGAACCCGAT GAACTCAGGT TGGTGTCAAG GTCCTTCAAA GAGAGTACGT
GAGGGATATC AGGACATAGC TCCACGACAG CCGCGTCATC CGCAACAGGA TTACGGTCAA
ATGGAACAGT TTGACGTAAA TTTACTTTCC ATGGAGCAGC ACGACGCCGC CCGCCACATG
CCTACGCCTT TCGTGAGCTC CGAGTCTTCG AAAACTCGAG GCCAGGAAGA CAACTTTACG
TTAAACTCGG CGGCGCGACT GGCTTACATG GAGCGTCAGA ATGGGGTATT CTCACAACAG
CAAAACTTCC AACGTCACCA GAGTGACCTT CAACAGATTA AGGACGTGGA TATATTCAGC
CCCGAGGATA TTGAAGGCAT CGCATTTGAA AGCATGGATG AATTTTTACC TCCACCTTCC
GTGCAGTCAG CCCCATCCCA GGACGATGAC AACCTTCGCA CTTACGTCTT GCGCATGCTC
CAGGAGCTTT AACCTGAATT TTGACTGCAC CTAACATAAG TAATCTGGAA TATAAGAGC
 
Protein sequence
MHHDSFGSST GFRGVDVAVL ARPHLLIANI RAHENDILLG RGGKNNQHAG NQKLRRMARQ 
YCARYHAAAK KQKPVIALEL VRQVHSLSPP GRFLKHKMGG WEEATEDIAK EKVSQCLRDI
VASQLKAGNS LSTLEEATIH VNCRTFMEAP PSNTFQQYTN HVTPSASPSA TSHSQKQMFD
WQQPMMQPYR NDSSGVRYIS PPDVSAKRTM HCIQSRSSFE GMREFRHPYP HGVCSTMNVG
VSRERHIQHQ KRLSSNPMNS GWCQGPSKRV REGYQDIAPR QPRHPQQDYG QMEQFDVNLL
SMEQHDAARH MPTPFVSSES SKTRGQEDNF TLNSAARLAY MERQNGVFSQ QQNFQRHQSD
LQQIKDVDIF SPEDIEGIAF ESMDEFLPPP SVQSAPSQDD DNLRTYVLRM LQEL