Gene PHATRDRAFT_33014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_33014 
Symbol 
ID7197021 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011670 
Strand
Start bp1314955 
End bp1316315 
Gene Length1361 bp 
Protein Length436 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177805 
Protein GI219112107 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.111303 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGTCG AAAGCGATCG TTTATATTGG AGCGTCTCGA AAGAAGATCA GAGCTACGCT 
GGCACCTCAG ATCCTGCTCC TCCCGTTCGA TCATTAATTC TTCCTCGTAC GTTGTCACAA
GTTGTCTGTC AATTTCTGCT AAGTATATCT GACCAGTATT TTTTGATATT TGGCTATACT
CAGGAACGTT GGCTTGGTTT TCGGTGGACG ATGACCACAT TTTTCTCTTG GAAGGTTACA
CCGCAGCATC GTACACACCA CCAACCATTA TCTTTCCGGC GTCAACTTTG CCAGGCAAAA
TGTGGCAGAC GCTCTTGCAA ACTCGTACAT GTACACTATC AAGCGCGACT ACACGGGAGA
GTACGGCCGT CTTAAAAAGG GCGTTGTCGG GGACTGAGCC AAAAAGCTTT AAATTTGACG
AGCTAAGACT GAAGGCGTCT AAAAACAAAA AGGATTATCC TTTTGTGGTG GCTGAGTCCC
CCATCCATAT GTTTTGCAAG GTACATGAGC TGATTGCTTT AACAAACGAC GAAGCCTTGG
TCATTTTGAC GGTCGAGACT TTCGTAATTG ATGGATCGGT CCTAGGTCCG CCCACAGATG
AAATGACAAA GCGGCCCAAC GTGACGGCCA AAATTGACGC CGATCTAATT GAACCAGTAG
TAAGTCTCGG CGACGGCAAG GTATTTCCCT TGATGTGTTT GCGATCGATG CCTCGCCCAA
CGCGTTGTGC GCAACGCGGC TCGTGGACTT CCACCGATTT CAATAACAAT GTCGGAAGAG
GGGACCATAG CCTCGCTTAC GAAACGACCG AATGGTCGTA CAGACAACAC GGTGGTACCT
GCCCCCTCGG GTATAACGCA ACAACGGCTT TAATCATGCC TAGACCAATC GGTTGGATTT
CGACATATTC GCAAGAAGGC AATGTTGCTC ATCTGGCTCC CTACAGTTTT TTTACTGATG
TCTCCCGCGG ATGTGAACCA ATAGTCGCAT TTTCCGCCTT TCGCAAGGAA GGTACAATAA
AAAAAGATGC ACACAAAGAT GCAGAGGAGA TGAAGTGTTT TGTATACAAC ATGGTGGATG
AGGACTTGGC CGTAAAAATG AATTATTCTG CTGCAGAGCT TGGACGCAAC GAAAGTGAGT
TCGAATTAGC CAAGTTGACA CCTGGGAAGG CACGTCTAGT AGATGCTCCG GTTGTGTCAG
AAGCCTGGAT ACGATTGGAG TGTGAATATT TTAAGACGGT GGAAGTTGAC AGCTTTTCAG
TCGTTCTCGG TTTCGTTCGT GGCATAGACA TTGACCGCAA GCTTTGGAAA GACGGCAGGC
TCGACGTATC TTTGCTAAAG CCTATCACAC GGCTCGGGTA A
 
Protein sequence
MAVESDRLYW SVSKEDQSYA GTSDPAPPVR SLILPLFFDI WLYSGTLAWF SVDDDHIFLL 
EGYTAASYTP PTIIFPASTL PGKMWQTLLQ TRTCTLSSAT TRESTAVLKR ALSGTEPKSF
KFDELRLKAS KNKKDYPFVV AESPIHMFCK VHELIALTND EALVILTVET FVIDGSVLGP
PTDEMTKRPN VTAKIDADLI EPVVSLGDGK VFPLMCLRSM PRPTRCAQRG SWTSTDFNNN
VGRGDHSLAY ETTEWSYRQH GGTCPLGYNA TTALIMPRPI GWISTYSQEG NVAHLAPYSF
FTDVSRGCEP IVAFSAFRKE GTIKKDAHKD AEEMKCFVYN MVDEDLAVKM NYSAAELGRN
ESEFELAKLT PGKARLVDAP VVSEAWIRLE CEYFKTVEVD SFSVVLGFVR GIDIDRKLWK
DGRLDVSLLK PITRLG