Gene PHATRDRAFT_11940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_11940 
Symbol 
ID7200664 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011675 
Strand
Start bp748454 
End bp749596 
Gene Length1143 bp 
Protein Length330 aa 
Translation table 
GC content56% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179906 
Protein GI219118255 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ACGGCCGTTG TGGTCGGGCA AGCTGCGAAA CGACGGATTG ATTCCCATCC ACACCACACC 
CTGTATCAAG CCAAACGCGT TTTGGGGCGG CCCTCGGACG ACCCCGCCAT GACGGAATTG
CGGGAGGAGG TCGAATTCGC CGTGACGGCC GACCCCGAGC ACGGCGTCGT CTTTGGTGTG
CCCGAGACGT CGCGGCCAAT TTCACCACAG CAGGTGGGAT CGTACGTCGT CAGTCATCTC
ATGAGAATCA CCGAAACCTT TTTGGGACAC GACAACATCA AATCGGCCGT TATTTGCGTC
CCCGCCAAAT TCAATGCCGC GCAAAAACTC GCCACGTACC AAGCTTTCCG ACAAGCCGGT
GTTACCGTCG CGCGTGTCGT AGAAGAGCCC ACAGCAGCCG CTTTGGCTTA CGGGTTGAAT
CGGAAAGAAG GTGTGGATCA CATCCTCGTG TACGATTTTG GTGGAGGCAC ACTCGACGTT
TCCTTGCTGC ACGTGAGCGA CGGGTTCGTC GACGTCATGG GCAGCGACGG AGACGATCGA
CTGGGTGGTG CGGATTTTGA CGCGGCCATT GCTCACTTTT TGCTCGAGCA TCGCCATGGA
CAGGCCGTAG TTTCTCGAGT CTCACAAGCG TTACAGTCAC TGGTCCAAGC TCTGCCCAGC
AATGTGGATC TAGAAGACCA GCTTTCGGCA TCGTGTACGT CTCTACAAAC GGTGCCGCTT
TGTACCGTAT CATCCTTCCA TACGTTAGGA GAACAACTCA AGATTGCGTT GTCGGCATAC
CCGGATGGCA ACGGAACAGT CGAGGCGGAG TGTCTCGGAT TTCCCGAAGA CTACGTTGAC
CCAGATGTGT CTCTCGAAGG TTTTTGCACC GACCTGACCA CTTTCCGGCT GTCGCTAACC
TCTCGCGAGT ACGAACAGAG TGTGCAAGCG CTGTACGCAC GCTCCATTTC GCCCGTGACA
CGCCTACTGA ACGACTTGAA TTTGCGTCAC GATGATGTTG ACGAGGTTGT CATGGTGGGC
GGGACAACCC GCATCCCACA AATACGAAAA CTCGTTCAAC AGGCACTGCC ATCAGCATCT
GTGAATACAC ATATTGATCC GGACATCACC GTGGCTTACG GTGCCGCTTC CGTAATAGAC
TGA
 
Protein sequence
TAVVVGQAAK RRIDSHPHHT LYQAKRVLGR PSDDPAMTEL REEVEFAVTA DPEHGVVFGV 
PETSRPISPQ QVGSYVVSHL MRITETFLGH DNIKSAVICV PAKFNAAQKL ATYQAFRQAG
VTVARVVEEP TAAALAYGLN RKEGVDHILV YDFGGGTLDV SLLHVSDGFV DVMGSDGDDR
LGGADFDAAI AHFLLEHRHG QAVVSRVSQA LQSLVQALPS NVDLEDQLSA SYVSLEGFCT
DLTTFRLSLT SREYEQSVQA LYARSISPVT RLLNDLNLRH DDVDEVVMVG GTTRIPQIRK
LVQQALPSAS VNTHIDPDIT VAYGAASVID