Gene PHATRDRAFT_49656 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49656 
Symbol 
ID7198147 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011691 
Strand
Start bp324193 
End bp325593 
Gene Length1401 bp 
Protein Length399 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184442 
Protein GI219128484 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0309902 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GCAGTTGATT GTCGTTAAAG GTCGTTCGTA CAAATCAACT GGTCCCATAT AACTTGACAG 
ATTTCTTCTT GTTAGTACAT ATGGTACATG AAATTGGTCA GCTTCTGGGA CTTGTCCTGG
GAGCGATAGT TTTCTGTCCA AGTAGACTGC ACGCATTCCA GCCAGCTGCC TCGCACTCGG
GAAATCGCTT GAGACGTCTC CATTCTACAG CAGTAAATCA GCTTGAGGTA GACATAGCCA
AACTGAAGAG GGTTCTGAAA AAGGAATATG TTTCGTTCTT TGACCCAATG GAAACCCAGT
TTTATTCGCC ATCGGTATCT TTCATCGACC CCATGACGAG CTTTACAGGT GTCGAAAACT
ATAAACGTAA TGTGGACATG CTTGCTGCAC GAACCTCAAT GGGAAAGTTT CTTTTCAAAG
ACGCTGGTAT TGTTTTGCAC TCGGTAGAAG GCGGAGCTTT GAAATCTGAC GGCTCAATTG
AGGATATATG TACTCGATGG ACTCTTCGAT TAACAGCTAA AATCTTACCA TGGAGTCCAA
CTGCTCGCTT TTCTGGAATA TCAGTATACC AAGTCAAGGC AGGTGGGAGA AAAGGTGTTG
AGATAATCAA ACAAAGTGAT TTCTGGGATT CCATCAATAT CCAAGAGGGT GGCACCTACA
AAGAAGTCAA CAAAGGCCTC GCCATTTCTG ATTTTTTAAG CCAGCTGAAA CCTGAGGATT
TAGCTGCGCC CTCAGCTGGA GCCGAGCTTC CCTATCAATT ATTGCGCCGA GGGAATGGCT
ACGAAGTTCG ACGTTATCCC AGCCACAACG CCGTCGAAAT CAATTATGAG CGACGGGATG
ATGGTTTTAG TATGCTTGGA TCCTTTACGA ACGGTACGTT CATCTGTTGA TGAATGTCAC
TGTTGCTTTT GAATCTGAAA TTTACGTGTT TACTTTCCTT AGGGATGGAA CCATTGGCGC
CGGCTTTGAT GGCCATCCCT TGCGCTGGAT CCAAAACGAT GATGTGGCCT TTGGATTTTG
CTGCTCCCGG AAGCGACTAC CCACCCAAAC CTGCAGCCGC GCTCGAAAAA GCTAACGATG
GCCTATGGAA TGATTGCCGT ATTGTCACGG TGCCGGAAAA GGTAGTCGCC GTGCGCCTTT
TTTCGAATGC GAGTGTCGAG CCAGTCGTTC GGCAAGCCGA CAAGGAGCTT CGGGACGTTT
GTCTACGGGA CGGTATCGGA ATACCTCTTT CGAGTGAATC GCTGTTGCAA TTCGCACAAT
ACGATGCAAT ATTCAGTATG GGAAAGAGAA GAACGGAAGT TTGGATCGAC CTAGAGGATA
GTAGCCATCC TTGGTCTCAC AATCAGTGAA AAGTACACTA TCATTGAATG TTAGATAAGT
TAGCCCCTTC TAAAAAGGGT T
 
Protein sequence
MVHEIGQLLG LVLGAIVFCP SRLHAFQPAA SHSGNRLRRL HSTAVNQLEV DIAKLKRVLK 
KEYVSFFDPM ETQFYSPSVS FIDPMTSFTG VENYKRNVDM LAARTSMGKF LFKDAGIVLH
SVEGGALKSD GSIEDICTRW TLRLTAKILP WSPTARFSGI SVYQVKAGGR KGVEIIKQSD
FWDSINIQEG GTYKEVNKGL AISDFLSQLK PEDLAAPSAG AELPYQLLRR GNGYEVRRYP
SHNAVEINYE RRDDGFSMLG SFTNGMEPLA PALMAIPCAG SKTMMWPLDF AAPGSDYPPK
PAAALEKAND GLWNDCRIVT VPEKVVAVRL FSNASVEPVV RQADKELRDV CLRDGIGIPL
SSESLLQFAQ YDAIFSMGKR RTEVWIDLED SSHPWSHNQ