Gene PHATRDRAFT_50149 details

Gene Information

Locus tag: PHATRDRAFT_50149
Symbol:
ID: 7198850
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011695
Strand:
Start bp: 196188
End bp: 197874
Gene Length: 1687 bp
Protein Length: 535 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002185065
Protein GI: 219129793
COG category:
COG ID:
TIGRFAM ID:
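
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch in Python, not part of the database itself: the function name `span_length` is hypothetical, and it assumes that Start bp and End bp are 1-based and inclusive, which is consistent with the reported gene length of 1687 bp. It also assumes the gene span runs from the start codon through the stop codon, as the gene sequence in this record (ATG ... TGA) suggests.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


# Coordinates and protein length as listed in the record.
gene_length = span_length(196188, 197874)

# Coding bases implied by the protein: 535 codons plus one stop codon.
coding_bp = (535 + 1) * 3

# Bases inside the gene span that are not accounted for by the coding sequence.
# A non-zero value is what one would expect for a gene marked as spliced.
noncoding_bp = gene_length - coding_bp

print(gene_length, coding_bp, noncoding_bp)
```

Under these assumptions the gene span is longer than the coding sequence alone, which agrees with the "Is gene spliced: Yes" field.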


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTACTA CAACAAAAAA GCAACTCGAA ATCGATGAGG TCCAGGAACT AACAGAGGAA 
GAAATTAAAG CTCCAGGTTG TGGAATTCTG GGGCGTTACC CAGTTCTTTC CGTCCTTATT
TTTGCATCGG CTGGCATTGG AATTGGTCTT GGCCTCAGTT TTTGGGAACC AGATGACGAT
GACGATACAA AGGATAAAGT CATCAAGTGG CTCGGTCTCG TTGGCGACTT ATTTATTCGC
TCACTAAAAT GCGTCGTTTT ACCATTGGTA TTTATCAACG TGATTATATC TGTGGTGGAT
ATGATGAATG TGGGACGAGC TGGATCTATT GGTTGGAAAA CAAGTAAGTA CTGAGTCCAG
ATATCATAAG AGTAGAATAA TATTACGCTT TCGCTTGCTG ACTCAGATGT TGATTTTGAC
AGTTGTGCTT TATCTGCTGA CTACCGTCAT CGCGTCGATC CTTGGCATTA TCTCGATTGT
CAGTTTCAAA GGCCTTTTCG AGGAAGGAGA ATTCGAGGAA GCCGTTCCTG CATCAGTCAA
GCTTGGGTGT AACCAAGATG GAGAATTTCT CACTGAAAAT GCCAGCGGCG CTATTTCGTG
TGCGGCAGAC TCGGGGGAAA GCTCTGAATT CTTTATCACA GATGTATCCA TGAGCTTTGT
CCGTGCCTCC GGTAGTGTTC GTGATGACAT TTCTTTGAGC GATACGGTAT ATGATGGCGT
TTTTACGAAA CTAGTCACAG CTAACATTTT TGAGTCGTTT GTTGAAGCCA ATTTTGCTGC
TGTCGTCTTC TTCGCCATCG CTTTTGGGGT GGCAATCAGT CGCGTCTTTG ATCAGGGTGG
TGGTCCCGAC AAGAGTTTCA TTCTACCGTT TCTCAAGGAA CTGGACGGCG TATTCCTTAC
GATTATCAAC TGGATCATTA TGATTACTCC GTTTGCAGTG CTTTCTCTAA TTTCCTCGGC
GATTGGAAAG CAGGAAAATC TTGCGGACTC CTTTTCCAAT GTGGGATATC TCGTGGTTGC
CACAATGATT GCGATGTTCT TTCAATTTTT GGTCGTTCAC TGCCTTCTTT TCTTTATTGT
GACGCGCACT AACCCCTTCG AGTACTTAAA GCATCTGATT CCGGCGCAAA CAATGGCATT
TGCATGTGCC AGTAGCGCAG CGACAATTCC AATGACTCTC AAGTGTGTGC GCCAAACGGA
GCGGGTACCC GAGCCCGTGG CTCGTTTCGT TATTCCTCTT GGGGCGACAG TCAACATGGA
CGGTGGAGCA ATTTATTTCC CATGTGCGTG TATATGGCTT GCTGTGCTGA ACGGTATCCA
ACCAGATGCT GCTTCCTACC TTCTATTGGT TATTATTTCA ACGATCGGCA GTGCAGGCAC
AGCGCCAGTG CCTTCGGCCA GCCTCGTGCT TATTATCACG GCTTACAATA CTGTCTTTAA
CACCACCGGA GTTCCTGAGG GGTTTTCTTT CATTTTGGCG ATCGACTGGT TCATGGATCG
CCTACGCACT GTCGTGAATG TGACTGGCGA TGGCGTTGTG GCTGGAATGG TGTCACACCT
TTGCCCGGTG GACGACGACA CTGGGAATGT GCTTTACGTG GACAAAACTG AACAACACGA
AGCTGGAGCT GGCTCTTCTA CAGATAGTGA TATCAATCTA AATGCGGTGG AAGTCACGCG
AAACTGA
 
Protein sequence
MTTTTKKQLE IDEVQELTEE EIKAPGCGIL GRYPVLSVLI FASAGIGIGL GLSFWEPDDD 
DDTKDKVIKW LGLVGDLFIR SLKCVVLPLV FINVIISVVD MMNVGRAGSI GWKTIVLYLL
TTVIASILGI ISIVSFKGLF EEGEFEEAVP ASVKLGCNQD GEFLTENASG AISCAADSGE
SSEFFITDVS MSFVRASGSV RDDISLSDTV YDGVFTKLVT ANIFESFVEA NFAAVVFFAI
AFGVAISRVF DQGGGPDKSF ILPFLKELDG VFLTIINWII MITPFAVLSL ISSAIGKQEN
LADSFSNVGY LVVATMIAMF FQFLVVHCLL FFIVTRTNPF EYLKHLIPAQ TMAFACASSA
ATIPMTLKCV RQTERVPEPV ARFVIPLGAT VNMDGGAIYF PCACIWLAVL NGIQPDAASY
LLLVIISTIG SAGTAPVPSA SLVLIITAYN TVFNTTGVPE GFSFILAIDW FMDRLRTVVN
VTGDGVVAGM VSHLCPVDDD TGNVLYVDKT EQHEAGAGSS TDSDINLNAV EVTRN
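
Both sequences are displayed in space-separated groups of ten characters, so the grouping has to be removed before the sequences are used elsewhere. The sketch below shows one way to do that in Python and to compute a GC fraction. The helper names `join_blocks` and `gc_fraction` are illustrative and not taken from the database, and only the first line of each sequence is included to keep the example short.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(text.split())


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string (0.0 for an empty string)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of each sequence, copied from the record.
gene_head = join_blocks(
    "ATGACTACTA CAACAAAAAA GCAACTCGAA ATCGATGAGG TCCAGGAACT AACAGAGGAA"
)
protein_head = join_blocks(
    "MTTTTKKQLE IDEVQELTEE EIKAPGCGIL GRYPVLSVLI FASAGIGIGL GLSFWEPDDD"
)

print(len(gene_head), len(protein_head))
print(round(gc_fraction(gene_head), 3))
```

Applied to the full gene sequence instead of the excerpt, `gc_fraction` should give a value comparable to the GC content field, assuming that field was computed over the same span.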