Gene PHATRDRAFT_50479 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50479 
Symbol 
ID7199275 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011698 
Strand
Start bp172509 
End bp174938 
Gene Length2430 bp 
Protein Length698 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185439 
Protein GI219130578 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAGTCTTTCG TGCACGAAAA TTCGAACCCT TCTTGCGATC CAGCGTGGGA GAGAGTCAAC 
CTGCTGCCCC TTAACTTTCA ATCTCCTCAC TGCTCTAGTC AATTTCCCGG CACTCTACTT
TCGCTTTTGA AAGTTCGAAA GTCTTTCGAC TGCTACATTG GCAATATCTA ATTCTCTTTG
AAGTAGGAGA CCCCGTTTGT CTTCCTCGTG TTCTCATCGT ACCGTTGTGT CCGACGCAGA
CTGCGATCTT CCGAGGAGCG ATCGCTTGGA TTCAAAGCGC GTTCCACAAA TCATGACCTC
CACCACTGGT AATGTCGCTG TTGATCCCGC TACTGCGGAA GGCCTCGGCA TCCCGCTGAC
GGACATCACG GACCCGGACG ACTCGGACGT GCTCTGCGGT CGCGGAGGTG CCGCGCTGCG
ACATCCGGGA AATCAAACGT ACCGGCGTTT GGTCAATCTC AACAAGGGCC TTTACATTAC
CTGCCTCAAA ACGGAAAAAT TGAAAATTTC CCGTTCAATT GTCGCGGCGA TTCGCGAGCA
GAAGGGTCGA TTCTTGGAAA AAGACTCGAA AACGGGAAAT TGGTACGATA TTGGGGACAA
GAAAGCTGTG GAAAAGACTT CACAAGCACT CCGCGAAGGA CAGCCCAAGC TGCGCCAGAA
GATTGTGGAA ATGGGGGGTG GTGTAGCGGG TACCGCAGCC TTTATGGAGT CGCAATTGGG
GGATTCGAGC AGCGCATTTT CCCACACAAA TAGGAACGAT ATACCCCCAC CTCCACCTTC
GATGTTGGGG ACTCCGGATA TGTCGCATTC TCCTCATCTA GGCCACAACG GAAATGCTTT
GAATTCGGTC GGCATATCGG CACATGCCGG AATGCCTGGA CTCTTGCCGC ATAATCCTCA
CGACGCGGCT ACCGCTGCAG CGTTGCGGCG CCAACATCTG GATCTCCGTC AACAGCAACA
AGACTTGGAG CAGCAGTTGC ATGCGGCTTC GATGGGAAAC GCGATGAACT TCAATCAACA
ACAGCAGCAA CAACAGCAAT CGCGTTCCAA AGACCTTCAC CAGGATATGT TGCAGCGCCT
TAGTTTGCGG GACGTCTCGT CCGACCCCAA CGCCTACGAT ATGCAGGAGC AAACCAATCG
TCTTCGGCCA TCCCTGACAC AGCGGGGGCC GCAGATAGCA CAAGAATTGG GGATTCGAGA
TTCCCAACTC TCCCTTTTAT CAGATTTCTC GGCGTATGGA TCCGGTCAAC AACTTTTGGT
TAACATGTCG CTCGGGAGTA TGGACCCTGG ATCCTTTCGG TACCAACAGC ATCCACAGCA
AATGCAGCAG TCACTGCAAA GTATGGATTC TGGCTCGTTT CGTCAGCAGC TGCAGTCTCT
ACAGAGTATT GATTCCGGTT CCTTTCGACA GCAAATGCCG CGCCAACAGC AACACCACCA
GCAACAGCAG CAACAACAAC AGATGCAGCA GCAACATTTT CAACAACAGT ATCAGCAGCA
ACAGCATCAG GGATTTTCTG CACTGGACCA TGGTCAAAGC GACGAGTGCC ATCCCCGACC
TATCCAACAG TCGTTGAACG ATAGAGGAAC TAGCTCCACC ACCGCGCGCG AAGACTACTC
CTCAAGTAAT GATGGCAACA GAAAGACAAA TGATGGAGTT TCGAATCCAT CTGTGAACAG
CGTGGTAACG GCGTCTTCGA ACAGTGATCC ACATTGTAAC GGCAACACAA ATTCAGGCTC
CAGCACTAAC AGTAGCAAGC TTGCGGGCCT AGATCGTCGT CGTGTGTTCG CCAAGATGAA
GTACACTCGA CCGCCATCTG AGATGAAAAT GAAACCGGAA GATTCGGCTC GTTCGATGCA
AGACGGCATG TCAGACTTCC ACATGGTCGA GTCCACCATG AGCTTTCTCT CCAACATGTC
CCAACTGTCA GCGGCGGACA AAGGTGGTAA TGGAGAGAAG ACGGCGTCGG CGGGCGCGGA
CGGTGCTTCG TCCGCAGAGA TACTGGTGTC TGCCGTCCCG ACACCTGTCT TTCCAGCTGG
TGTAGAAACA GCAAAGGTGA TTGATCACAG CAGTGACAAT CATGAACGCA TGAGTACATA
CTCGGAAGCA GCGTCGGGGA GTCGTCGCTC GATCATGTCT GGTCTATCAC GGATTAGTGA
CGCGGATATA TCCATATTCT CGGACCTTTC CCGAAAGATC GGCAACGTCT CAACACGATC
CATCGCCATG AGCGATATTT CGGCCATCGA TATGCAAGAG CAAGACAACG AAGACGAAAG
CACAACTTCG AACTTTGAAG GCGCTTCCAT TGACCCTATT GATCCTATAC GGTCGCCACA
ACGGCTTTCC GGCGGGAATT ACTCGGAACC GTATGACTTT ACAATTTGAT AGCATTGTTT
TTAATTGTTA ACTGCACCAA AGTTATGGCT
 
Protein sequence
MTSTTGNVAV DPATAEGLGI PLTDITDPDD SDVLCGRGGA ALRHPGNQTY RRLVNLNKGL 
YITCLKTEKL KISRSIVAAI REQKGRFLEK DSKTGNWYDI GDKKAVEKTS QALREGQPKL
RQKIVEMGGG VAGTAAFMES QLGDSSSAFS HTNRNDIPPP PPSMLGTPDM SHSPHLGHNG
NALNSVGISA HAGMPGLLPH NPHDAATAAA LRRQHLDLRQ QQQDLEQQLH AASMGNAMNF
NQQQQQQQQS RSKDLHQDML QRLSLRDVSS DPNAYDMQEQ TNRLRPSLTQ RGPQIAQELG
IRDSQLSLLS DFSAYGSGQQ LLVNMSLGSM DPGSFRYQQH PQQMQQSLQS MDSGSFRQQL
QSLQSIDSGS FRQQMPRQQQ HHQQQQQQQQ MQQQHFQQQY QQQQHQGFSA LDHGQSDECH
PRPIQQSLND RGTSSTTARE DYSSSNDGNR KTNDGVSNPS VNSVVTASSN SDPHCNGNTN
SGSSTNSSKL AGLDRRRVFA KMKYTRPPSE MKMKPEDSAR SMQDGMSDFH MVESTMSFLS
NMSQLSAADK GGNGEKTASA GADGASSAEI LVSAVPTPVF PAGVETAKVI DHSSDNHERM
STYSEAASGS RRSIMSGLSR ISDADISIFS DLSRKIGNVS TRSIAMSDIS AIDMQEQDNE
DESTTSNFEG ASIDPIDPIR SPQRLSGGNY SEPYDFTI