Gene PHATRDRAFT_49639 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49639 
Symbol 
ID7198272 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011691 
Strand
Start bp276869 
End bp278809 
Gene Length1941 bp 
Protein Length435 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184435 
Protein GI219128469 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTCCTTTTTT CAATACCAAT CGGATTGGTC TTCTTGTTAG TCCTCGTCGT GTATACCTTT 
TTTGGTACCA AACGCGAATC CTCCACAAGA TCTTTCCACA CTACCGCTTC CATCACTTCT
TTGCCAGACT TGTGTTCTCG TTGTGGATTC GTAGCTAGCT CACTGTGTAT CTTTTGCGGA
CTCTACGCTT CTTGCAATGA AACTTCAGCT CGCTTCTCTT GCTCTGTGGG CGTCATCTGC
TCTCGCGTTT GCGCCTAACT CTCTACCTTC TCGGAACAAT CGTGCGGTAG GGTATGTTCC
TTCCTCGATC TCCTCGCTGC ACGCTGCGGC GTTGGAACCT CCACCGCTGT CCCCGCTCAC
ACAATGGGGC GATCGGATCG AGAACATTCG GGCGCTGCAG GCCGAGCTCA AAGCCCGCGA
GTTGCCTCCC TTTGCCCCGG AGCTCTCCGC CGTCAAAGAC TGTGGTCTTG CTCGTGAGGA
CACCGAAGGA CAACTCGCCT ACGTCCGCGA CAACGCCATG CGCATCAAAA CTATGATGCA
GGAACACGGC GCCGTCGTCT TTCGAGATTT CGATCTCATG AAGACTCAAG AAGGATTCCA
AGCCTTTTAC GCAGCCATAG GAATGAAAGC CTGTTTGGAT CCCTTGCATT CCGTATCGGC
ACGGCCAACG GTGGATGGAC AAAAGAATTC TCCCGTCTAC GAAGCCGTCA ACAAGGAATC
GCGCAAGAAT TTTTTTATTG GTACGTGATG GTGCTAGATA TCGACAGAAT GAGGCTTGGC
ATACGACGGA TAATGTCGTT TTCTCATGGC ATTTTTTTGG TACGTCGACA GGCATGCACA
ATGAATTCGT CGGAACGCGC GCTCCGCGTG CTGCAGCCTT TGTCTGCTTC AAGGCCGCCG
AAACCGGCGG GGAATTCCTT GTTGCCGATG GCCGTCGCAT GTTTCGTGAT CTCGATGCCG
ATCTCATCGA AGAGCTCTAC AACCGAGAAA TCCGTTACTC GGTCATGGAA TTGCCATTTT
TTGGATGGAT CGATAACTTG CCCTCGTTCG TTCAGCAACC CGCCATGAAT GTCGTTCGCG
GGGCAGTTTC GGCGGCGATC AACGCCAAGG TAGATTTTGA CGTGGAATTA CTCTGGGGCG
AAGGTGGATA CGACGGTACC CGCATGTTAC AAGCTCGAGC ACCATCGCAG CCGCCGATTG
TCAAGCACCC CGTCACTGGA GATCCGACCT GGTTTTGCAA CGTGCATTCT CACAGTTCGA
AACTGCGTCA TCAGCGCGAA TCAATGTACG TAACGACTAC TGCGCTGCAA TCGAAAAGAT
CCGTCGGCGA GATTCATCTC ACAGTTGTCT TGTCTATCCA TCCCACAGCT ATGGTGCGGA
ACGTTTCGAG GACGGTGCTT CCCAAATCAA CAAGTCCGAC ATGTTTTTTG GTGACGATGG
CGAGCTGTCG GAGGCACAGT TGAAGCAACT GGATGAAGTC ACGGTGAAAA ATACCCGCTA
CGTCAAGATG ACGGAAGGAG ATGTCGTGCT TTTGGACAAT TATAAAACTA TGCACGGGCG
CAACGTCTTT GACGGAACCC GCAAACACGG CGTGGCCTGG TTCGAGGGAT GGGAAGGTGA
AGCTGATATG AAACAACAAT TTCAAGCCGA AGGAGCTTCT CAAAAAGTGG TTGCGTAAAC
GAACAATTAA CCCGCTCCTC GTTCCTGCGC AAGTCTCCAT TCCATAGACA CACTCGCGTG
TTATCAAATC CAGTAGCGCC TGCTTCTAAC TAGCTCTTTT CGTCTCGAGC GCCGAGGAAA
AGCGGCACTT TAGTTTTGTG GAGTCCACCA AATTCTTAGA TTTGAATCTT AACAAAGAAA
TTGAGTCTTA ATGTAGGTTT GCGGGGCGGA ATGAGAGCGT TGTATCTATT GAGATCGGAT
TAATAACCTT CACCTTTTAG A
 
Protein sequence
MKLQLASLAL WASSALAFAP NSLPSRNNRA VGYVPSSISS LHAAALEPPP LSPLTQWGDR 
IENIRALQAE LKARELPPFA PELSAVKDCG LAREDTEGQL AYVRDNAMRI KTMMQEHGAV
VFRDFDLMKT QEGFQAFYAA IGMKACLDPL HSVSARPTVD GQKNSPVYEA VNKESRKNFF
IGMHNEFVGT RAPRAAAFVC FKAAETGGEF LVADGRRMFR DLDADLIEEL YNREIRYSVM
ELPFFGWIDN LPSFVQQPAM NVVRGAVSAA INAKVDFDVE LLWGEGGYDG TRMLQARAPS
QPPIVKHPVT GDPTWFCNVH SHSSKLRHQR ESIYGAERFE DGASQINKSD MFFGDDGELS
EAQLKQLDEV TVKNTRYVKM TEGDVVLLDN YKTMHGRNVF DGTRKHGVAW FEGWEGEADM
KQQFQAEGAS QKVVA