Gene PHATRDRAFT_47462 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47462 
Symbol 
ID7202579 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp683345 
End bp685594 
Gene Length2250 bp 
Protein Length491 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181611 
Protein GI219122561 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.28983 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTCTTCCTTG TGTATCGCAC AAACGCTTCA AATTGAAATA TACCACCACC GGAGCATCAA 
TCGTTACACT CACATTCACA CACACCCAGA GGGAACAACA ATATTAGGGA CAAGTAGAAA
CATTCTGTAG TGGCATCATG AGACACAGCA ATGGTCGGGG GGTTCCCGCT GAGGAAAGCG
AATTGCGGAC ATCGCCGGTG ATCAACCGGA TCCGACCCCG CCGTGAAGCA CAAGCCTACC
GCATCGCCTA TTCGTGTGAG CACGCCGCGC GCATTTTCCG GAGTACTAAA CGGCGCATTC
GATGGTAAGT TTAAGTCAGC ATTGTTCTTG TCGTTGGGCG TGACCTCGAG TTTGGCCTAG
TCGTTCCTGC ACCATCGCAC CATGGTGGAC CCCGTCCGAC ATCGATTACC GATGCTTGTT
CGGAGTGGGC TTGCTAATTT TCATCGACGG ACACCCGTTC AATATTTTTC TGTCTTGGTT
CTGTAATCGA AAACAGCTTT GTTGTTTATG TATTTTTGAT AAGGTATACA GTCTTGTTGA
TGAATATAAC AGTTGGGACA TGAACTTTCC GTTTTGACCT GGGGTATCTA GACACTTCAC
GTGTCCTCGT GTTCTTGTGT GGGAGGGGCA CAGCTTAGCC CTTCCAATCG CGCTCGGGAC
CCCGTCTTAT ACTTATAGCA GCCCTCTTCC GATGGGCCGA CCATGATGGA ATCACACATG
GTAGTCACCG TTCGCCAAGC TTTCCACTCA CCGAGTCCCG TCCTCTCCTC GCGTCACAGG
TCATTCGGGC TTACCAACGT GAATGCTTTG CGAGACGGCG GTAAAGGCTT GGACTGTCGG
GGGGAAGAGC ACGCCGTCGA GCTGGTTTGG AGCGTTCGCA GCGGAAAGAC ACGGGTTTTC
TGGAACGGAA GGGATATTTC GAATCTGTTT CGGGGCGGGA ATCGATCCGG AATGGTCGAG
TTTGCTTGGA ATACTCGAAC GGAGGAATCT CTGAGGATCG TCGCCCACGC GGAACCCCGT
CGAGGAGTCC GGCAATACGA TCTGTTGGTG GACGGAATCA GTGTTTTCAA CCTGCCCAAG
TTGGCCGAAA TTGGACAACC TTTGTCGGCC ACTTCAACCC CGTGGGAGCT TCCCTTGGAC
ACAAGCCACG AAACCGAGTC CCGATCGCCA CGGTCGTCTC CGATACCGGT GCACACGATC
GACTGCATTG AATCGATCGA TAGCATGGAG CACCAGACTT GGACGGATGC CGATCAGGCC
AAAGCTCGGC TCGCGAGTGT TGGTCTAGCG TTACATCCAG ACACTGAAGA GAGCGACGAT
TTGCACTCGG ATTTGTACTC GCCCATCGTT AATTCGATGC GCAACCTGAT CACGGCGCAC
CTACCACAAA CGGAGGAAAC CGTCTCGCGG GCGTTTACGA ATGCACTGAT CAAGGATAGT
GATTCGTATA CGAGCGAATC GTCCCTTTCG GATTCGAGTT GTCTTCACGA CGCGATGCAA
ATTGAAGTCA ACGCGCTATG GGAAGCCTTC CGGTGGGTCA GGTCCAATGC CGATCAAATC
ATGTTTTCGG ACGGCGAGGA GTTACAGCTC GAGTACATGC GTGAGCAAAT CGAAGCGGTG
TTTGCCAAAG TATGTCACGA ATGTTTGACA CCGGGAGAAG CTTCGCGTAT TCTTTTGCAC
GTCGGTGCCA TTCTTGGTCT CAAGTTTCAT CGCAATATTC TCATGGATAC CATTCTCGTG
GACGGCTTGT CCAATTACTG CACCGTAAAC GACCTGGAAG CTGCGTTACG GCCGTTTGGC
CGGATCGTTT CGATTGCCAT GGTCCAGGGA CATGGTTTCG GATGTTGCCG ATTCGTAGAC
GACGGTCCGC TGCTGCGGTT ACAACGGAGA GGGCTTACGT TTACGATTGC CGGAACGAAA
GCGCAAGGCA TGGTCATTTC TAGTCTTCAC GAATTCAATA CCAGCACACG TTATTCTCAA
GGAGAGAACG GAGAGTTTTC CGAAGAAGAG CATTCCGACC AGCCTACAAT GCAGCGGACT
TCCGCTTGTT CTATCCATGG CGAAAGGAGG AATTCTTGTG GAGTACCAGT AACACCGATG
TCCGAGCAAA GTGTTCGAGA AATGATTGAC TTTGGGGACC AACATCAGAT CGACACCTTA
GACTTTCTTC GGTCGCCCGA CTCGGTTACA CGAATGACAT TTGGTCCAGG CATCATGCCA
TCGAGTCTCA CCGCTGCTTG TCTGGACTAG
 
Protein sequence
MRHSNGRGVP AEESELRTSP VINRIRPRRE AQAYRIAYSC EHAARIFRST KRRIRWSFGL 
TNVNALRDGG KGLDCRGEEH AVELVWSVRS GKTRVFWNGR DISNLFRGGN RSGMVEFAWN
TRTEESLRIV AHAEPRRGVR QYDLLVDGIS VFNLPKLAEI GQPLSATSTP WELPLDTSHE
TESRSPRSSP IPVHTIDCIE SIDSMEHQTW TDADQAKARL ASVGLALHPD TEESDDLHSD
LYSPIVNSMR NLITAHLPQT EETVSRAFTN ALIKDSDSYT SESSLSDSSC LHDAMQIEVN
ALWEAFRWVR SNADQIMFSD GEELQLEYMR EQIEAVFAKV CHECLTPGEA SRILLHVGAI
LGLKFHRNIL MDTILVDGLS NYCTVNDLEA ALRPFGRIVS IAMVQGHGFG CCRFVDDGPL
LRLQRRGLTF TIAGTKAQGM VISSLHEFNT STRYSQGENG EFSEEEHSDQ PTMQRTSACI
MPSSLTAACL D