Gene PHATRDRAFT_46995 details


Gene Information

Locus tag: PHATRDRAFT_46995
Symbol:
ID: 7202232
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011680
Strand:
Start bp: 130323
End bp: 131833
Gene Length: 1511 bp
Protein Length: 420 aa
Translation table:
GC content: 46%
IMG OID:
Product: predicted protein
Protein accession: XP_002181141
Protein GI: 219121579
COG category:
COG ID:
TIGRFAM ID:
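
The Gene Length value agrees with the Start bp and End bp values if the coordinates are read as 1-based and inclusive. That reading is an assumption, since the record does not state its coordinate convention. A minimal sketch of the arithmetic, using a hypothetical helper named `feature_length`:

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be greater than or equal to start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(feature_length(130323, 131833))  # 1511
```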


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.104637
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GGAGGCCAGC ATTCCGATGA GCCAGCGAGC TCTAATATCG AATACTCTGA ACCATACAAA 
AGATGGTTGC AGCATTACGT AGATCTTTTT GTCTTGATAA AAAGTGGGTG AAATTGCGTG
CAGCAATCGT TTTTAAGAGG CTTTGATTGT TTTTCTGTGC TCTGATCTTC CCAAATGAGT
GATAACAACA ATACGAACCA CAGATGTGAG CCAAAGAACG ATCATGGCGA GAGCGTCGCA
TCTGTTGCTA GCATCTATCA GCTCAATGGA CAAAGAGTTC GTCATACACG CATCAACTCT
ACAAAGTACA AAGACACTTT TGATGATTCT AGAATGGTCG TAGTTCGCTA TTTCTTGGAC
CGTCCGGGCT ACGATTTACA CAATGCCACT GCCCGCCTCT GGTTCCACCC CCCGACGATT
TGCTCCCCCG AAGATGTACG GTACTTTCTC AAGAACCAAG GCGTTTCCCT CGAGGGTCTT
CTTATAGAAA TATACTTGGA CACATTTGAA AGTTTTATGG TGCTAGAATC TTGCGCAGCA
AATTGTGTCG AATGGAGCTT CGAAGGGACA AGCTATCAGG ATCCAGGTAT TTTAAATATA
CGATTGACTG ATCTGATTGA CGCCACCATG GAAAATGTAT CTTCAACAAC TCGGCTAGTT
CATCCTACAA AATATGGCGC CACAAAAGAA CAGGCCAATA CGTCGCCCGT CGGTCTTTTT
TCCTTCTCGA TGGTTCAAGC CATGCAGACG ATGCAAGTCA TGAGCAAGTT GATTCCAGAC
AGTGTCGCTG AATCGTGGAC GTTGACCAAT GGTCCCTACA TGTTTTTCCT TGGAGGGGTA
GTTCAGTTCG TGGTGGGAAT TTTCCAAGTA CTGCGAGGCA ACATCTACGG AGCTACTGCT
TTTCTTGCGT TCGGAGGCTT TTTGATGGCC GATGGTGCAG GAATAATACT TCGAAATCAT
TTCTCCGATG CTGGAAGTCG TGCCGTAGAG CTTATTGGAG TTCCCGATCC ATGGGGCAAC
TTCTTCCGTC ACGCGTACAT CTGTGCTTTC TGTTGTGTAT TGCTAAAGCA GACTCTGGTT
ATGAACAAGC TGACAACTGG CTTGATTGCA GTTGTCTGCA CAAAACTTGC CGCAAGCTCT
TTCACTGGTT GGAGTGAAAC TTTTGAATGG ATGGAAATCG CTTTGGGGTG GATTGTATCC
GTGTTTGCCT TTTACGTTTT CACAGTCGAA GTGACAAACG AAGTGTATCA TCGGGAGGTC
TTCCCGATGT ACAAATGGTC AGAAAGGCAC AGTCCTGAAG AGCTGTTTGG TGCTACTGGT
CGCATTGGGA CGCTCACATC TAAAGCAACC AAGCTTCGCC AGGCAAACTA CCCCACCCCT
TTGAATATTC GCTGGGCAAA CACAATGCAC GGATCGCAAC AACTCCAAGA CAATTAACAG
TGCTTGAGGA TGGTACAGAG TTCTTCCATA TATGTCGTTG ATAATGTAAA CAAGCACATT
ATGCTGCCCA A
 
Protein sequence
MSDNNNTNHR CEPKNDHGES VASVASIYQL NGQRVRHTRI NSTKYKDTFD DSRMVVVRYF 
LDRPGYDLHN ATARLWFHPP TICSPEDVRY FLKNQGVSLE GLLIEIYLDT FESFMVLESC
AANCVEWSFE GTSYQDPGIL NIRLTDLIDA TMENVSSTTR LVHPTKYGAT KEQANTSPVG
LFSFSMVQAM QTMQVMSKLI PDSVAESWTL TNGPYMFFLG GVVQFVVGIF QVLRGNIYGA
TAFLAFGGFL MADGAGIILR NHFSDAGSRA VELIGVPDPW GNFFRHAYIC AFCCVLLKQT
LVMNKLTTGL IAVVCTKLAA SSFTGWSETF EWMEIALGWI VSVFAFYVFT VEVTNEVYHR
EVFPMYKWSE RHSPEELFGA TGRIGTLTSK ATKLRQANYP TPLNIRWANT MHGSQQLQDN
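
The sequences above are laid out in space-separated groups of 10 residues, so the whitespace has to be removed before checking a length or recomputing a GC percentage. The following is a minimal sketch of that cleanup. The function names `clean_sequence` and `gc_fraction` are hypothetical, and the record does not say how its own GC content figure was computed, so this is one plausible way to do it rather than the database's method.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block and uppercase it."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above; paste the full block to check the whole gene.
first_line = "GGAGGCCAGC ATTCCGATGA GCCAGCGAGC TCTAATATCG AATACTCTGA ACCATACAAA"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Applied to the full gene sequence block, `len(clean_sequence(...))` can be compared with the Gene Length field, and the same cleanup applied to the protein block can be compared with the Protein Length field.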