Gene PHATRDRAFT_47205 details

Gene Information

Locus tag: PHATRDRAFT_47205
Symbol:
ID: 7202191
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011680
Strand:
Start bp: 810072
End bp: 811862
Gene Length: 1791 bp
Protein Length: 555 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002181446
Protein GI: 219122215
COG category:
COG ID:
TIGRFAM ID:
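
The start and end coordinates above match the reported gene length if the interval is read as 1-based and inclusive. The record does not state its coordinate convention, so that reading is an assumption. A minimal sketch of the check, with `inclusive_length` as a hypothetical helper name:

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a 1-based, inclusive interval (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates copied from the record above.
gene_length = inclusive_length(810072, 811862)
print(gene_length)  # 1791
```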


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.551537
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCTCCT CCAACAGTCC CGGAGCCGCT GTTGTCATTC CTTTTGAGCC CACGAGCAAA 
GACGTTGTGC TTTCATCGTC TGAACCCGAA AACCACCATC TCGGGAATGT CTTCTTTCAC
CATCTGCTCC AGAGCTTGCG CAATATGCCG AACTTACAGA GCACGGCTTC TCGCGTAGTC
GACGCAGTCT GCGACGAACG TCAAGGGAGG TTCCTGAATG TTCTTCCCAA TACGACAGAA
CAAGAAGTTC GTCTTTGTAC CGTGCTGACC CGCGATGAGG CCACTGCTCG CGTATACCAG
GCCCTACAAC AATCACAGAT TGTGTCGGGT CGCACACCGC CTACGAAAAG GGCTCGTGTA
CAACGTAAGG GTGAAGCGAC CAATCTTCCT GTCTCTCCTA CTAGTACTAC TACAAAAAGA
GGTCCCGACG AAATTTCTCT AAAAGTTCGT TGCAGGTTTA GACGGAACGT CTACCGCATT
GTCAATGGGA ACAATCCGGC GAGTAGGATT CACCCATATG CGATTGAACT TCTGGAAGCC
GTCTGCAACC AGGCTTTTAG TAATGTCATA CAACATGCAT TTCAACAAGT GGGTTTGCGC
TACACGAATG GCGGGGAAGA GGAGCCTGAA GAACACAGAA TAATGCGTCT TCAAATGCGA
CAGCGGTTTG TAGCAGCATT GGAGGCCAAT ATTCCAACGT TAGACTTTGC CAAAACTTTG
GTGAGATCGT GGGATGGGGA GTTGATTTGC CGACCTGAAG CCGAGACTAT CCCAAGCAAG
GCTTTGGATG TGGAAACGGA TGTCCAAACC GTGACTTCCG AAAATCCACC AACATTCAGC
AAGTCGGAAT CTTGCAATGA TCTATTGCAA GCGAGGGAAG AGGAAGTGAC AAAAACCTCG
ACCTCGTCAA GAGCATCTAA AATAAAGGCT CCCAAGAATG TTTCGTCCAT CTCGATGGAA
AATCCTACTA TTGCTGCGAA TCAAGTTGGG ACCTTACAGA ACTCAGTCTC GCAACATCCA
CAGCAACATG CGACCAAGTC GGCCAAGAGT TTGCCCATTC CCAGGAACGC CTCCAGTGCG
CAATGCCTTA TATCTCCCCT TACGGATGGC CCCTTAAAGC TGTCGCCGCT CACGTCGCGC
ACAAAACCAG CTGCCTCCTC TTTCCTTTCC TCTCTGGTCA CTTCGCCTAT TCGCAAGAGT
TCCAGCACCC CTTCCAAAAT GATGTTCAAG AGGGAATCAT TCGACGATTT AGATGAAAAT
GATGTTATGC AAGGTCTCGA CTTTTTGGAC GACACGCATC ACGACTTGGG CGATGGATTC
CTGTTGCACG ATGCACTTGC CGCAGACTTG GGATCGCCGG AGCCTTTACA TTTTGAAGGA
AACCACGTAG GCTCGACCAC ACCACCATTA CCGATCTCGC CCCTGGCAAC ACTAGACCCG
AATACGACGG AGGAACGAAC CGGAAAGTGG ACCATTGTTG AGAAAAGGGC ATTTTTGCGA
GGCCTGGAAC GGTATGGAGC CGGGCGTTGG AAGCAAATCT GCGATATGAT CCCTACTCGG
TAAGTCGTTA GGAAATGACC AACAACCGCG TACGCGTGTG CCGTTTACAC AGTATCATCC
GCATCCTTTG CTGACCGTTC TCACGATTCA TTTGCTTCTT TGCGCTTTGC GGTCGGCGAC
AGCTCGTACG GACAAGTAAA AAGTATGGGT CGCTTCGTTG TGAAACGCTA CAACCTTTCC
AAGAATGAGC GGCCGACGGG ACCCGTCGTT CATTTCCTGC AAAAACCTTA G
 
Protein sequence
MASSNSPGAA VVIPFEPTSK DVVLSSSEPE NHHLGNVFFH HLLQSLRNMP NLQSTASRVV 
DAVCDERQGR FLNVLPNTTE QEVRLCTVLT RDEATARVYQ ALQQSQIVSG RTPPTKRARV
QRKGEATNLP VSPTSTTTKR GPDEISLKVR CRFRRNVYRI VNGNNPASRI HPYAIELLEA
VCNQAFSNVI QHAFQQVGLR YTNGGEEEPE EHRIMRLQMR QRFVAALEAN IPTLDFAKTL
VRSWDGELIC RPEAETIPSK ALDVETDVQT VTSENPPTFS KSESCNDLLQ AREEEVTKTS
TSSRASKIKA PKNVSSISME NPTIAANQVG TLQNSVSQHP QQHATKSAKS LPIPRNASSA
QCLISPLTDG PLKLSPLTSR TKPAASSFLS SLVTSPIRKS SSTPSKMMFK RESFDDLDEN
DVMQGLDFLD DTHHDLGDGF LLHDALAADL GSPEPLHFEG NHVGSTTPPL PISPLATLDP
NTTEERTGKW TIVEKRAFLR GLERYGAGRW KQICDMIPTR SYGQVKSMGR FVVKRYNLSK
NERPTGPVVH FLQKP
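
Both sequences above are printed in space-separated blocks of ten residues, so the spacing has to be removed before the reported lengths (1791 bp, 555 aa) or the GC content can be checked. The following is a minimal sketch of that cleanup. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First line of the gene sequence, copied from the record above.
first_line = (
    "ATGGCCTCCT CCAACAGTCC CGGAGCCGCT "
    "GTTGTCATTC CTTTTGAGCC CACGAGCAAA"
)
dna = clean_sequence(first_line)
print(len(dna))  # 60
print(f"GC fraction of this line: {gc_fraction(dna):.2f}")
```

Applying the same two functions to the full gene sequence gives the length and GC content to compare against the values reported in the Gene Information block.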