Gene PHATRDRAFT_48789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_48789 
Symbol 
ID7195102 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011687 
Strand
Start bp242359 
End bp244167 
Gene Length1809 bp 
Protein Length602 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183326 
Protein GI219126149 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.450874 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCGTCA CTCCATCGCA TGCTACCAGA GAAGCGAGCA AGGATGCCTA CGAGCTTAAC 
ATGGCCGTAG CTGTGTTGTT CTTCCTCCGT CGAAGCAAGT TCGAAAGATC CGCAAATCTG
TTTCAGACTG AGCTCCGCGA GACTTACGGA AATTTCAACG GGAAAACATT TAAAGGAACT
CTCAGATGGG AGCCGCTTGA ACAAAAATCA CCGTTCCCCG AGAAAGGGGA CAACGAGGAT
GAAAGCTACA TAAGTGAAGA CTCCGACGAT TTCGAATGGA AACACTTTGA TAGCAACCTA
GTCAAAATGC GAGTCGGGGA CCACTTCGTC GGAGACAGTG GTAGCGATAG CAGCTCCAGC
TCCGACTCTA CCAGCGAGCT GACTGCAAGC CATGTCAAAA TGAATATAAG GTGTACCACG
CGTCAACCAA ACAAGGATGG TGCGCGCGAT GAAAGCGGCA TTATAGACGT CGACAAAAAG
CAATTGAGGA CTCTGGACGG TAGCGAAAGC GATAGTACAG GGGACAGCAA ATCGAGCGGT
GAAGGCGACT GCATTCCAAA ACAGGGTCAC GCATCAAACA AGTTTGCAAG TGCCTCCTCG
TCCAGGCCTG CTCTTATTGC ACAACACTTA AAGCCACTTC CAAAAGGAAG GAATTTTCTT
GATAGTGACA GCGATAGCTC TTCCAGCGAA GAACAACGCG CAAAGCCACT TCCAAAAGGA
AGAACTTTTC TTGATAGTGA CAGCGACAGC TCTTCCAGCG AAGATTCGGC TCAAAAGAAA
AAACCGCCGA TTATGGCCCA AAGACTTGTG CGACGGGAGC CACAGCTATG TAGCATCGAC
ATCATGCCAA CGAAGATTGC TGCTACCAAA AATTTACCAA AATACGATTC TCTGAATCGA
AAAAAGAAGC TTGCTTCTCA GGTAAAGTAT CGAGCGAAGG TTCTCGACGA CAGCGATAAG
AGTTCCGAAG ATGAACGCCG AGAGCGCACT AAGTATCCTG CCCACAAAAT ATCAAACGAT
GTTGAAAAAG TAGGATCGAG GGTATCGAAT TCCATTTCTG CCCAAATTGT TACCCAAAAC
GACTGCAACG ACAGCAGCAG TAGCTCCAAC GAAAGCGACA GTACCGCCAG CGACGAAGAA
ATTCGATCAG GATCCGTTTG GCGCTCCCGA CAGTCTATTA CGAGTATTAA CAATGAGAAA
AACAAGGAAA GTGCCCGGGT AGACGTCAGC TCATCACAGC AAGACGTGAA ACTGCAGAAA
ATGAATTACA ATGTACAGAA TTTGTCTACT AGGACTGCAC CATTGAAAGC CAACGGCCAA
GAAAAAGCTG ATAAGAATCG CGATGCCGAA AGTCATTTCT CCAATACACT GAAGGCTGCC
CTGTCCAACC TTAGAACTGT TGCTGAAAAC TCAATTTTGC ACAAAACAAA CGGCCTTGCT
TGTACAAGTG ATTCTTCGTC GAATTCAAGT GAATCATCAT CGTCAACGTC GGAAATTTAT
AAAAATCCAT ATCCCATCCG ATCCAAGGCT AGAGAGAGAA AACGTTTACC TAGGCGATCT
CAGTCGCTTG ATATTGAGTC TCTATCATTT CTGTCTACTT GTCGAGGTAT GAAACGCTCA
GTATCATTCT CTGACGATGA CAAAGTAGCC GAGATTCCTC GGTATGAGGC TCAATCAAAG
TCTGAGCTCT TCTACAATAA AGCCGACATC AGGCGGTTTA CTGTCGATGA GCAGACACGC
CGTCAAGAGG AGCAAACTGA GAAAATGGCC ATGATGCTAA AGCTCTACCT GCTTGCTAAA
AAAAGTTAG
 
Protein sequence
MVVTPSHATR EASKDAYELN MAVAVLFFLR RSKFERSANL FQTELRETYG NFNGKTFKGT 
LRWEPLEQKS PFPEKGDNED ESYISEDSDD FEWKHFDSNL VKMRVGDHFV GDSGSDSSSS
SDSTSELTAS HVKMNIRCTT RQPNKDGARD ESGIIDVDKK QLRTLDGSES DSTGDSKSSG
EGDCIPKQGH ASNKFASASS SRPALIAQHL KPLPKGRNFL DSDSDSSSSE EQRAKPLPKG
RTFLDSDSDS SSSEDSAQKK KPPIMAQRLV RREPQLCSID IMPTKIAATK NLPKYDSLNR
KKKLASQVKY RAKVLDDSDK SSEDERRERT KYPAHKISND VEKVGSRVSN SISAQIVTQN
DCNDSSSSSN ESDSTASDEE IRSGSVWRSR QSITSINNEK NKESARVDVS SSQQDVKLQK
MNYNVQNLST RTAPLKANGQ EKADKNRDAE SHFSNTLKAA LSNLRTVAEN SILHKTNGLA
CTSDSSSNSS ESSSSTSEIY KNPYPIRSKA RERKRLPRRS QSLDIESLSF LSTCRGMKRS
VSFSDDDKVA EIPRYEAQSK SELFYNKADI RRFTVDEQTR RQEEQTEKMA MMLKLYLLAK
KS