Gene PHATRDRAFT_41659 details

Gene Information

Locus tag: PHATRDRAFT_41659
Symbol:
ID: 7195985
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 619397
End bp: 621529
Gene Length: 2133 bp
Protein Length: 645 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002177124
Protein GI: 219110745
COG category:
COG ID:
TIGRFAM ID:
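
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence includes one stop codon; the function names `gene_length` and `coding_length` are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


# Values taken from the record above.
start_bp, end_bp, protein_aa = 619397, 621529, 645

span = gene_length(start_bp, end_bp)   # genomic span of the gene
cds = coding_length(protein_aa)        # coding nucleotides incl. stop codon
print(span, cds, span - cds)           # prints "2133 1938 195"
```

Under these assumptions the genomic span matches the listed gene length of 2133 bp, and the 195 bp not accounted for by the coding sequence would be consistent with the gene being marked as spliced.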


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.666462
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGCGAAG GCGAACAAGA GCTCTTCGAA AAGAAACTCT CCAAGGAAGA GAAGAAAGCT 
CGTGCCAAGG CACTTCGCGA AGCGAAAAAG AAAGCCAAGG AACCAAAAGA AGGGAAAAAA
GACAAGAAGG AAGATGCTGA AGAAAAGAAG GAAGAGGCTC CCGCTCTGAA CTTAGACGCT
CTCGACTTGG ATGCTCACAA GGATGCCAAG CGCGAAGCTG CCCTCGACAA GCTTTCCGAC
GATGACATCA TTGTAACCTA CGAAAGTAAG AAAGGAGTGT TGCATGCGAA TACTCGAGAC
ATTAACGTGT CGGGAGTAAC GGTCACCTTT CACGGAAAGC CTTTGATCGA GGAAACAGAA
ATTACCATCA ACTACGGAAA CCGTTACGGA TTCATTGGAC CCAATGGCTC CGGAAAATCC
ACGATCATGA AGGCAATTGC TGCGCGTGCT ATCCCTATTC CGGATTCTTT GGATATTTAC
TTTTTGGATT GTGAATACCC AGCACGCGAT GACATTACGG CTCTGGAGGC AGTCATGGAA
AGTAATGACG AAGTCGGCAT CCTGGAAAAG CAAGCAGATG CTCTCAACAT GGCTATGGGA
GAAGCCGATG AAGAGCAACA GACATCCATC CAAATGACAC TCGAAACAGT TTACGCCCGT
CTTGATCAGT TGGACGCGAG CTCGGCCGAA GCTCGCGCCA CAACTATTCT GCACGGTTTA
GGATTCACCA AGACCATGCA ACATATGAAG ACTCGGGAAT TCAGTGGAGG ATGGCGCATG
CGCGTTGCCT TGGCCCGTGC TCTCTTTCTT CAGCCCGAAT TCTTGCTCCT CGATGAGCCG
ACCAACCATT TAGATATGGT ACGTTTTTCA ACGTTTTTGG GCTGATTGAA AGCCAACAAT
CTCTGAAATG TTCACCTTCT CATTCTATTT CCTCCAGGAT GCTGTGTTAT GGTTGGAAGA
ATATCTGTCG AACTGGGACA AAATTCTGTT TTTCGTCTGC CACAGTCAAG ATTTCATGAA
TAGTGTCTGC ACAAACATCG TTCGCCTCGA TATGACGTAC AAAAAGCTGC GGTACTATAG
TGGAAATTAC GACACATACG TGCAGACGCG TCGGGATCAA GATATGGTGC AAATTCGTCA
ATACGAAGCT GAGCAACGTG ATATCGCTGA AATCAAAGAT TTTATTGCTA GATTCGGTCA
CGGTACCGTC AAGATGGTTC GGCAAGCACA GGCGCGCGAA AAATTGCTTC AGAAAAAGCT
GGAAGCCGGT TTGACTACGC TGCCCGAAAT GGATCCAGAA TGGGATTGGA CATTTCCTGA
TGCGGGAGAG CTCCCCGTCC CGGTTTTGTC GATCGAGAAT GTCAGTTTCA ACTACCCCAA
TAGTGTCGAG CTCTACAGCA AGGTAGATTT TGGGGTAGAT TTGCAGACGC GCGTTGCCTT
GGTGGGGCCC AACGGTGCGG GAAAGACAAC GTTGGTCAAA CTAATGACGG GTGAACTTAA
TCCGACTAAG GGGGCAGTGA AGCGCAATAC GCACCTTAAG ATTTCTCGCT TCACTCAGCA
TTTTGAAGAA AAGCTTGATT TGACGATGAC TCCACTCGAC TTTTTCAAGC AAAAAGTCAT
GCCGGAACAG CCCATTGAAA AAATCCGTCC GCTTTTGGGA CGTTACGGGT GTTCGGGGGA
CCAGCAATCG CAGGTGATGA ACCAGTTGTC AGCTGGCCAA AAGGCACGAA TCGTCTTTGC
AATTATTGCC CATGAAAAGC CGCACTTGTT GCTGCTAGAC GAACCGACAA ACCCATTGGA
TATGGAAAGC ATTGATGCGC TGGCACGATG TTTGAACAAG TTCAAGGGTG GTGTTTTGAT
GATCAGGTAC GTAGAAGATT ATCGTTTACC AATGATTTGG ATTGTCTGCT TGCATATTTG
CTTTCTTTGC GATGCGCCGG CCTGCACTAA CCAAATGTTC TCCTATTCCT TTTGTCCCCT
AGTCACGATA TGCGCTTGAT ATCGCAATGT GCCGAGCAGA TATATGTTTG CGATCACAAG
AAGGTTGTCA AGTATACCGG AGATATTATG GATTTCAAAA TGCACACTCG CAAGGAAAAC
AACAAGAAGC TGGCTCAGCA TTTGAATGGA TAA
 
Protein sequence
MGEGEQELFE KKLSKEEKKA RAKALREAKK KAKEPKEGKK DKKEDAEEKK EEAPALNLDA 
LDLDAHKDAK REAALDKLSD DDIIVTYESK KGVLHANTRD INVSGVTVTF HGKPLIEETE
ITINYGNRYG FIGPNGSGKS TIMKAIAARA IPIPDSLDIY FLDCEYPARD DITALEAVME
SNDEVGILEK QADALNMAMG EADEEQQTSI QMTLETVYAR LDQLDASSAE ARATTILHGL
GFTKTMQHMK TREFSGGWRM RVALARALFL QPEFLLLDEP TNHLDMDAVL WLEEYLSNWD
KILFFVCHSQ DFMNSVCTNI VRLDMTYKKL RYYSGNYDTY VQTRRDQDMV QIRQYEAEQR
DIAEIKDFIA RFGHGTVKMV RQAQAREKLL QKKLEAGLTT LPEMDPEWDW TFPDAGELPV
PVLSIENVSF NYPNSVELYS KVDFGVDLQT RVALVGPNGA GKTTLVKLMT GELNPTKGAV
KRNTHLKISR FTQHFEEKLD LTMTPLDFFK QKVMPEQPIE KIRPLLGRYG CSGDQQSQVM
NQLSAGQKAR IVFAIIAHEK PHLLLLDEPT NPLDMESIDA LARCLNKFKG GVLMISHDMR
LISQCAEQIY VCDHKKVVKY TGDIMDFKMH TRKENNKKLA QHLNG
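
The sequences above are laid out in space-separated blocks of 10 residues, so they need to be flattened before any length or composition check. The following is a minimal sketch of how that might be done; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample string is only the first three blocks of the gene sequence, not the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case the residues."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence; 0.0 for an empty string."""
    if not seq:
        return 0.0
    return sum(1 for base in seq if base in "GC") / len(seq)


# First three blocks of the gene sequence, split over two lines as an illustration.
sample = "ATGGGCGAAG GCGAACAAGA\nGCTCTTCGAA"
seq = clean_sequence(sample)
print(len(seq))                    # number of bases after flattening
print(round(gc_fraction(seq), 2))  # GC fraction of the sample only
```

Applied to the full gene sequence, the flattened length can be compared with the listed gene length and the GC fraction with the listed GC content.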