Gene PHATRDRAFT_49638 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49638 
Symbol 
ID7198271 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011691 
Strand
Start bp274682 
End bp276672 
Gene Length1991 bp 
Protein Length611 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184335 
Protein GI219128260 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TGAGAATACA ACTCCTATTG GTTTCCAAGA CTTCAAGAAA CAAAGCTTCT CCTTTATTTT 
GTGTAAGCTA CAGGAGGTAG GGAGAACTTC ACATCATTCC ATCAGCAACC GTTTCCGGCC
ACAACCAATA TGCGGAATCC GTTTCGACGC AATAACCCTG CGGCGTCGAA CGCGAATCCC
CGTCCGATGC AACACGAAGC TTCGAACCCG TTTGAGGTGC TACAACGCGG TGCCCGACAA
GCTGCTAGCT CCCTTATCGA TAGTTTAGGG GCTACGCATC AGATGGTCGA AGAGCAATTT
GAAGCCGCAA CGCAAGCCGC GATGCATGCT TCCATGCAAG CCCCTGCTTC TTCTCAAGGA
CCGCCCGCTG CCTCGGCACA GGTCCTGCAC CACCTTCCGC AGATTCGTAT CACTCGTCAA
GATTTGGTCG AACCTACCAA TCGCGAATGT TGCGTCTGCT TCGATCTTCA TCGTTTGAAC
GACAAGGTTT TGCGGCTGCC CTGTGCTCAT GTTTTTCATC CACAATGCAT CACCAAGTGG
CTACAATCTC ACTGCACCTG TCCGGTTTGC CGATACGAGC TACCAACGGA TGATCCCGAC
TACGAACGGG GCCGGATTGA GCGGATGCGG AACCGGAAAC CGAGATTTGC GAGGCATGAA
CTGGATCGGA TGACAGTCTC TGAGTTGAAA GCGCTTTTGG CGAAATCGAA AAATTGCCGC
CAACGGCCGG TGGACAAGCA TGATTTAATA TCTCTCCTTA TTTCGTCGAA CGCTATTGAT
GTCGTGGAAA CTCCGGAACC GGTTACTTAC CGGCTCTCTG CCTTAAAAGA TATGAGTGTG
GGAGCATTGA GGCGGTGTAT GAACGACGAA GCTGGGGTCT TTTACGATCC GAACGAGGTG
GTAGAAAAAG CAGATATGAT CCAAATTTTT TTGAACAGTG GGCGCTTACT CTTAAACCCT
GAAGATAATG CGACCAGTGA AATCAACGAA GACGACTTTG TTTGGAAGGA TCTGTCGCCG
ATCAACAGCG ACGAGGATGA AGAAGACAAA TATCCGTCTT CCGCAGTTGC GTCTATCCTT
GTGGAGACTG TAGTAGATGA AACTGATGTA TCAATATATG ATCGAAGAGT TAACAAAATG
TTACTCATGG AAGAAAACTC GTCCTTTGAC GAAAGTTCGT GCACTCCAGC CATGGAAGAT
GTGGAAAGAT TACTAGAGTT CAAGGATGAG ACATCGGGCT CAGCGACTGC CGACGAACAA
TTCACTGGAA TAGATGCCGT TGAGGTCGCA TTGACTGATG CACCTATGGA CATGGGAGCT
GATTCGCAGA ATGTTAAACG GCGAAAACGT GTAAGAAGTT TAGGTGGGAA CGAATATACC
AGGCAGGTGG CAAAAGAAAA AAACTCCGCA AAAGCATCTC TGGACTCTAT TGACGAAGGA
AATGTCACGG AAGGATCCAC GGCTCTCCGC CTACCAGTGG ACGATAATGA GGAAAGTATG
GATTTCCACG GAGAAGAAAA TATCTCCTCT TGTTTCGACG ACTTGAGCGT TTCCGAACTT
CGAGCACGCG GGCGAGAAAT ATCGGTTGAT CTTTCCGATT GTATCGAACG TGCGGAAATA
GTCCAGCGCC TTTCGTCCAT TGAAAGTGAT GGACAACGTG CCGGCCGCCT CATGAATTGG
GAAAAGTGGC GGGTTTCAGA TCTCCGAGCG GTTGCCGCGT TGACCGGTGT GGACTTATCA
GAGTGCCTTA ATCGACAAAG TATGGTTGAA AAAATGCAAC ATGCAGGAGT TGAACGTCCT
CATTTAGGAC GATTCTTGCA CTCACTGGCT CCATTAGCAC GTCTTACCAG TCTACAATTA
CTGGCCGTAG CACGAGACTG GCAGGTTGAC GTTTCCGACT GTCTCGAAAA AGGCGATATT
TTGCGCCGAT TGGTTGAATC GGGGCCAGGT ATACGATTTG AATGAATACA TTCGTCATAT
TTTAGTTGGT G
 
Protein sequence
MRNPFRRNNP AASNANPRPM QHEASNPFEV LQRGARQAAS SLIDSLGATH QMVEEQFEAA 
TQAAMHASMQ APASSQGPPA ASAQVLHHLP QIRITRQDLV EPTNRECCVC FDLHRLNDKV
LRLPCAHVFH PQCITKWLQS HCTCPVCRYE LPTDDPDYER GRIERMRNRK PRFARHELDR
MTVSELKALL AKSKNCRQRP VDKHDLISLL ISSNAIDVVE TPEPVTYRLS ALKDMSVGAL
RRCMNDEAGV FYDPNEVVEK ADMIQIFLNS GRLLLNPEDN ATSEINEDDF VWKDLSPINS
DEDEEDKYPS SAVASILVET VVDETDVSIY DRRVNKMLLM EENSSFDESS CTPAMEDVER
LLEFKDETSG SATADEQFTG IDAVEVALTD APMDMGADSQ NVKRRKRVRS LGGNEYTRQV
AKEKNSAKAS LDSIDEGNVT EGSTALRLPV DDNEESMDFH GEENISSCFD DLSVSELRAR
GREISVDLSD CIERAEIVQR LSSIESDGQR AGRLMNWEKW RVSDLRAVAA LTGVDLSECL
NRQSMVEKMQ HAGVERPHLG RFLHSLAPLA RLTSLQLLAV ARDWQVDVSD CLEKGDILRR
LVESGPGIRF E