Gene PHATRDRAFT_40843 details


Gene Information

Locus tag: PHATRDRAFT_40843
Symbol:
ID: 7198701
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011694
Strand:
Start bp: 79994
End bp: 82964
Gene length: 2971 bp
Protein length: 903 aa
Translation table:
GC content: 53%
IMG OID:
Product: predicted protein
Protein accession: XP_002184824
Protein GI: 219129288
COG category:
COG ID:
TIGRFAM ID:
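
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence includes one stop codon; the function names `gene_length` and `coding_length` are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


length = gene_length(79994, 82964)   # values from the record above
cds = coding_length(903)             # 903 aa plus an assumed stop codon

print(length)        # 2971
print(cds)           # 2712
print(length - cds)  # 259
```

Under these assumptions the gene span is 259 bp longer than the coding sequence alone, which is consistent with the record marking the gene as spliced.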


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCACCA AGTCTACTCC CAAAGACCTC ATTGACTCGT TTCCGCACAG CAAACTCACA 
CCGATTGCTA CCGCGACAAC CAAACCCGAT TACTTGTCGC TCCATCAGCT TCAGTATGAA
ATCAACGACA ATGCGGAGAC CCTTTCCTCC ACTCTTGGAG ACGGCCAACA CGGTCACCTT
TTTCTCGTAA TTTCCGAACC CGAGTACCTC GCAATGACCG ACGGCGTTCC ATGCATTCCT
CCTGTGCAGC CGCCTTTCGA CCCAGTTCAT GCTGCCAACG CCACCGCTCC TCAAATTGTC
GAAGCTAACC GTCAGAACGA CAAACGTCAA AAGCTTTTTG ACCTTTACCA CAACGCCATT
AAAGCGTTTC GCAATCAACT CCTTGAAGCC ATTCCCATCG AATACATTGA ATCTCTCGGT
CACCCTACCC GAGGCTTTAA CAAAGTCTCT CCCCTCGAAA TCCTCTCTCA TCTCTGGGAA
AATTTTGGTA AAATTCAGGC TTCGGATCTC ATTGCTAACG ACGAACGCAT GAAAGCCGCC
TGGCATCCAC CAACACCTAT CCAGCAACTT TTCCAGCAGC TTGAAAAAGG CAATCAGTTT
ATCATCGCGT CTGGCCAAGT CATGGACGAA CGTATTATCG CTCGCATCGG CTACCAGATC
ATCGAAAAAA CCGGACTCTT TGATCTTGCT TCTCGCGACT GGCGTTATAA AGATGAAGCC
GATAAAACTT TGGCAAATTT CAAAAAACAT TTCCAGAAGG CCAACAAGGA TCTCGCCCTC
ACCGCCACCA GCAGCTCTGC GGGTTACCAC ACCGCAAATC AGAGTACTGT CACCAAGGGA
AAATTGTATT GCTGGACCCA CGGCATCGTT CACAACACAA AGCACACCAG TGCGACATGT
GAAAAACAGG CCCTGGGGCA CAAAACCGGC GCTACATTGC ACGACAAACA AGGCGGGTCG
ACTAAGACCT ATCAATACAC GCCACCGGTG CCCAAATAGG AAAGAGGGAC GGCCAAACTG
TTGAGTGTGC CGCTGAATAC TTATCATAAC AAAAATAAGT CTTCAGTTGC ACCAAACACT
CCTCCGTTAG CTTCCTCCCC GCCATTTTTC CCTCCCGACG CCATTGCAGA CACTGGCTGT
ACCGGACATT TTTTGAGCAC CAACATTGCT CACATACATT GCCAACCGAC GGTCCCCGGC
ATCAACGTGG TCCTCCCTGA TGGTCGCACA ATCACTTCGA GTCATATCAC CGAACTCAAC
ATTCCCTCGC TTCCTCCGGC AGCTCGTACC GCCCATATCT TTCCCGGTCT CTCGAATGGA
TCCCTCATTT CCATCGGCCA ACTTTGTGAC CACGGCTGTA CCGCCACGTT CACATCTGAC
ACAGTCCGCA TTGAGCTCAA TAACACTGTC GTTCTCCGCG GCGGCCGTTC TCCCTACACC
CGATTGTGGA CCCTCGACTC CCCTGTAACG CCCAATCCGC CCGCCACTGA ATTGCATGCG
CCTGTGCACG ACAAAAATTT TGCGAATCAC CTCGGAGACC ACTCAGGGAC CCTTGCCGAC
CGCATTGCCT TTGTTCATGC ATCCTTATTC TCACCACAAC TTTCAACATG GTGCAAGGCC
ATTGACGAAG GCCGCCTCAC AACCTTTCCG GACATCACGT CTGCACAGGT AAAACGGCAC
CCCCCACAGT CCGTCCCTAT GGTCAAGGGA CACCTTGACC AGCAACGGTC CAACCTACGC
TCAACCAAGC CCAAGGTCAC CCTGTCTGCC TCTGTTGATC CTGATGACAT CAATTTCGAC
ACCAATCCCG TCGTACAAGA CCCTCCAGCC GCCAGGACGC AGTTTTTGTA CGCCGATTTC
GCCGAAGTCA CCGGAAAAAT TTTTACTGAC CCTACCGGCC GCTTTGTTAC CACTTCAAGC
TCCGGTAATG CATACATGCT AGTGGTTTAT GACTACGATA GCAATTTTAT TCATGTCGAA
GCCATGAAGA ACCGCACCGG TCCCGAGATT TTGAGCGCCT ACAAGCGTGC TCACGCCATG
CTATCCTCCA AAGGTTTGCG CCCCCAACTC CAACACTTAG ACAACGAAGC CTCCACTGCG
TTACAACAAT TCATGTCCTC TGTTGACATT GATTTTCAAT TAGCTCCTCC GCACGTGCAC
CGTCGGAACG CCGCCGAACG GGCAATCCGC ACGTTCAAAA ACCACTTCAT TGCAGGTTTG
TGCAGCACCG ACAAGAACTT TCCGCTTCAC CTTTGGGATC ACTTACTCCC ACAAGCCATC
ATGACTCTCA ACCTTCTTCG AGGGTCTCGT ATCAACCCAA ATCTGTCGTC CTGGGCCCAA
CTCCATGGCT CGTTCGACTA CAATTGTACC CCTTTGGCTC CCCCGGGCAT CCGCGTACTT
GTACACGAAA AACCGACAAT TCGCAGAACC TGGGCCCCCC ACGCAGCCGA CGGCTGGTAC
GTTGGTCCCG CCATGAACCA TTACCGATGT TATCGCGTCT GGATCAGGGA GACCACCAGC
GAACGCATTT CTGACACCCT GACATGGTTT CCCAGCCAAG TCAAAATGCC CAGCACCTCG
TCTCGCGACA CAATTGTCGC CGCTGCTCAC GATCTTGCCC ATGCTCTGGC ACATCCATCT
CCCACGTCCC CCTTGTCGCC TCTTTCGGTC CACGAACGCG AAGCCCTCTC GCAACTTTCA
GATATTTTTT CGAAAGCCGC TAACCCAGTT GACTCATCCC TCCCAGTTGC TCCCACGGCA
ACCCTAAGTC CGCCAACTGC ATCGACTTCT TCACCTCGTC AAGTCCGCTT CCGAGACCCG
GTCACTGAAT CACTTCCGAG GGTGCCGACC GCCACAGCCG CCCCTCCGCA GTCACTTCCG
AGGGTGCCTC CCCCAAACTC CGAGGCCGAG ACATACAAGC TTGTCACCTG CAACCCTCGC
CAAGCACGTC GTAGGGCCGC TCGAAAACTG A
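
The gene sequence is listed in space-separated groups of ten bases, so it has to be joined before any per-base statistic such as GC content is computed. This is a minimal sketch, assuming GC content means the fraction of G and C among the A/C/G/T bases; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the database may use a different definition or rounding.

```python
def clean_sequence(text: str) -> str:
    """Join a whitespace-grouped sequence listing into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T bases of seq (assumed definition)."""
    counted = [base for base in seq if base in "ACGT"]
    if not counted:
        raise ValueError("no A/C/G/T bases found")
    return sum(base in "GC" for base in counted) / len(counted)


# First row of the listing above, used only as a short example input.
first_row = "ATGACCACCA AGTCTACTCC CAAAGACCTC ATTGACTCGT TTCCGCACAG CAAACTCACA"
seq = clean_sequence(first_row)

print(len(seq))                        # 60
print(round(100 * gc_fraction(seq)))   # GC percentage of this row only
```

Passing the full listing to `clean_sequence` in the same way gives a single string whose length can be compared with the gene length field, and whose GC fraction can be compared with the GC content field.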
 
Protein sequence
MTTKSTPKDL IDSFPHSKLT PIATATTKPD YLSLHQLQYE INDNAETLSS TLGDGQHGHL 
FLVISEPEYL AMTDGVPCIP PVQPPFDPVH AANATAPQIV EANRQNDKRQ KLFDLYHNAI
KAFRNQLLEA IPIEYIESLG HPTRGFNKVS PLEILSHLWE NFGKIQASDL IANDERMKAA
WHPPTPIQQL FQQLEKGNQF IIASGQVMDE RIIARIGYQI IEKTGLFDLA SRDWRYKDEA
DKTLANFKKH FQKANKDLAL TATSSSAGYH TANQSTVTKG KLYCWTHGIV HNTKHTSATC
EKQALGHKTG ATLHDKQGGS TKTYQYTPPS SVAPNTPPLA SSPPFFPPDA IADTGCTGHF
LSTNIAHIHC QPTVPGINVV LPDGRTITSS HITELNIPSL PPAARTAHIF PGLSNGSLIS
IGQLCDHGCT ATFTSDTVRI ELNNTVVLRG GRSPYTRLWT LDSPVTPNPP ATELHAPVHD
KNFANHLGDH SGTLADRIAF VHASLFSPQL STWCKAIDEG RLTTFPDITS AQVKRHPPQS
VPMVKGHLDQ QRSNLRSTKP KVTLSASVDP DDINFDTNPV VQDPPAARTQ FLYADFAEVT
GKIFTDPTGR FVTTSSSGNA YMLVVYDYDS NFIHVEAMKN RTGPEILSAY KRAHAMLSSK
GLRPQLQHLD NEASTALQQF MSSVDIDFQL APPHVHRRNA AERAIRTFKN HFIAGLCSTD
KNFPLHLWDH LLPQAIMTLN LLRGSRINPN LSSWAQLHGS FDYNCTPLAP PGIRVLVHEK
PTIRRTWAPH AADGWYVGPA MNHYRCYRVW IRETTSERIS DTLTWFPSQV KMPSTSSRDT
IVAAAHDLAH ALAHPSPTSP LSPLSVHERE ALSQLSDIFS KAANPVDSSL PVAPTATLRP
LEN