Gene PHATRDRAFT_47701 details

Gene Information

Locus tag: PHATRDRAFT_47701
Symbol:
ID: 7202887
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011682
Strand:
Start bp: 573838
End bp: 577110
Gene Length: 3273 bp
Protein Length: 795 aa
Translation table:
GC content: 54%
IMG OID:
Product: predicted protein
Protein accession: XP_002181935
Protein GI: 219123237
COG category:
COG ID:
TIGRFAM ID:
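
The Start bp, End bp, and Gene Length fields above are mutually consistent if the coordinates are read as 1-based and inclusive, which is an assumption here rather than something the record states. A minimal Python sketch of that check follows; the helper names gene_length and residue_count are hypothetical and introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, ASSUMING 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def residue_count(blocked_sequence: str) -> int:
    """Count residues in a sequence printed in space-separated blocks.

    All whitespace (spaces and line breaks) is ignored.
    """
    return len("".join(blocked_sequence.split()))


# Coordinates taken from the record above.
print(gene_length(573838, 577110))  # 3273, matching the Gene Length field
```

The same residue_count helper could be applied to the blocked gene and protein sequences in the Sequence section to compare their lengths against the Gene Length and Protein Length fields.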


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGATGC GCGTTCGTCG TGAGGAGCCC CGAAAATCCG AGCACCCGAA GGAATTGCCG 
GGACGATACC GACGGGGAGA AACGAAAATG TCGTTGGAGG CTGAGGGGGG TTAAAGCGTT
ATCCCAAAGG GAGCCGGCCG ACGCGCAGCC CTCAACCCTC TGCTTGGATC GACGGAATGC
CCAACCAGAA TTCTACTCGA GTCTTGGACA GGACTAAAAG TTCAATCGTG CACGTCGTCT
CGACGAAATG CTATCTACCA GTACGGTTGC CCGACATAGT GTCCCCTTTA TCTCTATATA
TGTTGACTTT GTCTAACTCT ACAGGACAGA CAAATTTTGC TGGTATGACT GCGAATCCTT
AAGTGTCTCA TCGTTGTTTA CACAAAAGGC GTCGGGGCTT GACTTTGTCC CTGGTGTCCT
TGTTCTCATA CGGGTCATTC GTAAGTAGAA ATTTAGAGCG GCTTGCATCG GTGCTTCCGG
CCAAGCTTTG TCCGCCCTTG TGCAGGATTT TCCGTTCGAG CCGTCTCTTT CGTTTGGGCC
GAACGCCCAA AAACCTCAAA ACACATGGAT GGTACTCGAC GTGTAAGTAG TAGATACGTA
GGTAGTTTGT TGGAGAGAGA GACCGACCGC ACACATCAAT CACACACATA CACACACATA
TACACACATT ATCATACCTG ACGATCCATT GTCCCGGTAC GGACAGCTCT CGGGACACGA
CAGTTTCCGC CAAGTTCGGG CGAATCCACG CGTTCCGTTC GCTCCCTGCC TCCCTTTCTA
GAGTATTGGG CCTTTCCCCG CAGACCCATT TGCGTTGGAT TTGCTCGAAC GAACACACTG
TTCCAAACGC AGTCAGTCGA CACAAACACA CGAGCATATC TACTTGGACA GCCGTTCGAG
CCCTTGTCCT TTGTTTTTAG ACCTTTTAGG AGACCAAAAA ACGGGAGCTT ACTGTGAAAA
CTTGTCGCTT CCGAACAAAC CGAGAGGCCC ACACCCCTCC TTCGTCGTGT CACTACCAAC
ACACACACAG CTCCCCATCA TGTCTTGGCT GCGACGGAGA AGTGTTCTTC CCTGGACACT
CGGACGCCAT CGGTGCGTCT CGTGGTGCCT CGTTCTCGTT GCTCTCTCCG AATGGAGCGG
ACCCCACTCG TATGGTACGT ACACGGATTT AGTTGTTCTC TGTTTCTGCA GCGTTACTTC
CGGATACATA GCCATGGGTC AACAATCCAT CCCGTACTCG TCAGCTAGGC TGTTTTCCCT
TTTTCTCTGT CTGTCATTCA TCATCTCACG CTCCTCTTTG TATTTCCCTG GAAGTAGCAT
GGGCTGAGGA TGTCCAAACA CCAGTCGCGG CGCCAACGGT TTGGTTGCCA GAACAACAGA
TGACGGAACT AGTTCCGATT ACGGACCAAC AGTTTACCTC GGCACCGGCA GCAGCGCCCG
TGGCTGTACC TGTGGTAGCA CCCACACGAG CACCCACGCC GGCTCCGACG CGAGCCCCAA
CCCGTCCACC GGCGCGAGCC CCAACCCGTC CACCGACGCG AGCCCCCACC CGTCCGCCCA
CGCTGCGGCC AACGGTTGCA CCCACCAAAG CACCAACCGT ACCTCCCACA CCGGCGCCGA
CACTCTCGCC GTCGTTTGTC CCGTCGGATG CTCCGTCGAT TCTGCCTTCG AAGGCACCCA
CCGGCATGCC AAGCTCCCTT CCTTCCACCC TCGCTCCCAC CACAATCGCG CCCACGACTA
TCACTCCCAT GCCCACATCC TCCTCGCCAC CATCGTCCAT GCCGTCGCAG GGGCCGAGTC
GCGCACCAAG CCTCGCACCG TCACTTGCTC CTACCGGCAA TCCGTCAGCC CCACCCACCA
ACAGACCGAC TGCAGCACCG TCGACTGCGC CAACGCTGTC TCTCCAGCGA GATACGGTGG
ATATTACAAT GCGCATGACA TCCATCCCGG GACGACTGGA AAGATCTTCC GCCATTCAAT
GGGAAGCCGC CACGGCCGAG CACATCCGTC GGAGTATCTT GGCGCAAACT ACCGACAGGC
CACTCATGGA ATTGATGATA CGGACCAACA TTGAGTCCCA ATTCACGCAA GCGTTTAATG
CCGGTCGACG AGTCGTAGTG CACATGGCCG AAGAGGGAAA CGCGATCGGC GGTCCGCGGT
TTTTGCAAGA AGTCGTCATT GCGCCTCTGC GAGTTTCATT TTTTGCAACA GTCTCTTTCC
GATCCACTTT TGACAATTAC GACTGGGCCA GTTTGATTGG TGACGCCTTC AACAGCGACG
ATGAACGATC AGCCTATGTT GCACGTTTAC GGGCGACCGG AGACAGGGCG TTTGACCCAC
TCGGGAGTGT AACTCTATTG GTGGAAGGGG AAACGCCCAT TGAAGAATTA CCCGACCAAG
ATTCCGAGGA CAGCGGCGGG AACAATTTGT TGATTGTGAT CGTTGCGTGT ATCGCCGGTG
GCAGCGTCCT CCTAGCCCTC GTGGGACTCT TTATATATCG CCAGTCCTCC TCCGCTCCGG
ATATCAAGGT AACTCCCAAG CTTGTTGAAC AGCACCACAG TACGAGCCAA AGTGTACCGA
GCCAGCGTGC GGGCTATTCG ACGGAAATTA ATGTTGACCG ACAGGACGAC ATTAGCACGC
TTGGCGACCC TATGTTTGGT ATGGGCGGCA TGCACTTTGG TGCTGGCGAT GGCTTACAAC
GGGACGAGCA AACTGCCAGC GTTGGCAACG ATTACGACTA CAACAAGGAG TATCTGCATA
GCCAGGGTAT TGCCTTGTCG ATGGAGGAGA GTAGTCGAAG TCGGCTCACT TCGACGGACT
CGGATCGCGT TTCGGGTAAC TCAACTTTTT CCAAGATGGG TAAACTCAAT CCAACCGTGT
TCGCCGACGA CTCGTCGTTT GAAGAACAAT TCGTCGAGGA GGAGGAGGAG GAGGAAGAAG
AAGAAGAGGT GGAACGGTTC ACCGTGAACG TACCGGCCGG AAAATTGGGG ATGGTAATAG
ATACGCCGGA AGGTAGTCTT CCTATTGTCC ACGCCATCAA AGAATGGAGC ATTCTGGAGA
ATACCGTCAA GATGGGCGAT AAACTAATAT TCGTAGACGA CGAAGACGTG ACGGAAATGA
CTGCCGTGGA AATCTCCAAA CTGATTTCAC TCAGATCTGA TCGGCCGCGC TCACTCGTCT
TTCACCGCGT TCTTCCACGC AGCGATTTCA TTGATATGTA CTAAAAGTTT CGACAACCAA
CAGGATTGCC CCTCGTGTTG TTGTCGCCAT GTC
 
Protein sequence
MAMRVRREEP RKSEHPKELP GRYRRGETKM SLEAEGGQTN FAAACIGASG QALSALVQDF 
PFEPSLSFGP NAQKPQNTWM VLDVSPSCLG CDGEVFFPGH SDAIGASRGA SFSLLSPNGA
DPTRMPWVNN PSRTRQLGCF PFFSVCHSSS HAPLCISLEV AWAEDVQTPV AAPTVWLPEQ
QMTELVPITD QQFTSAPAAA PVAVPVVAPT RAPTPAPTRA PTRPPARAPT RPPTRAPTRP
PTLRPTVAPT KAPTVPPTPA PTLSPSFVPS DAPSILPSKA PTGMPSSLPS TLAPTTIAPT
TITPMPTSSS PPSSMPSQGP SRAPSLAPSL APTGNPSAPP TNRPTAAPST APTLSLQRDT
VDITMRMTSI PGRLERSSAI QWEAATAEHI RRSILAQTTD RPLMELMIRT NIESQFTQAF
NAGRRVVVHM AEEGNAIGGP RFLQEVVIAP LRVSFFATVS FRSTFDNYDW ASLIGDAFNS
DDERSAYVAR LRATGDRAFD PLGSVTLLVE GETPIEELPD QDSEDSGGNN LLIVIVACIA
GGSVLLALVG LFIYRQSSSA PDIKVTPKLV EQHHSTSQSV PSQRAGYSTE INVDRQDDIS
TLGDPMFGMG GMHFGAGDGL QRDEQTASVG NDYDYNKEYL HSQGIALSME ESSRSRLTST
DSDRVSGNST FSKMGKLNPT VFADDSSFEE QFVEEEEEEE EEEEVERFTV NVPAGKLGMV
IDTPEGSLPI VHAIKEWSIL ENTVKMGDKL IFVDDEDVTE MTAVEISKLI SLRSDRPRSL
VFHRVLPRSD FIDMY