Gene PHATRDRAFT_48846 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_48846 
Symbol 
ID7195089 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011687 
Strand
Start bp443322 
End bp445360 
Gene Length2039 bp 
Protein Length627 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183358 
Protein GI219126216 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.422234 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GGAAGGCAAC ATCTATTTCC ACATTTGCAT TTCTTACCGT TCCATTGCCT TTTGTAGGTG 
CCGTACCGAA AGTCGTATAA AGTCGACCTC AATGTCGGGA GTCTTTTTAA GTAATGTGGA
CGACTACTTG GCACCGTCGC AAGCCTGTGT GAATCCTCTC TTTAGCACCG ACAAAAAGAA
AGACGACGAG AAGAAAAGTG GAGTTGTGGG AACCCTCTCC AATGGTAACC ATGCCAACGA
CGACCCGAAC AGCTTCGACA CCGCTGCTGC CTCGGAGAAC CCAGCGATTG TCCCAAGGAA
ACGGGTGCGT CGTCGTCTTC CTGCAGCAAT CACCGCTTCT TCCGACTGGA CACCCAGAGT
CCCGAAGGAT CCGGTGCAGG CGTCCATTGC TGACTGCTTG GCCTGCTCCG GTTGCGTCAC
GACGGCTGAG ACGGTCTTGC TGGAAACGCA ACACAGTGTC GTAGCTTTGA AAGAGCTGAT
TGCGAAAAAA GAAAACGATC GTCCCAAAAT CGTGGCGACC ATTTCTCCCG CCGCTTGGGC
CGATTTGCAT CGTCATCTCA GTCGTGAATT CAACTGCTCC CCTAGCCTGT CGCTATCGGC
GCAGCAACGA TGGACTATTC TATTATGGAG AGCGCTGAAG ATTTCGAGCG TCTTGGACGG
CAACATACCT TTGGCTTGGT CATTGGAGGA AGCGGCGTTG GAATTCTGTC GCGCTTATAA
ACGAAAGCAA ACCACGAACG ATCCGGACGC CATGGCAGTT GACGTTCCCC AAGACGAGCT
TTGGCAGCAG CAGCTTATTC CTTCTTTTGC AGAATCACGA TCGCAGTCGC AGTACTACGT
CAATGGGGAA ACAAAAACGG TTTATCATGA TGGCGGTGCT CAGCAAGCAG GCAGCTTGCC
CTTATTGTCG GGGTCTTGTC CAGCCGTGGT CTGCCTGGTC GAAAAGTCAA CGCACAAGGC
AGTGCCTCAT TTGGCAACGA CCAAATCACC ATTGGCTTTG GCCGGTGAGT TTTGGAAACG
GCAACATTTT GACAAGCACA CCTCCCTTCC ACGACAAGAG TACTATCATG TGGCTATCAT
GCCATGCCAC GACAAGAAGC TAGAGGCTTC ACGAAAAGAC TTTGAGGATG AAAGCGGCAA
GGATGTGGAT ATTGTAATCA CGACGCAGGA ATGTATGAGG CTAATTCAGG AACTGCTGGA
TGTATCAATC GACGATATAG TGAAATGCTT CCGTGAATTA CCTCTGGCAA CATTATCGGA
TTGTACGTCG TTCACGAAAG CTGCGGAGCC CGTATTGATA GCAGATTCCA ACAGTCACTG
TATCACGACG CTAACCACAG AAGATGCAGA AATCTCTTCA AATGCTGCCT TCACGTTGGG
TTCCGGTGGC TATGCGTCCT TCATATTTGC TTATGCTGCC AAGCGTCTGT TCGGAGTGCA
GCTGGATGCC CACGAATTGC CCTGGGAACC AGTCGGTCCC GACCAGGCAG GGAGAGTCAG
TGCCCGAGTT GCCGCCTCGA CTCAGCGACG GCGTGATTAC TATCACGTGG CACTATATAG
AAGCCAAGAC GGAAATTTCA CAACCAATGC CAACCTGAGT AGCGATAGTA AGCCTATCTT
ACACTTTGCG ATTGCGTACG GGATGCAAAC GCTTCAGCGT GTTCTTAAGC CATACACTTC
GGAACACTTG CAATCAGGGA TCGGATACGA CTACGTGGAA GCTATGGCGT GTCCTAGCGG
TTGCGTCAAT GGTGGCGGCC AGATTCGGAC ATCGGCACGG GAGACTCCCA CAGAAACTCG
GTTTCGCGTT GGTACTACAC AAACACTGCT GCGGGTCCCG CAAATGAACG AGTCGAGCGG
TCGCACGCAG TTGGGGGCAG GAAGCTCGCT GCATACGCGC TATCACATTG TACCGCCCTT
GCAACATAGC CTCGGAGCGG CAGCGGGGGT TCCCGTCAAG GATACACAGT GGTAGGCCGC
TCTGCAAATA GAGAGCTTTA TGTGGAATAA GTCAACTTAA CGAGGTCTCT TCGCGTTAG
 
Protein sequence
MSGVFLSNVD DYLAPSQACV NPLFSTDKKK DDEKKSGVVG TLSNGNHAND DPNSFDTAAA 
SENPAIVPRK RVRRRLPAAI TASSDWTPRV PKDPVQASIA DCLACSGCVT TAETVLLETQ
HSVVALKELI AKKENDRPKI VATISPAAWA DLHRHLSREF NCSPSLSLSA QQRWTILLWR
ALKISSVLDG NIPLAWSLEE AALEFCRAYK RKQTTNDPDA MAVDVPQDEL WQQQLIPSFA
ESRSQSQYYV NGETKTVYHD GGAQQAGSLP LLSGSCPAVV CLVEKSTHKA VPHLATTKSP
LALAGEFWKR QHFDKHTSLP RQEYYHVAIM PCHDKKLEAS RKDFEDESGK DVDIVITTQE
CMRLIQELLD VSIDDIVKCF RELPLATLSD CTSFTKAAEP VLIADSNSHC ITTLTTEDAE
ISSNAAFTLG SGGYASFIFA YAAKRLFGVQ LDAHELPWEP VGPDQAGRVS ARVAASTQRR
RDYYHVALYR SQDGNFTTNA NLSSDSKPIL HFAIAYGMQT LQRVLKPYTS EHLQSGIGYD
YVEAMACPSG CVNGGGQIRT SARETPTETR FRVGTTQTLL RVPQMNESSG RTQLGAGSSL
HTRYHIVPPL QHSLGAAAGV PVKDTQW