Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48846 |
Symbol | |
ID | 7195089 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | + |
Start bp | 443322 |
End bp | 445360 |
Gene Length | 2039 bp |
Protein Length | 627 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183358 |
Protein GI | 219126216 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.422234 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGAAGGCAAC ATCTATTTCC ACATTTGCAT TTCTTACCGT TCCATTGCCT TTTGTAGGTG CCGTACCGAA AGTCGTATAA AGTCGACCTC AATGTCGGGA GTCTTTTTAA GTAATGTGGA CGACTACTTG GCACCGTCGC AAGCCTGTGT GAATCCTCTC TTTAGCACCG ACAAAAAGAA AGACGACGAG AAGAAAAGTG GAGTTGTGGG AACCCTCTCC AATGGTAACC ATGCCAACGA CGACCCGAAC AGCTTCGACA CCGCTGCTGC CTCGGAGAAC CCAGCGATTG TCCCAAGGAA ACGGGTGCGT CGTCGTCTTC CTGCAGCAAT CACCGCTTCT TCCGACTGGA CACCCAGAGT CCCGAAGGAT CCGGTGCAGG CGTCCATTGC TGACTGCTTG GCCTGCTCCG GTTGCGTCAC GACGGCTGAG ACGGTCTTGC TGGAAACGCA ACACAGTGTC GTAGCTTTGA AAGAGCTGAT TGCGAAAAAA GAAAACGATC GTCCCAAAAT CGTGGCGACC ATTTCTCCCG CCGCTTGGGC CGATTTGCAT CGTCATCTCA GTCGTGAATT CAACTGCTCC CCTAGCCTGT CGCTATCGGC GCAGCAACGA TGGACTATTC TATTATGGAG AGCGCTGAAG ATTTCGAGCG TCTTGGACGG CAACATACCT TTGGCTTGGT CATTGGAGGA AGCGGCGTTG GAATTCTGTC GCGCTTATAA ACGAAAGCAA ACCACGAACG ATCCGGACGC CATGGCAGTT GACGTTCCCC AAGACGAGCT TTGGCAGCAG CAGCTTATTC CTTCTTTTGC AGAATCACGA TCGCAGTCGC AGTACTACGT CAATGGGGAA ACAAAAACGG TTTATCATGA TGGCGGTGCT CAGCAAGCAG GCAGCTTGCC CTTATTGTCG GGGTCTTGTC CAGCCGTGGT CTGCCTGGTC GAAAAGTCAA CGCACAAGGC AGTGCCTCAT TTGGCAACGA CCAAATCACC ATTGGCTTTG GCCGGTGAGT TTTGGAAACG GCAACATTTT GACAAGCACA CCTCCCTTCC ACGACAAGAG TACTATCATG TGGCTATCAT GCCATGCCAC GACAAGAAGC TAGAGGCTTC ACGAAAAGAC TTTGAGGATG AAAGCGGCAA GGATGTGGAT ATTGTAATCA CGACGCAGGA ATGTATGAGG CTAATTCAGG AACTGCTGGA TGTATCAATC GACGATATAG TGAAATGCTT CCGTGAATTA CCTCTGGCAA CATTATCGGA TTGTACGTCG TTCACGAAAG CTGCGGAGCC CGTATTGATA GCAGATTCCA ACAGTCACTG TATCACGACG CTAACCACAG AAGATGCAGA AATCTCTTCA AATGCTGCCT TCACGTTGGG TTCCGGTGGC TATGCGTCCT TCATATTTGC TTATGCTGCC AAGCGTCTGT TCGGAGTGCA GCTGGATGCC CACGAATTGC CCTGGGAACC AGTCGGTCCC GACCAGGCAG GGAGAGTCAG TGCCCGAGTT GCCGCCTCGA CTCAGCGACG GCGTGATTAC TATCACGTGG CACTATATAG AAGCCAAGAC GGAAATTTCA CAACCAATGC CAACCTGAGT AGCGATAGTA AGCCTATCTT ACACTTTGCG ATTGCGTACG GGATGCAAAC GCTTCAGCGT GTTCTTAAGC CATACACTTC GGAACACTTG CAATCAGGGA TCGGATACGA CTACGTGGAA GCTATGGCGT GTCCTAGCGG TTGCGTCAAT GGTGGCGGCC AGATTCGGAC ATCGGCACGG GAGACTCCCA CAGAAACTCG GTTTCGCGTT GGTACTACAC AAACACTGCT GCGGGTCCCG CAAATGAACG AGTCGAGCGG TCGCACGCAG TTGGGGGCAG GAAGCTCGCT GCATACGCGC TATCACATTG TACCGCCCTT GCAACATAGC CTCGGAGCGG CAGCGGGGGT TCCCGTCAAG GATACACAGT GGTAGGCCGC TCTGCAAATA GAGAGCTTTA TGTGGAATAA GTCAACTTAA CGAGGTCTCT TCGCGTTAG
|
Protein sequence | MSGVFLSNVD DYLAPSQACV NPLFSTDKKK DDEKKSGVVG TLSNGNHAND DPNSFDTAAA SENPAIVPRK RVRRRLPAAI TASSDWTPRV PKDPVQASIA DCLACSGCVT TAETVLLETQ HSVVALKELI AKKENDRPKI VATISPAAWA DLHRHLSREF NCSPSLSLSA QQRWTILLWR ALKISSVLDG NIPLAWSLEE AALEFCRAYK RKQTTNDPDA MAVDVPQDEL WQQQLIPSFA ESRSQSQYYV NGETKTVYHD GGAQQAGSLP LLSGSCPAVV CLVEKSTHKA VPHLATTKSP LALAGEFWKR QHFDKHTSLP RQEYYHVAIM PCHDKKLEAS RKDFEDESGK DVDIVITTQE CMRLIQELLD VSIDDIVKCF RELPLATLSD CTSFTKAAEP VLIADSNSHC ITTLTTEDAE ISSNAAFTLG SGGYASFIFA YAAKRLFGVQ LDAHELPWEP VGPDQAGRVS ARVAASTQRR RDYYHVALYR SQDGNFTTNA NLSSDSKPIL HFAIAYGMQT LQRVLKPYTS EHLQSGIGYD YVEAMACPSG CVNGGGQIRT SARETPTETR FRVGTTQTLL RVPQMNESSG RTQLGAGSSL HTRYHIVPPL QHSLGAAAGV PVKDTQW
|
| |