Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47576 |
Symbol | |
ID | 7202637 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | + |
Start bp | 158326 |
End bp | 160147 |
Gene Length | 1822 bp |
Protein Length | 574 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181856 |
Protein GI | 219123073 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.124789 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TACCCTCCAA CGACAACGAT CAACGAAAAA GTGTACACGG AACGACGGCC ACGCATGACG AGTCGGCGCG TGACGGCGAC CTTACGGCAA CAACTGCGAC AAGCCGCTCG CGGTGACACT CAATCCACAC CCAACGGAAG TCCCCCGCGG CGACTCTGGA AATGCTTTCC ACTCACCGCG CAACACGTCA ACGTTCCCAC GAAACGTCAC CCCGACGACA CTGACGACGA ACGAAACACT TCGGCTCATC CACGCTGGTT GCGGACTCCC AGTGAGTTTC ATCAAACTCT CTGCGATCGC ATCGAACGTG CCCGACGTCG TGTGCACTTG GCCTCACTCT ACATTGGACC CGCCGTGGAT CCGTTCAAGT ACGACAAGGA AGCAACCTTT GCCCAAGTGT TGTCACGCAT CGATCCCCGA GTCGACGTCC GTATCTTGTT GGATCAACAC CGAGCGCTCC GTCCCGTGCC GGTGCCTCCG CAACGGGCAC CCGCACCGTT CGTCCGCCAC CATCTCGTCG GCGGAAGCCT GCCGACGAGC TCTCGCGCAA CGTCCCAAAC TACCGGACGA ATCCCACAAC GGGACTGCCG ACACCGAACA CAACAAGAGT CAGATTCATT TGTTATCGGT ACTCGGTCCT TGGCTGTCCC GACTGCCCAA TCCGTACAAC GAAATTGCCG GCGTCTTTCA CGTCAAACTC TACGTCGTGG ACGACGCCGT CCTCCTCAGT GGCGCCAATT TGTCGCAGGA ATACTTTGCC GATCGACACG ACCGCTACGT ATGTATATAC AACGGTGGCA ACGGGTTGGT CGACACCTAC GTCGATTTGA TCCAAGCCTT GTCGGAATTC GGTAGTCAAC GATACGAAGG AATCGACGAG AATGGTGTGG CACAACTCAC CAACGTACCC GATCGACAAC GACTCTTTCG AGCGATCCGG GACGTACTGA CGATCGAGGC CGATACCGCA GGAATCGACC ACGAACCAGA TCCCGACGTC ATTGCCTACG CGGTACCCAC CTTTCAGGCA CCCCCCGGTT ACTTTACAGC AACCTGCGAC GCAACCGAAC TCGCCACCAT GCCCACTGAT CTACAAACCA TTCACGATTT GCTACGCCAA ACCGCCGCGT GGGCTCCAGC GGCGGCGTCG TCGTCATCGT CCGCACCACA AACGACCGCC ACCACTGCAA CCACAACACA TCAAAACAGG CCAGTCACAC TTCGTCTCGC CAGTGCCTAT CTTAATCCAA CACATTCGTT TCTGGAATCC ACGAGGAATT TAAACGTTTT CTTTTTGACG GCCGGAAAAC TCTCGCACGG CTTTCGTCCC AAAAAGGTGA CGGGTCACGT TTCCAAAACG GCCTGGATCC CTACCGTCTT TGCAACGCTA GTGGCATCCT ATCCTCCGTG GGTAAAGACC TGGTGGTACC AACGCGAAAG CTGGACCTTT CACGCCAAGG GATTATGGTT AACAACCACC GCCGAAACGG TGCCGGAATC CACGACGACG ACGTCCAAGA CCAACGTTCC TACACTGACG AAATCCCAGC TGCGCATTCC CGAGACGGAC GAGCTTTTGG TCGTCTCCCA CGGATCCGGC AATTACGGGT ATCGATCGGA ACAACGAGAT ATGGAAAGCA ACTTGCTCTT GGTTTTCCCC TCACCTACTG ATGGACAGGA AAGCAACAAT CCATGGGCTC AGCAGCATAT TGACGAATGG AACGAATTTG TACCCTCGGC GGTGCCAGCT TGTTTGGAAG ATACCGACCC ACTGCCAAAG CCAGTGCAGT GGGTATTGCC ATACATCAAG TCGTTTTTTT GA
|
Protein sequence | MTSRRVTATL RQQLRQAARG DTQSTPNGSP PRRLWKCFPL TAQHVNVPTK RHPDDTDDER NTSAHPRWLR TPSEFHQTLC DRIERARRRV HLASLYIGPA CCHASIPEST SVSCWINTER SVPCRCLRNG HPHRSSATIS SAEACRRALA QRPKLPDESH NGTADTEHNK SQIHLLSVLG PWLSRLPNPY NEIAGVFHVK LYVVDDAVLL SGANLSQEYF ADRHDRYVCI YNGGNGLVDT YVDLIQALSE FGSQRYEGID ENGVAQLTNV PDRQRLFRAI RDVLTIEADT AGIDHEPDPD VIAYAVPTFQ APPGYFTATC DATELATMPT DLQTIHDLLR QTAAWAPAAA SSSSSAPQTT ATTATTTHQN RPVTLRLASA YLNPTHSFLE STRNLNVFFL TAGKLSHGFR PKKVTGHVSK TAWIPTVFAT LVASYPPWVK TWWYQRESWT FHAKGLWLTT TAETVPESTT TTSKTNVPTL TKSQLRIPET DELLVVSHGS GNYGYRSEQR DMESNLLLVF PSPTDGQESN NPWAQQHIDE WNEFVPSAVP ACLEDTDPLP KPVQWVLPYI KSFF
|
| |