Gene Information |
Locus tag | PHATRDRAFT_38067 |
Symbol | |
ID | 7202749 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 796853 |
End bp | 799295 |
Gene Length | 2443 bp |
Protein Length | 698 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182137 |
Protein GI | 219123654 |
COG category | |
COG ID | |
TIGRFAM ID | |
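The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes that Start bp and End bp are 1-based and inclusive, which is consistent with the listed gene length of 2443 bp. It also assumes that the gene span runs from the start codon to the stop codon with no untranslated region, as the sequence suggests (it begins with ATG and ends with TAA). Under those assumptions, the difference between the genomic span and the coding length estimates the total intron length. That figure is inferred, not stated in the record, and the function names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 796853
end_bp = 799295
protein_length_aa = 698


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate range."""
    return end - start + 1


def coding_length(protein_aa):
    """Coding bases for a protein: three per residue plus one stop codon."""
    return 3 * (protein_aa + 1)


gene_length = span_length(start_bp, end_bp)
cds_length = coding_length(protein_length_aa)

# Assumption: the span covers start codon through stop codon with no UTR,
# so whatever is not coding is intronic.
intron_total = gene_length - cds_length

print(gene_length, cds_length, intron_total)
```

The nonzero remainder agrees with the record's "Is gene spliced | Yes" field.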
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00714206 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGACTCCTA ATGCTAAGAC AGAAGAAATA CGGCAGCATG GAGTCAAAGG CGTATACTAT GCTGGAAACT GGTTAGATAT TGAAGACTGC GATGGTGTCT TTCAAAATGT GGAGAAATCC TGCTCCGGAT GCGACTTTCA GGACGCTTGT CCACGCTCTC GATCGCGTAA AGTTGCCGCA AAAGATGATG ATACGTATGA TGTCGTAATT ATCGGTGCAG GCTGCATCGG GGCAGCGATT GCGAGAGAGT TGTCACGTTA CAAAATTAGT GTGCTTTGGG TCGAAGCCGG CGACGATGTC TCGCAAGGAG CGACAAAAGG TGAGTGTGTG CTTTCTACGA GTCGCTGGAT CATGCTTCGC TCTCACATGA CAAACGCGCA TGCTATTGCG TAACTTATTT GCAGGAAACT CGGGAATTGT TCACGCCGGA TATGACGACA AACCTGGTAG TAATCGGGCC AAGTACTGTT GGAAGGGAAA CCAGATGTTT GCTGCGTTGG ACAAAGAGCT ACGATTTGGC TATCAAACTA ACGGATCACT TGTTTTGGCT TTTAAGGAAG CCGACAAGAA GGTGCTGAAT AATCTTCTCA AGCGTGGAAA GACCAACGGC GTCCAAAACT TGAGGATCGT CGAACGGGCC GAGCTACTTC GGATGGAGCC ACACGTGCAT CCAGACGCCA TTGCGGCCTT GTATTCACCC GATGCAGGAA ATGTTATTCC GTATGAGGTA CTTAGCAGAT TCCAGTTTTA TGCTTGTGAG TCGCTCTGTT CTTTGAGGTC ACCTGACTTT CTTCTCCTTG TCGCACATTT CAGTACGCCG TGGCTTTAGC TGAAAACGCA GTCGACAACG GCGTTGAGCT TCGTATTCGT CGGCAAGTAA TGGACATTCA AAATAAAGAT AAAGGTCATA TGATGGTGAC TTTGAAATAC TGGGAACCAG AAGACTACGT CAGAGCCATC GCACAAGCTG GTAAAGCTAC AGTCTTCAAC TTTGCCATGT ATGCCGTAGG TGCTACCACT GTGGCTCATT TTTTTGTCAC CAAAGGCAGT GCACACAAGC AAAATGAAAA GTATCATTTA GCGGTTCTGT GCTTCTTGTG GATTCTAAGC AAGCTGGTGC CTTTCATCTT TCCCAACGCT GCAACATCCA AGGTTGATCG CAGTATTCCC CTGTGTCATT TAGTGGACCA GGCTAGTCCT CCTGTTGGTA CAGGAGGCGG CCATCCTGTT TCAGTGCCGG ACATGCTGGT GGGAGGATCT GGAGGTCCTC GTCCTATGCA AGGCAAGATT GTATCGACGG AAAAAATTAA AACAAAGTTT GTCGTCAACT GTGCAGGCGG CGCCGCCGAC GAGATTGCTC GCTTGGTGGG TGACGATTCA TTCAATATCA AGCCTCGTCT TGGAGACTAT ATTTTACTCA ATCGGAATCA GGTAAGCATA CTCAAGTATC ACGGGTTCTA GTACGCAACT ATATCGGCGG GACGTGCGCG AGACTCAACC AACGTTTTTC TCGCTTTTCA AATTCAATGT TCCAGGGCTA CCTGGCTAAG CATACTCTGT TTCCGTGCCC AGATCCTAAG CTTGGAAAAG GTGTCTTAGT CCAGACAACG CTATGGGGAA ACTTGATTCT TGGTCCGACG GCTCGTGATG TAGGCAACGA GGAAGCTAGG AAGATGTCTT CAGCGGCAGT GCAGGAGTAC ATCCTTGCTA AATGCAAACA GCTCGTTCCT GGTTTTGACC CTCGCGAAAC ATTTCATGCG TTTTGCGGAG CACGTGCGAA ATCGGATCGT GGCGACTGGA 
TAATTGAGCA TTCCAAGAAC GATGCCCGCA TGATTCACGT TGCTGGAATC GATTCGCCTG GATTGGCTGG CTCTCCAGCA AGTACGTGTA ATTGCTGCAA CTCCAAAATG CTTGGCATTG CAGCTTGAAG CTGACACAAA ATATACTCAT TTTTACTATT TTAGTTGCTC TCGACGTGAT TGAAATGCTG CGTAAGGCGG GCCTCACGAC AGAAACAAAT CAGAGCTTCA ATCCTAATAG AGCGCCGATC GTCATCCCCA AAGTTGGGAT GAAAGGGCTG AAAATGGGAC CCGTCGGCAA GTTCGACAGC GATGGTAGCA ATATGGAGCA AATGGCTGCG AATGTAGTTT GCAAGTGCGA AAAGGTTACA GAGCTAGAAA TCGTTCGAGC GATTCGTCGT TCCCTGCCAA TTGATTCGTC GCAAGGAATT AGGAAGAGGA CTCGGGCTGG TATGGGTCAT TGTCAGGGCG ACCCTGAAAA CTACAACTGC GAAGCTCGTG TACGAGCTAT CATCGCGCGA GAAAACGGTG TGCCCATTGA ACATGTGGGA GGCCGTCCAT GGCCCGCCAC GTCAACGCTC TCCCAACGCT GGATCAATGA AAAGGAAAAA CAACATCTCG TGGACTGCAT GAATGTAGAG TAA
Protein sequence | MTPNAKTEEI RQHGVKGVYY AGNWLDIEDC DGVFQNVEKS CSGCDFQDAC PRSRSRKVAA KDDDTYDVVI IGAGCIGAAI ARELSRYKIS VLWVEAGDDV SQGATKGNSG IVHAGYDDKP GSNRAKYCWK GNQMFAALDK ELRFGYQTNG SLVLAFKEAD KKVLNNLLKR GKTNGVQNLR IVERAELLRM EPHVHPDAIA ALYSPDAGNV IPYEVLSRFQ FYASENAVDN GVELRIRRQV MDIQNKDKGH MMVTLKYWEP EDYVRAIAQA GKATVFNFAM YAVGATTVAH FFVTKGSAHK QNEKYHLAVL CFLWILSKLV PFIFPNAATS KVDRSIPLCH LVDQASPPVG TGGGHPVSVP DMLVGGSGGP RPMQGKIVST EKIKTKFVVN CAGGAADEIA RLVGDDSFNI KPRLGDYILL NRNQGYLAKH TLFPCPDPKL GKGVLVQTTL WGNLILGPTA RDVGNEEARK MSSAAVQEYI LAKCKQLVPG FDPRETFHAF CGARAKSDRG DWIIEHSKND ARMIHVAGID SPGLAGSPAI ALDVIEMLRK AGLTTETNQS FNPNRAPIVI PKVGMKGLKM GPVGKFDSDG SNMEQMAANV VCKCEKVTEL EIVRAIRRSL PIDSSQGIRK RTRAGMGHCQ GDPENYNCEA RVRAIIAREN GVPIEHVGGR PWPATSTLSQ RWINEKEKQH LVDCMNVE
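The sequences above are displayed in blocks of ten characters separated by spaces, so they need cleaning before any downstream use. The following is a minimal sketch, with hypothetical helper names, of stripping that whitespace and computing a GC fraction. It is run here only on the first two blocks of the gene sequence, and it is not claimed to be how the record's GC content value was derived.

```python
def clean_sequence(text):
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence, as displayed in the record.
example = "ATGACTCCTA ATGCTAAGAC"
seq = clean_sequence(example)

print(len(seq), gc_fraction(seq))
```

Applying `clean_sequence` to the full Gene sequence field and comparing `len(seq)` with the listed Gene Length is a quick way to confirm that nothing was lost in copying.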