Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45206 |
Symbol | |
ID | 7200092 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | - |
Start bp | 479647 |
End bp | 481447 |
Gene Length | 1801 bp |
Protein Length | 538 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179440 |
Protein GI | 219117291 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0786919 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTGAATACAG CAGCAGCATT TCCTTCATGC TGAAATCGGA ATCTCTCCAT TTACATAATT CCAAGACGCA TAGAGAGCAA GAAGGTGATT GTCATATAAA GCGACGAACG ATTACTCCTA ACGTCTTTTA CAGTTAATCA CTGGAGTGTT TACTTCATAT TCTGTATTGG TGTTCGACTT CACGATGAAG TCAAATCAAG GAAAAGGCAA CAAGAAACCA ATACCAGATA TAAATTTAGC ATCAGGCGGT TCGGTGTTGG TAATTCCGTC CGGTAGCGAT GATAGTAACC GGAAAGCTTC AAACGAGACG GATGCGACTA GCTTTCCGCA TCCGAAAGAG GCATTGATTG CGCAAGAAGC CTCAAGATGC ACGTACACGA ACTTCTCCCG CAACGCCTCT TCTGAATCTG TACGCGGGCG GACGTCCAAA GATCTTGCTC ATACATTTCC AATCAAGCTT CACGGTATAT TGTCAAATCC TGAATTTCAG GACATCATAG CGTGGCTCCC CCACGGCCGA GCATGGAGAA TCCTTCAGCA TAAGGCATTT GAAGAGAAGG TTATTCCCTT GTACTTTCGA CATGGACGGT ATTCCTCTTT TGCACGACAA GTCAACGGCT GGGGCTTCAG GCGTATCACA CATGGTCCGG ACTACAACGC CTACTACCAC GAGATGTTTC TCCGAGGTTT ACCTCATTTA TGCAGCGAAA TGAAGCGTCT TACTCCAAGG GATATCAACA AGAACCAGAA AGATGACTCT CCGCTACCTG ACTTTTATTC TTTAAGCAGA GACCACCCAT TACCGGAAGC TAGCAGCACA GTTGGTAAAA CACTGCTCCC TATACCTGCT AGCGTTCCAG GAACAATCCC GCCAGCCTCC TTTAGTCTGG GAAATCAGAT GTCAACTTTT CAGGGCCCTG CACCTGCCGC GTTTTCGGCG CTCGCAGGTT TAAACCTATC TAGCCTTTCC AGCTTGTTTG GTCAAACTGC TGCCCCAATA CCTCCTCCTG AGACTGTTGA CATGGACTCC CTTGACAAGC ATCGAAACGA CATTTTACAA CAGATGATTG GACTTATTAG CTCTCAAGGA TCAGCGAGAC TTCCGGCTGC TGTACCTGCT CCGACCGCTT CCGCCCCCAC TCCGTCCACC CCAGATTCGC TTGTCGCCGC TCTACTTAGC CAGGCAGGGT TGAACCCCTC AATTCTAGCA CAGCTAGGCA TCAATCATGG CAGAAGCTCC AGCAGAATCA GCCCCCCAAT ACCCACGCAA GTCGTGTCTG GCTCTCCTTG GAACGGGAAC GCTTTCCCTC AGATTAGCAA TCCGCAACCT TCGCCTACTC CTCCAAGTTC GACGATGGTA AATACTTCGG ACTTAGCAAG ACTACTTGCT CTACAACATC AAAGCATTGC GCCCGCGCCA ATTAACTCTG CACAAGCCCC GCCACAAGTC AGCTCAAATT CACTAGATCT TGCTCGTCTT TTAGGATGGG GTGCAGCCGC CACTAATCAG ACGTCTACGG CGCAATCTCC GGCTCTGAGC GACACAAACA GCATGACTGC AGCTCTTCTT CAGCTGCAAC GACAGATCAC GGAGGGTGCA TACTCCGCTC CAATTCAAGC TCAACATCCC GGTACAGTAC CAATACCAAC AGCCGTAAAT CCGGATTTAT TGACGCTGCA ACGACAGCTA CTGGAAGCAC AGTTTGGTGG AAACAACGGC CTTCTGGCTG CCTTGGGATT GGGAACGCAT TCCACCAATA GTGGCGGCGC GCCCACCAAC AATAGCAATG GCATTCCCTG A
|
Protein sequence | MKSNQGKGNK KPIPDINLAS GGSVLVIPSG SDDSNRKASN ETDATSFPHP KEALIAQEAS RCTYTNFSRN ASSESVRGRT SKDLAHTFPI KLHGILSNPE FQDIIAWLPH GRAWRILQHK AFEEKVIPLY FRHGRYSSFA RQVNGWGFRR ITHGPDYNAY YHEMFLRGLP HLCSEMKRLT PRDINKNQKD DSPLPDFYSL SRDHPLPEAS STVGKTLLPI PASVPGTIPP ASFSLGNQMS TFQGPAPAAF SALAGLNLSS LSSLFGQTAA PIPPPETVDM DSLDKHRNDI LQQMIGLISS QGSARLPAAV PAPTASAPTP STPDSLVAAL LSQAGLNPSI LAQLGINHGR SSSRISPPIP TQVVSGSPWN GNAFPQISNP QPSPTPPSST MVNTSDLARL LALQHQSIAP APINSAQAPP QVSSNSLDLA RLLGWGAAAT NQTSTAQSPA LSDTNSMTAA LLQLQRQITE GAYSAPIQAQ HPGTVPIPTA VNPDLLTLQR QLLEAQFGGN NGLLAALGLG THSTNSGGAP TNNSNGIP
|
| |