Gene Information |
Locus tag | PHATRDRAFT_43598 |
Symbol | |
ID | 7197321 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 917037 |
End bp | 919519 |
Gene Length | 2483 bp |
Protein Length | 758 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178033 |
Protein GI | 219112563 |
COG category | |
COG ID | |
TIGRFAM ID | |
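
As a quick consistency check on the coordinates above, the following minimal sketch recomputes the gene length from Start bp and End bp. It assumes the coordinates are 1-based and inclusive, which matches the listed 2483 bp. The helper names `gene_length` and `coding_length` are hypothetical and not part of any database API. The second helper assumes the coding sequence includes one stop codon. Under that assumption, the gap between gene length and coding length is non-coding sequence, which would be consistent with "Is gene spliced | Yes".

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein (3 per residue).

    Assumption: one stop codon is counted when include_stop is True.
    """
    return 3 * (protein_aa + (1 if include_stop else 0))


# Values taken from the record above.
length = gene_length(917037, 919519)   # Start bp, End bp
cds = coding_length(758)               # Protein Length in aa
noncoding = length - cds               # introns and/or untranslated sequence

print(length, cds, noncoding)
```

The first value reproduces the Gene Length field. The remainder is only an estimate of non-coding content, since the record does not list exon boundaries.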
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00579504 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTACCATTCA AACTCTGGAG CAAAATGAAT TTTACACAAT TCTCTTCCGT ACAATTCTCA AGAATGGTGA ACATGCGGCG CCTCATCGGC CTTTCGCTAC TCTTCTTTTT GTCAGCGGCA GCTTGGCTGC CGCCATCGAC TGTTATTTCC CGGAAAGCAC GATGTGCAGC CAGAGTTAGC TCGGTTGAAT ATGTCAGATT TACAAATGCA ATACGGAAAA GTCGGTCCTC GCCTAGTCGC CCAAGCACCG AGATGTTCAT GGGCATACGC GACTTGCTTC GCAAGCGCGC CAAGGACGAT GATTATGATC AGCCAAAAGG CGGCAGCCCG TCCATACAAA AGCAATCACC TAGCCCTTCT GCAGCTCAAC TCGTACCCGA ACACGCTTCG AAAACCGAGA CGGAGACGGC GATGCCCATA GTCGACAAGA GTGAATCAAG GAAACAAGTT TTACACGAGC GCGCTCCAGT TCGTATGGGA GAAGACCCCG GTACGAAGAC TGTCAACGTC CAATCTTTGG ACTCTCACGA GTCGGTCCAG GATAGAATCA ACCGCGTTAA GGCAGGCAAG ATGACAGAGG GCGAGAAGGA AGCATTTCTG GATTCCGTTC TCACAGCGGG AAATACGCCT GAATCTCGCA AGCCACTTAT TCGTAGTTCT CGAGGAAAGT CGGAAGGCAA GCTGTCGAAG AAGGAATCGA AAGCGTCACC ATTCCCCTCG GATTCTATAC TTCGCAACCT GGCACGTGGG CTGAACGCTG ATTCCGTACA GAAATCTGAT TTTATGGAAA AAACAACGTT GGAAAATCAG CGCAAAAAGA AGGAATATCT AGAAATGGTG ACGGACCCTG ATCGCTTCAA TCGCCTCTCT ACTACAAACA GCTTGCAACG GTCTCCAGGT CTTTATCCAC AGTCAGGCAG CTCCGCCGCT GAATCAAAGA CACCCGTTTC GTCTCTTGGG GGTTTACAGT CTTCTAAGAA CAAGTACCTT CCAAACCCGA CTGGGCCTCC TCAGGAGCCG GTATCGCCTC CAGATGATTT GCCTTTGCCG GGGGATCTCG GGGCTCGACT AGGATCTGCC GCCATGGAAC ACGAACGCCT CCGACAGCAA GCCGAAGAAG AACGACGTGA GCGGGAGTGG CAACAGAAAG AAGAGCTCCG ACTTGAGCAG GAAAGGAGAG CGGCAGAAAT TGCAAAAAGG CGTGAAGAAG AGTTGGATCG ACGCGAAGCA GAAATCAGAG AGCGACGCCG TCGCGACCAA GATAACCTGT CTAGAGAAGA GAATGAGCGG AAAGAGGCAC AGCAACGACG TATGCGAGAC ATGATGAAAG CACAAGAAGA GTACTGGATC AAAAAATTGT CAAGGGAGAA GGAGTCAAAG GCCAAAACAG AAATCAAGAA AGAAGTCGAC ACAGATGAAC AGGAAAAAGA CGACACGGCA CAAACGACTT CCACATCACC TACCGCCCGC GTTCAAGTCC CGGAATCGTC CACGAGATTT AATCCTGACG AAAGGTCACT GTTGAGCGAT GTAAGTATCA CTTGGTTACG TACAGGTCGG CAGCACGATT TTCTAATTCT ATCCCTTTGG CTTGAGGTGC AGGCGTTGAA TGATCACTTG GCTGACTGGG ATAAGAATCA AGATGTCATT GAAAAATCAG CTAAACGAGC GCTAGGCGTA TCGCCTTCTG TAAGTGATGT CGGCTACCAT AAACAGAGCG AACGCGACTT TATGGCGGAG CAAGCCCGCA AGAAGTCCGA AATCGAAAGG GCGAGACAAG TACAGCTGGA AAAGCTCAAA 
GCTCTAAACT CGCCGCTTCC GACCCCTCGA AACTCCAGCT ATGCGCCTCG GGCAGCTCCT GTATTTAATC CACAGCGACC TGTCTCGGCT TCTTCACCTT CTTCTCGACA TAAGCTTGGG AATCTTGTCC GAGGAAGAGA AGGAGGAAGT GATCAAAACT CTGGGAGCCC CTCTGGAAGT CCATTGAATA GTGACGATGT CAATCGAAGA AAAACTATCT CTAAACTAAA TGGAGCGACA CTAGGCTCGG GGGATTCTTC GTGGGATCGA GACCAACCGC AGAGTGGGGG GTCTCTTGAG CAACCTCAAA GCAGCGCGTA CCAAGCAGAG GGCAAGGAAA CTCGTCCGCC GTCTTCGGGG GAGTCTCAAA ACAGAGGACC AATTCGAATG GAGCTACCTT TGTTGGATGT TGAGTACGAC CAGGACGAAG ATGGACTGGA CGTCCGTTCA AACAAGGCCA TGAGTATTGC CGACGCTATG AAGAGATCAA ATAAAGGTGG AAGCGCTGAT CAATCGGAGA GAAGTAAAAA GTGGGGAATC GACATGAATC GCTTTAATTG ATTGTTTTTA CGCTCGCTTC GCCGCAATAA CGAGGATAGT TCTTTCACTT TATCATCACA AAGACGCCAT TTACTATGCT CGCAGAAGAT GGGGTCTAAT TTAAAATTCA CTGTTTTCAA TGT
|
Protein sequence | MNFTQFSSVQ FSRMVNMRRL IGLSLLFFLS AAAWLPPSTV ISRKARCAAR VSSVEYVRFT NAIRKSRSSP SRPSTEMFMG IRDLLRKRAK DDDYDQPKGG SPSIQKQSPS PSAAQLVPEH ASKTETETAM PIVDKSESRK QVLHERAPVR MGEDPGTKTV NVQSLDSHES VQDRINRVKA GKMTEGEKEA FLDSVLTAGN TPESRKPLIR SSRGKSEGKL SKKESKASPF PSDSILRNLA RGLNADSVQK SDFMEKTTLE NQRKKKEYLE MVTDPDRFNR LSTTNSLQRS PGLYPQSGSS AAESKTPVSS LGGLQSSKNK YLPNPTGPPQ EPVSPPDDLP LPGDLGARLG SAAMEHERLR QQAEEERRER EWQQKEELRL EQERRAAEIA KRREEELDRR EAEIRERRRR DQDNLSREEN ERKEAQQRRM RDMMKAQEEY WIKKLSREKE SKAKTEIKKE VDTDEQEKDD TAQTTSTSPT ARVQVPESST RFNPDERSLL SDHDFLILSL WLEVQALNDH LADWDKNQDV IEKSAKRALG VSPSSERDFM AEQARKKSEI ERARQVQLEK LKALNSPLPT PRNSSYAPRA APVFNPQRPV SASSPSSRHK LGNLVRGREG GSDQNSGSPS GSPLNSDDVN RRKTISKLNG ATLGSGDSSW DRDQPQSGGS LEQPQSSAYQ AEGKETRPPS SGESQNRGPI RMELPLLDVE YDQDEDGLDV RSNKAMSIAD AMKRSNKGGS ADQSERSKKW GIDMNRFN