Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46561 |
Symbol | |
ID | 7201845 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | - |
Start bp | 714426 |
End bp | 715862 |
Gene Length | 1437 bp |
Protein Length | 453 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181060 |
Protein GI | 219120652 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0084892 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTACCAC AGAGGAGAGG AAGTAGAAGA CGACGACGAC AATCGCATTT GACGCAGCAA TGTACTCGCG TCGTTTGTCT TCTGATATTG GGGAAAACGG CCTCCCAACC CGCTGAAGAA ACGGTCGCCG CCTCGCTACC ATTGTCGCAA CCACACCTTC GCAGACGTCA CGACAACGGT AATACCGTGG AACTTGTTCC GAATGCGACC GTCCGTCTGC CTCTGCACGC CGTCGCGGGT ACGCATCACG TGACGGCTTG GATGGGGGAA CCGCCGCAGG CGCAAACGCT GATTGTCGAC ACCGGGTCGC GGTTGACGGC GACCGCGTGC GAGCCCTGTT CGCAATGCGG GACGACGCAC GCACACCCGT TCCCCCATTT GGACCCCCAG CGGTCCAGCA CGCTGCGATA CACGCAGTGT GGATCCTGTC TGCTCAGCGG CATCCAGGAA TGCGCAGCGG AACAAAAGTG TGGTATTAAT CAAAGGTATA CTGAAGGCTC CAGCTGGACA GCAGTGGAAG TCAGCGATAC GTTTGTCCTG GGAGGACCGG AGATATCCAG TTTGGAACAG TACGTGAGCT TTACGATTAT CTTTGCGTTC GGATGCCAGC AAAAAGTCAG GGGATTGTTC CGAACACAGT ACGCCAACGG TATATTGGGT TTGGAACGGT CCGACCTCTC GCTCATTAAG CGATTGTGGA AGGAAAATGT CATTCCTCGC GAGTCATTCT CCCTATGCAT GACACCTTTT GAAGGCTACA TTGGACTGGG AGGACCACTA CGAGACAAGC ATACGGAATC GATGAAATAC ACGCCGTTCA CTTCCACTCA GAGTTGGTAT GCTGTCCACG TAGTCCGAGT GTTTGTAGGG GACGAATGCT TGACAAGCAA TGACCAGCAC GACACTGTTG TCGAGCATGC ATTGGTCGAA GCCTTTGCAG AGGGCAAGGG TACTATACTG GACTCGGGAA CGACGGACAC GTATCTCCCC AAGGCAGTTG CGGGTCGTAT GCGAGAAATA TGGGCGCGCC TTTCCAACAC ACCCTTTCAA CCGTCGAGCA CGTACGCCTA CACATACGAT GAGTTTAGAT CGCTGCCCAT CGTGACCTTT GAGCTCGCCA ACAACGTAAC CTTACAGGCC CTGCCTAAAA ATTTCATGGA AGACCTTCCC GAGCCTTTGC GGCCCTGGAC GGGACGGAGG AAACTAATGA ACCGCCTGTA CGCGGACGAA GTACAAGGTG CCGTGGTGGG ATTGAATACA ATGGTGGGCT ATGACTTGCT CTTTGACGTC CAAGGCAATC GTTTTGGTGT CGCCCCGGCC CTATGTGGAA TTGCGAACAG TACACCAGCA GCGACTCATT AAAACGGAAG CGTTTGTAAA GGTTCTTTTG ACAATTAAGA ATCTTCGATA TACTTAATGG TCATCGGGGT TCCCGTT
|
Protein sequence | MVPQRRGSRR RRRQSHLTQQ CTRVVCLLIL GKTASQPAEE TVAASLPLSQ PHLRRRHDNG NTVELVPNAT VRLPLHAVAG THHVTAWMGE PPQAQTLIVD TGSRLTATAC EPCSQCGTTH AHPFPHLDPQ RSSTLRYTQC GSCLLSGIQE CAAEQKCGIN QRYTEGSSWT AVEVSDTFVL GGPEISSLEQ YVSFTIIFAF GCQQKVRGLF RTQYANGILG LERSDLSLIK RLWKENVIPR ESFSLCMTPF EGYIGLGGPL RDKHTESMKY TPFTSTQSWY AVHVVRVFVG DECLTSNDQH DTVVEHALVE AFAEGKGTIL DSGTTDTYLP KAVAGRMREI WARLSNTPFQ PSSTYAYTYD EFRSLPIVTF ELANNVTLQA LPKNFMEDLP EPLRPWTGRR KLMNRLYADE VQGAVVGLNT MVGYDLLFDV QGNRFGVAPA LCGIANSTPA ATH
|
| |