Gene Information |
Locus tag | PHATRDRAFT_49521 |
Symbol | |
ID | 7195741 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | - |
Start bp | 499245 |
End bp | 501054 |
Gene Length | 1810 bp |
Protein Length | 438 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184263 |
Protein GI | 219128107 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.960008 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAGCAC GTCCGAGGGC GTCCCGCCGC GGAGTGGAAG ACGCCCTGCC TTTGACGGTG CAGCAAGCGC AGAGTAACGG CAGCGGTAGT CTGACGAATA GTCCCTCGCG CAAAAGTGTC GGAGGTTTAT CATCCCGGAC GTTTCGAAGC AACAAACAAC ACGGCAATTT ACTGCGAGCG AGACTGCTCG TTTCTCTCTC GAAGCACAAG GGTCTTTCCA AATGGGTCGG ACTGATACTT GCCGTACTGG GAACCATCGG TTGGTTCCGG AGTAGCTTCG GCAATCATCG CGACGGGTCC CCACGCTCGA TTACATCATC ATCTTCGCGT TCCCTGCATC GTCTCGCACG ACGAGCAGCA GTCCGCGCGC TCTCTCCTTC TCACAAACAG TCTCGGAAAA GGACAGGTGC ACGCAACTCT CCCTTTGCAA CCGACCGAAG TGGAGGAAGG ATACAATCCC GACTTTGGCC ATTTACAGCT TTCCATACTC GAGGACTTCG GTACCGCCCG AGTTATTTAC CATGATTCCT ACATGGATCG TGGTCGTGTG CATCTCGGCC AGGGGGACGA TACCTACGAT TACTATTACG CATTTGATGA CGATATCAAA CGAAATCCTT ACATTGGTTG GTCAGACGAC ACCATACAGG ATCGCAAGCA TTGTCGGCGT ACCTCGTGGC ATCGAAATCT GTTACTCAAC TGTAACAGCT TTCACGAGTT TGATATTTCC AGCTATGCGC TCGGTGGAAT GTTCAAATAC ATTGGGTACG TACAGTTGTA TGGACAGCAG TGATGCTCGG GATCATACTC ATTCAATTGC ACCGTATCGA CAGGAACGGC TCTTACCGAC AAGTATTCTT GGCCAAATTG CCGACCGAAA ACGTTATTTT TAAAGAGGCC AACTGGGATA ATGGTCACGA ATCCGGTTTT CACACCGACG AGTACGAATT TATGCGCCAG GACATGGTGG TCGGCGAATC CTTTACCTCC AATCCGCGCT TTGTTGATAT TTACGGGTTT TGCGCACTCT CCATGTTTAG TGAATTCATG CCGTTCGGGG ACGCTGAAGT TATGGCCCAA ACAATTTTTG ATCGCCACAA TCCACCCTCA ATAAAGGAAG GTGGTGAATT GGTCATTTTC AATAATTTGA CCGGTACGGA AAAGTTGGAG TACGCGTTGC AAATGGCCGA CGCTGTTTCC TACCTGCACA ATTATCCCGG GGGTGTTATT GTTCACGATG ATATACAAAT GCCGCAATTC TTGCTCACTG CCGACAAGCG TATCAAGCTG AATGATTTCA ATCGCGCCGA AGTTATGCTC TTCGACGAAG AGAACAATGA ATATTGCAAG TACCGCAACA ACCCGGGACA CGGTGATGTA AGTTTGGGTT TGGAAGTATC AGTCCGTAAT TCGCCTTTTG CCACACTTGC GTCTCTCACA CCCAAGCATA CGTTGTACAC ACCTTGATCG GTTGACGGAA CAGTGGCGCT CGCCAGAGGA GTACTACGAC AAGCCGTTGG ACGAGAAAAT TGATATTTAC AGTCTCGGAA ATAACTTTTA TGCCTTACTG ACCGGTATGG GGCCTTTTTA CGAAGAAGCC CATTCAGACG GGGTTATTAA AAAGGTTAAG GCCGGAATGA AACCGTACAT TGATCAAAGA TTCCTAGAAC GGAGCTTTGC AGAGAAAAAG CTCGCTGAAA TTATGGTCCT CTGCTGGGAG TATGATCCCG TAAAGCGGCC TGACATCAAC ACGCTAGTGC AGGTTTTGCG GGAAGCGGTA GAGGAGAATT TAAGGCTTCA 
ACCCGTGTAA
|
Protein sequence | MRARPRASRR GVEDALPLTV QQAQSNGSGS LTNSPSRKSV GGLSSRTFRS NKQHGNLLRA RLLVSLSKHK GLSKWVGLIL AVLGTIGWFR SSFGNHRDGS PRSITSSSSR SLHRLARRAA VRALSPSHKH FHEFDISSYA LGGMFKYIGN GSYRQVFLAK LPTENVIFKE ANWDNGHESG FHTDEYEFMR QDMVVGESFT SNPRFVDIYG FCALSMFSEF MPFGDAEVMA QTIFDRHNPP SIKEGGELVI FNNLTGTEKL EYALQMADAV SYLHNYPGGV IVHDDIQMPQ FLLTADKRIK LNDFNRAEVM LFDEENNEYC KYRNNPGHGD WRSPEEYYDK PLDEKIDIYS LGNNFYALLT GMGPFYEEAH SDGVIKKVKA GMKPYIDQRF LERSFAEKKL AEIMVLCWEY DPVKRPDINT LVQVLREAVE ENLRLQPV
|
| |
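The length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates, and that the gene sequence runs from the start codon through the stop codon, as the ATG and TAA at its ends suggest. The helper names `span_length` and `coding_length` are hypothetical and not part of any database API. Because the gene is marked as spliced, the difference between the two lengths is read here as an estimate of total intron length, not an annotated value.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values taken from the record above.
gene_len = span_length(499245, 501054)   # matches "Gene Length | 1810 bp"
cds_len = coding_length(438)             # 438 aa plus one stop codon

print(gene_len)             # 1810
print(cds_len)              # 1317
print(gene_len - cds_len)   # 493 (estimated non-coding bases within the gene span)
```

Under these assumptions, about 493 bp of the 1810 bp span would be intronic, consistent with the "Is gene spliced | Yes" field.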