Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44316 |
Symbol | |
ID | 7197974 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 213156 |
End bp | 215051 |
Gene Length | 1896 bp |
Protein Length | 631 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178182 |
Protein GI | 219114773 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.189323 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGACACA TCTTGACCGA CCTTCGCGGC AATCCCATTT ATTCCTTCGC CCATCGCCGG CTCCGTCGTA TGAGGCATTC CCACGCCTCA ACGCTTTTGT TGCTCGTCCA GAATCGCTCG TGGGAATCCG TCCTCCACCG GGCAGCTCAG CACCCCCAAG AAGTTTGGAT CGTCGACGAC GGTGGGAATA CGCCCTTGCA CGTAGCTGCT CGGCTCGATC CTCCGCCTGC CGTCGTGAAG GCGCTGCAGG TGACGTCAAG GGTTGCAAAT TTGGAAGGGA TCACACCGTT GCACATAGCG GCATCGCATC GGTGTTCTGC CGCCGCTTTG CGGGCACTCA TTGCCTGTGC TGGTTCTGCG ACAACAATCA CTACAGATGG AACTGTTGCT GGATGCCTCG TAGATACTCC TTCCTTGAAG GAAAGCGATG CTTCGTCTGT AGGCGAAAAA GAAAAAGTAG GGCTTTCTCC TACCGCTGAC TTGTCTCGTA TGGGCCGAGC CCCAATTCAC TATGCTTGTT TGTCTTTTCG GGGACACAAC ATTGAGGCCT TTTCGGTCCT TCTTGATGCA ACGCTAAAAA GCGGTTTTGT CACCTTGCAC GATTTTGTGG TTGATGAAGC TCTACCGGAC TTTGAAGAGT TCGACGAAGA GTTCACTGAC GACTTTTGCG ACACCAAAAA CCTTAAGTCT CAGCATTTTG AAGCGAAGCA ACGAGTGAAG AAGGTCATTA ACGTGATGAA CATGAAGGAC GCAACGGGAC AAACCCCACT GGGACTTTTA TTTCGCCGGT ACCGGGAACG TGTCCGTTGC GTAATATCCA CCGTGGATCG ACTACGTCGC GAACATCCGA ATCGGGAACA AGCGGCAGTA GGAGCAGCGT TGGCTGTCCA CGCCGATTTG GGAGAACTTT GGGAAAAGGC GCGCTTGATT GTTATTAAAC TTACAGAGGA GCGATTGCAA CGTGAAGATT CCGAAAAAGC GGCATCCAAC AGGTCCATAA TTATATCTAG CAACGCTACT GAAATGAGTC CTCCGTCTCC CGGAGAGGAG GCTGTCGCTC AAGAAGCAGC TCAGTATGCG ATGGAGCGTC ATCAAAGAGT TGACAACACG AGCATAGAAG ACGAGGACTT CCTCACTTTG CCAGGAGACG AACTTGTTAA CGACCGAGAC ACACGACGCC TTAAGTTCCG AATTGTCCAT GCTTCGGTGG GTTTGACGGG GTACGGCTGT CCACCAGAAA TGATACGTTT GGCTCTTTCC ATCTATCCCA ACCAAGTCCA CGAAATGGAC GAAGACGGCA ATTTACCACT GCACATTGCT GTAGCAGAGG CGAACTATGT AGCCACGACA CCGAATGGGG GTCTTCCAGA ATTTCATAAT CAACTCGGTA TTGGAGAGAT TTCCGACGAT CACAGTGTTG TGTCTGCTAT GAGTTTCTTT TCCTCGGCAA CTATATCCCA GACCGTCAAC CCATTCGACA AGGTCATCAA GATGCTCTTG CAGCAGTATC CGGAAGCTGC CAAGATCCCT CAAGGTCGGA CTGGTCGGCT GCCGTTGGTA ATGGCCTTGG AATCGGGTCT GCGGACTTGG GAAGACGGAA TTCGGACCTT GTTGAACGCC TACCCGCCAG CGCTGCACGA CAAGAAGCTA ATTGAGCCTG CTTTGTATCC GAACGTGTTG GGTTTGGTCA CGAACGCCTC TAACCTTCTC TTTCCCTTGT CACGTATTGG TGGAAGCTCT ACTGGGCTGG ACGATTTGCT GCCTAGAGGA AGCCAGCATC TTCGCTCGCG CAGAGCTCGC AGCAACACGC AAGCAAAACG TCGGGAAGAG TGCGCCCGAA CGACACTGTT TGAGCTGCTT CGCACTAAGC CTGAGTGGTT GGTGTCGGAA GCATAG
|
Protein sequence | MRHILTDLRG NPIYSFAHRR LRRMRHSHAS TLLLLVQNRS WESVLHRAAQ HPQEVWIVDD GGNTPLHVAA RLDPPPAVVK ALQVTSRVAN LEGITPLHIA ASHRCSAAAL RALIACAGSA TTITTDGTVA GCLVDTPSLK ESDASSVGEK EKVGLSPTAD LSRMGRAPIH YACLSFRGHN IEAFSVLLDA TLKSGFVTLH DFVVDEALPD FEEFDEEFTD DFCDTKNLKS QHFEAKQRVK KVINVMNMKD ATGQTPLGLL FRRYRERVRC VISTVDRLRR EHPNREQAAV GAALAVHADL GELWEKARLI VIKLTEERLQ REDSEKAASN RSIIISSNAT EMSPPSPGEE AVAQEAAQYA MERHQRVDNT SIEDEDFLTL PGDELVNDRD TRRLKFRIVH ASVGLTGYGC PPEMIRLALS IYPNQVHEMD EDGNLPLHIA VAEANYVATT PNGGLPEFHN QLGIGEISDD HSVVSAMSFF SSATISQTVN PFDKVIKMLL QQYPEAAKIP QGRTGRLPLV MALESGLRTW EDGIRTLLNA YPPALHDKKL IEPALYPNVL GLVTNASNLL FPLSRIGGSS TGLDDLLPRG SQHLRSRRAR SNTQAKRREE CARTTLFELL RTKPEWLVSE A
|
| |