Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_44100 |
Symbol | |
ID | 7204035 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 984563 |
End bp | 986441 |
Gene Length | 1879 bp |
Protein Length | 494 aa |
Translation table | |
GC content | 45% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186164 |
Protein GI | 219113161 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.894778 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTCTCGATC ATGAACCGAG ATTTGCCCTC CGAGCCTGTC CCATCGGTAT CCTTGCTAGT TGGCCGAACT ATTGGTAGTC GCATCAACCT CTTGCCGGAA GATTTCAAAC CTGGAATAGA CGACGTGATT TGTGGAAGGG GGAAGAAATG TTACAGTCAC ATTGGAAATG AGCGCTTTCG GCAAAGAGTA TTAGGGATGT TAGATAAGTA TTCTCAGGCT CGATCAAAAT TGGATAAATC GAGCGTGCTG AATGATGTTG TCGAGCAAGT GCGGATAGCA AGTCCAAGAG GAGGATTCAT TAAACAAGAC GAAGCAACTC GTCGCTGGTT TGAAGTTGGT GATTTTCTCG CAAGAGAGAA GACTTCCCAG ACTTTCCGCG ACGCTCTACA CGAGCACTAC AAGTCCAGTA GTGTAGCAAA AAAGAAGCGG AGACAGAAGG AACAAGCAAA AGTAAGCGAG AAGTTGCAAA GGGGCGGCTT GGCGGATTTC AAGCAACAGG ATCCATCTCG CAGCAGCTCG GATTGTAAGT CTTCAGCGAT ATTTTTCAAA ATAAAGTAAA CAAAATCTCA CGCATACTTT CTTTCCGCAG CATACTTGGC GTCGGCGAAT GAAGATCAGG CTGCTTTGGG CATCTTGGCC CGATTACATC AGCTTTCCGA GTTGCGAAAG GAACATGCTA GCTTAGGATT AGCAGCTTCA ATGTATCGCC TGTCCAGCCA TAACACAAAG GCACACCCCC GGTTTTCAAA TTGTTCATTC CAAAGAGCTT CCCCTGAAAC CCAGTCAGGT GGAATTTCGA TGCATGCTTC TAGCGAGGAA CCATTTAAAA GTCGGTTTGA CGACCGTGGA AATCTTCGCC GAAATGCTCA GCGATCTTTT TCGCTTCCGT CATCTCCACA AACACTTCCT ACATATGGTT TTGATCTTGA ACTGGATTGG CTCAGTCACT CTCTTCCAAC AGTGCAGTTT CCTCGAAATG AAGCAACAGA TGGAATGGAA GCTATACATC ACACTTTGCC CCAGCCAAGT TACACCGACC ATATTTCACT TGCTTCCTGG TCCAACCAAC AGGCACTGGT ATCACCCCCC AGCTTCCCAC ATTATGCTTC GTTCGGAAAG CCGAAGTTGT TAGCGAATAT TAAGCTTCCT CCAGTAGGAG ATGGAATTTC GAAAGACTTG TTGAGTTCGC TGGAGAAGTT AACAGAACCT TCTTTCGGCG ATTGCAATCC GTTTGAGCCT ATTCCGCTGA CTCCAATTGG AGATTTGAAA AAACCTGACA ACCTTGATCA GGCTGCTAAA GTGCAAGGGA CTCCGCTGGT GGAAGATCCT ATTCTGACAG ATACATCGCA AACACAAAAA ACGTCCTTGG AAAGGAATAG ATCGGCATTT TTGAGAAAAC AAAAGAGGAG GCAACCATGG GGCGGTGGGC ACCAGTAAGC AGTAGATTAC TAGGTAGTAC GATTCTAAAG GCATACCTGC CTTCTTGTGA AAAGATACAG TGAGTGCGAC CAGTGTAGAT AAGACTTCCA TTTGGGGAAG AAACCAAGCT TACTCTTATT TGGTTAGAAT TGCGTAGGGT TGATTGCTTC TGGTTGGCAC AGCACAGCAC AAATTGCTAC TTTCTCTGGA CGACGCGAAC TACAAAAGGG TGCCGACGAG ACTTTTATTT CGTCATCCTT TGGATTTCAG CATGTTTGTC ATCGACGTGA ATACTGGTGC CGTAGCTGAC CATTTGACTT ATCTTATGAT AGCTTACAAT CAATGTCAAA ATACGACTGA GATATAATTC TCATTAGTAT GGCGTTTTCT ATTAAAAGAA AAGAGAGGCA AGAGTGCGTC CTGCTTACTG TGAATGGATC CTGAACAGTT TTGAGACGA
|
Protein sequence | MNRDLPSEPV PSVSLLVGRT IGSRINLLPE DFKPGIDDVI CGRGKKCYSH IGNERFRQRV LGMLDKYSQA RSKLDKSSVL NDVVEQVRIA SPRGGFIKQD EATRRWFEVG DFLAREKTSQ TFRDALHEHY KSSSVAKKKR RQKEQAKVSE KLQRGGLADF KQQDPSRSSS DSYLASANED QAALGILARL HQLSELRKEH ASLGLAASMY RLSSHNTKAH PRFSNCSFQR ASPETQSGGI SMHASSEEPF KSRFDDRGNL RRNAQRSFSL PSSPQTLPTY GFDLELDWLS HSLPTVQFPR NEATDGMEAI HHTLPQPSYT DHISLASWSN QQALVSPPSF PHYASFGKPK LLANIKLPPV GDGISKDLLS SLEKLTEPSF GDCNPFEPIP LTPIGDLKKP DNLDQAAKVQ GTPLVEDPIL TDTSQTQKTS LERNRSAFLR KQKRRQPWGG GHHTAQIATF SGRRELQKGA DETFISSSFG FQHVCHRREY WCRS
|
| |