Gene Information |
Locus tag | PHATRDRAFT_39118 |
Symbol | |
ID | 7194823 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | + |
Start bp | 373349 |
End bp | 374576 |
Gene Length | 1228 bp |
Protein Length | 396 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183084 |
Protein GI | 219125641 |
COG category | |
COG ID | |
TIGRFAM ID | |
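The Gene Length field can be cross-checked against the Start bp and End bp fields. The sketch below assumes the coordinates are 1-based and inclusive, which the values in this record are consistent with; `gene_length` is a hypothetical helper name, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates copied from the Start bp and End bp fields of this record.
length = gene_length(373349, 374576)
print(length)  # 1228, matching the Gene Length field
```

A 396 aa protein plus a stop codon accounts for 396 × 3 + 3 = 1191 bases, fewer than the 1228 bp span, which is consistent with the record marking the gene as spliced.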
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAAAGAAA ACAATGGACA CGCAGCTCGT CGCGCGCTAA GTCGAAGAGT CGGTATCGTC TTGGTTTCTT TTGCCGTTCT ATTGGCAATA TTATTGGCCG TCAAGTCTCA CGTTCGACAC AAAAAGCCAT TCTCACTGTC AACAAGACCA CATCAACCTT CTCGTTGGGG AACGCGAGGG CGAGGAGGAT CAATGGCGTA TCTGAGCGAA GCAGAGGTTC TGGAAGATAT ATTACTCGCC AAACTGCACC TAGTGGATAT CCGGGTCGAA GACGCCGACG CTTTGCAAAA GGCCAGTAAG GCCTCAGTAA CAGCCGATAG CAACACTTCT TACGATCACG AGTCCGCTAC TCCATACGCG GGTATAACCG GTTTCTTTTG CACCCTAGAC TGGTCCGTGC ACAAGCTGGA TCCGGCCAGC ACCCCAATGT TTCGGGACTT GACAGCCAAA AGCGCTTCCT GTGACGGTCC TCGAAAAATG GACCTTGGGA AAGTAGTCCA GGCTGCGCGG GAACGAGATA GCGACATCAA CCCTCGTCGC GTCAGTAGCG TCCACGTTCT GAACTTAGCG GCCGTTGTTT TCCACGAATC GCGATGTGGG AGCACGTTGG TAGCAAATAT TCTGGCCGGT ATGAATCCTG CGGCCCACCG GGTGTACTCG GAATCGCCAC CGCCACTACA CGCTCTCAAA ACAGTCTGTG GCGAGGACTA TTCCCACTGT GCGAAAACAA TATCTGCACG TATATTACGT GATGTCGTCT ATCTCATGAG TCGCTCCAAC GACTTGAAGG AAACACGGGT CTTTTTCAAA GTTCAATCCC TCGGAAGTCG CAATATTGAG GTGTTCCAAC AAGCCTTCCC CGCCACTCCC TGGTTGTTTG TGTACCGCGA CCCTGTTGAA ATACTTATGA GTCAGCTAGC CAACGGACCG CGTAATGCTA ACTGCGTCCG TCCTCAACGA CTGCATACAC AGCCGACCAG CGTACATCGT GTTTGGCAAC ACCGTGGAAG TAGCGCGACA ACAAACTTGA AAACGCTGTC ACCGGAAGAG TACTGTGCCG TACACTTGGC GACTATCACG GAAACAGCCG TTGAGCAGCT ACAATATGGT TCACAGCTGC AAGGAGTTCC TATTAATTAC GCTTCCCTCA AACCGATGCT GCTTAAAGTG CTGCCCACCC GACTTAATGT GACAATGGGA CCAGAGGAGG TGCGCCGTAT TGACTTAG
Protein sequence | MKENNGHAAR RALSRRVGIV LVSFAVLLAI LLAVKSHVRH KKPFSLSTRP HQPSRWGTRG RGGSMAYLSE AEVLEDILLA KLHLVDIRVE DADALQKASK ASVTADSNTS YDHESATPYA GITGFFCTLD WSVHKLDPAS TPMFRDLTAK SASCDGPRKM DLGKVVQAAR ERDSDINPRR VSSVHVLNLA AVVFHESRCG STLVANILAG MNPAAHRVYS ESPPPLHALK TVCGEDYSHC AKTISARILR DVVYLMSRSN DLKETRVFFK VQSLGSRNIE VFQQAFPATP WLFVYRDPVE ILMSQLANGP RNANCVRPQR LHTQPTSVHR VWQHRGSSAT TNLKTLSPEE YCAVHLATIT ETAVEQLQYG SQLQGVPINY ASLKPMLLKR RCAVLT
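The sequences in this record are printed in space-separated blocks of 10 residues. A minimal sketch, assuming that layout, of stripping the spacing and computing the length and GC fraction follows. `clean_sequence` and `gc_fraction` are hypothetical helper names, and the short input string is only the first three blocks of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace between blocks and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three 10-base blocks of the gene sequence, for illustration only.
raw = "ATGAAAGAAA ACAATGGACA CGCAGCTCGT"
seq = clean_sequence(raw)
print(len(seq))
print(round(gc_fraction(seq), 3))
```

Run on the full gene sequence, the cleaned length should equal the Gene Length field, and the GC fraction can be compared with the GC content field.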