Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50492 |
Symbol | |
ID | 7199326 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011698 |
Strand | + |
Start bp | 203493 |
End bp | 204863 |
Gene Length | 1371 bp |
Protein Length | 314 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185399 |
Protein GI | 219130494 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 0.498831 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGGAAGAATC CAGCGATCAA CGAGGAAAAC GATCATTTTC GGTTTGCTCG GATTTTGATT CCCCCCCGGT CTCCCGTAAC ACACAAGACT CTGACTGGAT CCTTGTCCTT TGTTCCTTTC TAGAGACTGG CAAGTGAATA CTACTTCACC ATGCTTGCTC ACAAGTTTCA TTCACGTCCT TCATTGATCT CTTGCAACAG CACGCACGGA ATACGTTCCG ATTCGAATCG CGGGTTTGAC AACTCCACCA CGTCGCTTTC CGTGGAGTCG CGGTCTCATC CGGGGCCATC GTGGAAACGT GCGCTGCCCG TGTCTGGAAT GCTGGATGCA TCCGCCGCTC TCTTGCTTCT CCAAGTTGGC AAGATTGCTA CCGACGAAGT TCAGAAAGAT TCTCTGGTGT ACCAGCCATC CTTGTTCGGA AACGTTCCTC CCCACACGGA GAGCTCCGTT TTCTCCTCCG ATGAAGATTC CGATCGAGCA GAATTTGTTC CGGGGGAAGC CCGCATAAGT GCGCGCCACC GATGCCGTAC CGTCTCGGTT GATGTGCTGG AATATCAAGG ACGACGAACG CAAACACAAC ATTCGCAGCG ACCCCGTCTC CTTGCTGGTC CCGTGATTCA TACCGTCCCA CCTTCTCCCA CGGGACCAAA GCCGATCAAG TCTCTAAGTA CAACTACAAC TGCTTCGACC CTGTCCCTGG ACGGAATCAC TACTAAAAAC CCGAGTACTT CCGCAAGCGA ACCTTTGGTC GTTCCCCACA AGCCTTCCGC GAATCTCGTG GGGATCACCA CCGCACACGG AAGGAGCGTC AAGGGTGTGC TACGGCGTAA ATTCTCTTGG AAGAATTTCC CCGAACTGGA AACCTACCTG ATCGATAATC GTCAACAGTA TTTGCAGTAC AGTTCCCAGC TAAACTACAC TTCGGAACAA AAGCGCTACA ACAACCGTCT CACGCAAGGC CTGCTGGATT TGGCTGCCGA GGAAGGTTAC GTCTTTGAAG ACTTTACCTT CGCGGCCATT CGCGACCGGA TTCGTTGCTA CTACAAATCC TGCGTGCAGG CCGCCAAGAA GAAAAAGCGC AAGCGTCGCA AGTAAACGCA AAAGTCACGC GCAAGCGTTG TTGACCCAAA CCCGTTTTTC GGATTGTGTC TGCGGAGCCA GACATTTTCC GCCCTCCACC GTGCATACGC CATTCATCTC CATATCGTCC ACGCAAACCT GCACACGAAC CTACACGTTC ACAACCCTCT CACTACTATA TTTCTTGCAT ATATACAAGC GTCAAGTGCC AGTCTCGGCC AAGTCGCGAG TTTTGATCCA TTTTTCGTCT AGCGAGTAAA TCAACTTTAC TAGTCTCTGA TTGCATACCA A
|
Protein sequence | MLAHKFHSRP SLISCNSTHG IRSDSNRGFD NSTTSLSVES RSHPGPSWKR ALPVSGMLDA SAALLLLQVG KIATDEVQKD SLVYQPSLFG NVPPHTESSV FSSDEDSDRA EFVPGEARIS ARHRCRTVSV DVLEYQGRRT QTQHSQRPRL LAGPVIHTVP PSPTGPKPIK SLSTTTTAST LSLDGITTKN PSTSASEPLV VPHKPSANLV GITTAHGRSV KGVLRRKFSW KNFPELETYL IDNRQQYLQY SSQLNYTSEQ KRYNNRLTQG LLDLAAEEGY VFEDFTFAAI RDRIRCYYKS CVQAAKKKKR KRRK
|
| |