Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_51806 |
Symbol | |
ID | 7200245 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | + |
Start bp | 521435 |
End bp | 522760 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | inner membrane protein |
Protein accession | XP_002179232 |
Protein GI | 219116875 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.959187 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTCACG AACGCATGGG ACTTGAAAAT GTCACAAACC CAGCCGCGGC AGTTTGTTTC TCCCGGCGAG AGTGCACCAT GGCGTTTCAC TACCGAGTAG CAAGGAGAAT CGGGTTGCAA CGTTTCAAAT ACGTCCATAA CAATCACAAC TTGTTGACTT GGCGGCTGCT ATCAGGGTCC TCCCCGACGC CGCTCCATAC AAAAGGAGAA GATGCACCGC AATCCAAATC ACCGGCATCG TCTCGAGATG TCACGTCTCC TTCCTCGATT CTTCCTGGCC TCGGGCTCGC AGCCGGAACA GCACTTGGCG GCTTTCAGGC GGCGTCAATC CTATCAGATA CCCTCGCTAT TCCCGTGTCG GGAATTCCCA CATCTATCCT ACTTGGTATG GCTGTGAAAA ACACGATAGG CTACGACACA AACACTTTCC AACCAGGTCT CGTCTTTGCC ACCAAAACGA TCCTCCAAAC GGGAATCGTC TGCGTTGCGG CCAAACTTTC TTTCCTTGAT ATGGTTACCA CCGGATCCCA GAGTGTTCCC GTCGTAATCG CTTCCGTCGG GGCCGGCATG TTGTTCTTAC CCATCGCTGG TGCCTGGGCC GGGTTGCCCC CGCGACTGTC TCTCCTCCTC ACTGCCGGTA CATCCATTTG TGGTGTCACG GCTATTACTG CCCTAGCACC GGCAATCCAG GCAACACCAC GGGAAATTGC TATAGCTGTG GCCAACACGG TGGCCTTTGG AACCGTGGGA ATGTTGTGCT ATCCCTATGT CTTGCACGAA TTGTGCCAGG GCAATTCGGT GCAGGTAGGA ATGTGCCTCG GAGTGGCTAT TCATGACACG TCGCAGGTTC TGGGGTCGGC CATGGCCTAT AAGGAAACGT TTGATGATCA ATTGGCGTTT CAGGTGGCGG CCGTCACTAA ACTCGTGCGG AATTTAGGTT TGGCCGTGGC CATTCCGACT TTGACGTACG TGTACCACAA GGAGCACACA GCCAAGTCAA CGGGCGAACG CCTACCCGAA ACCATGTCGG GCCTTTCCAC CTTCTCCAAG TACATTCCGC CCTTTTTGGT GGCATTTTTG GGAATGTCGG CTTTGCGGTC GGGAGGAGAC GTGATGCTTT CCGATGTGGA AGTCTATTCG CAAATCATGA ATTGGATTGG CAATGATCTT TCCAAGTACG CTTTGGGGAC AGCAATGGCG GGGGTAGGAC TGAGTACTTC CGCGTCATCG CTACAAGGCG TAGGATGGAA ACCGTTTGCG GTAGGAGGAG CGGGAGCTCT AGTGGTGGGT GGAACCGGAT TCACGGTAGC GACGCTGGTG CTGTAG
|
Protein sequence | MAHERMGLEN VTNPAAAVCF SRRECTMAFH YRVARRIGLQ RFKYVHNNHN LLTWRLLSGS SPTPLHTKGE DAPQSKSPAS SRDVTSPSSI LPGLGLAAGT ALGGFQAASI LSDTLAIPVS GIPTSILLGM AVKNTIGYDT NTFQPGLVFA TKTILQTGIV CVAAKLSFLD MVTTGSQSVP VVIASVGAGM LFLPIAGAWA GLPPRLSLLL TAGTSICGVT AITALAPAIQ ATPREIAIAV ANTVAFGTVG MLCYPYVLHE LCQGNSVQVG MCLGVAIHDT SQVLGSAMAY KETFDDQLAF QVAAVTKLVR NLGLAVAIPT LTYVYHKEHT AKSTGERLPE TMSGLSTFSK YIPPFLVAFL GMSALRSGGD VMLSDVEVYS QIMNWIGNDL SKYALGTAMA GVGLSTSASS LQGVGWKPFA VGGAGALVVG GTGFTVATLV L
|
| |