Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_11236 |
Symbol | |
ID | 7199876 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | + |
Start bp | 145873 |
End bp | 147471 |
Gene Length | 1599 bp |
Protein Length | 500 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178713 |
Protein GI | 219115836 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCACGT CGGGGTCCTC TACCGCTCGT ATTCACATTT GCGGAAATTG CGGCAGTGAA TTCGTCAAAT GGATGGGGCG TTGCCCGACC TGTCGCGAGT GGAATACGTT GCAAGAGCAC GCGGTCCGGA GAGAACCAGC CTCTGCGTCA AGGCCAGTTT TTGGCACAAA TATTTCATCT AGTGGCGCAC GACCGGCATC TTGGTTAGAT GGTATCCCGG GTAGTCTGGG ATCGAATGCC CCCGTTCGCA TCACTGATTT GGTCTCGTCG GACGATGGGT CCACGAATAC TCTTTCACGT CTACACTCCT CCCAACGAGT TACGATTCCT AATGACGACG AACTGAACGT GGTACTGGGT GGAGGTATTA TGCCCGGATC GCTCAATCTC ATTGGGGGCG ATCCCGGGGT TGGAAAGTCG ACTCTTATGC TCCAAATCGC TGGAGCCGTG GCGAGCCTCG CCATCCCTAC TCCCGGCATT GGCATGGGCC TGCCCGACTC CAACAATACC CGTACGAAAA GTTCGACCGA CGGGACGGGT GGACCCGTCT GGTACGTGTC AGGGGAAGAA AATCCCGACC AAATCGCTTC TCGTGCAAGT CGTTTGGGAA TTTTGCAATC CGAACTATGG TTGCTAGGAG AAACGCACGT CGATACCCTT TGCCAACAGG TTGTTGTGCA CGTGGAGGCG TCGACAACAC GACCATTCGT CCCATCAGAC TCGAAGCTCG CGATCGACAA TGAAAACGTC AGCCACAGCC AAGGCATGCC GATACCAAAG GCTCCCGCTC TGATTGTAAT TGACTCGATC CAAACCATGG TCTGTGACTC CGGTGGCTCG TCGTCGGCGG GAGGCATTAC GCAGGTCCGG GAATGCGTCG CGCTGTTGCT AAGATTGGCC AAATCAACTC GTATCCCCAT CTTTTTGGTA GGTCACGTTA CCAAGAACGG TGACGTTGCC GGTCCCCGGA CTGTCGAACA CATGGTTGAT TGCGTACTAT ACTTGGAAGG GAGTGCGCAT AACGACGGAT TGAATTTGCG CATGCTGCGG GCGAGCAAGA ATCGGTTTGG CTCGTCCGAC GAAGTCGGCG TCTATGAAAT GACGGCGGGA CGTTTGTTGC CCGTCTCGGA TCCGTCGTCG CTCTTTTTGG CGCACCGCGT AACGCAAGAA GATGCGGAAG GATGCGCGAT TGCCATTGCA CTGGAAGGGA TGCGCGCCAT GACTGTGGAA GTACAAGCCT TGGCAACACC GTCGGGAAGT ACTACTGGCT ATGGTCGTCG GACGGTGGAA GGAATCGCCA TGTCGAGACT AAACTTATTG ATTGGTGTCC TGCAGAAACG CTGTGGTGTA TTCATGTTCA AACAAGACGT ATACATTAAC GTGGCCGGGC GCATACGTCT TGATCGAGGT GAAGGCAACG CAGTAGATCT GGGCGTAGCC GTGGCGTTGG TTAGTAGCTT GGCAACGATT GCGGTGCGGT CGGATACGGC GTTTGTGGGG GAAGTGGGAC TCCTGGGTGA ACTGAGATCT GTAGCAGCGC TGCCAAAACG CTTGGCCGAA GCACGCCGCA TGGGCTTTTC TCGCGTCATT ACACCTCGA
|
Protein sequence | MGTSGSSTAR IHICGNCGSE FVKWMGRCPT CREWNTLQEH AVRREPASAS RPVFGTNISS SGARPASWLD GIPGSLGSNA PVRITDLVSS DDGSTNTLSR LHSSQRVTIP NDDELNVVLG GGIMPGSLNL IGGDPGVGKS TLMLQIAGAV ASLAIPTPGI GMGLPDSNNT RTKSSTDGTG GPVWYVSGEE NPDQIASRAS RLGILQSELW LLGETHVDTL CQQVVVHAPA LIVIDSIQTM VCDSGGSSSA GGITQVRECV ALLLRLAKST RIPIFLVGHV TKNGDVAGPR TVEHMVDCVL YLEGSAHNDG LNLRMLRASK NRFGSSDEVG VYEMTAGRLL PVSDPSSLFL AHRVTQEDAE GCAIAIALEG MRAMTVEVQA LATPSGSTTG YGRRTVEGIA MSRLNLLIGV LQKRCGVFMF KQDVYINVAG RIRLDRGEGN AVDLGVAVAL VSSLATIAVR SDTAFVGEVG LLGELRSVAA LPKRLAEARR MGFSRVITPR
|
| |