Gene Information |
Locus tag | PHATRDRAFT_34373 |
Symbol | |
ID | 7199785 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | + |
Start bp | 415179 |
End bp | 416435 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178768 |
Protein GI | 219115946 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
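The coordinate and length fields in the table above are consistent with each other, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the unspliced CDS ends in a single stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 415179
END_BP = 416435


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming an unspliced CDS ending in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1257 418
```

Under those assumptions the result matches the reported Gene Length (1257 bp) and Protein Length (418 aa).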
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAATCAG AATTCTTCTC ATGCTCACAG TCATCCGAGC AATTTCTTCC CACTTTGGAT GTTCTTTCCG AGCTTTCAGC AGATCACGTG GAGCGTTACT CTCGCCAAAT GATTACCGCG GACGGCTTTG GCGTAGGAGG GCAAAAGAAG CTTTTGTCTT CGGCAGTTTT GGTGATTGGC GCTGGCGGGA TCGGTTCCAC CGCGATTCCA TACCTTGCCG CCGCTGGAGT TGGTCGCATA GGTATTGTAG ATTTTGATAC TGTTGATATT TCGAATCTGC ATCGCCAAGT AATTCACAAA ACAGCTGACA TTGGCATGAA TAAAGCTGTG TCTGCATGTC ACGCCGTTCG ATCTCTGAAT CCGATGTGCG AAGTCTACTC ATACGAGACA GACTTGAACC ATGGCAATGC GTTGCCTTTA GTTCAAAATT TCGACTGTGT GGTTGACTGT AGTGACAATC CTAAGACGCG GTACTTGGTC AATGACGCAT GCGTACTTGC CGAGAAGCCG CTTATTTCAG CCAGTGCCAT AGGTACTGAA GGACAGGTTA CAGTTTATAA TTTCCAAGGT GCGCCCTGCT ATCGGTGTTT GTATCCTACA CCGAACGCTG CAGGAGGATG CCGCACCTGC GTGGATGCAG GGGTACTAGG ACCTGTTCCG GGGCTAGTTG GAATTTTGCA AGCAACCGAG ACCCTCAAAG TTTTGACAGG GAGCGGCACA ACAATGAAAG GCCACCTTCT TATGTACAAC GCAATGGATT GCTCGTTTCT CAAGATTGTG AAGCCGCCAA AGCAGACAAA GTGTCGTGTA TGCGGTCCTG ATGCAACCAT TCTATCGATG GAAGCCAGTA ACGCTAGTTT GTTGCATGCT CAAGGACCGA TGCAAACAAT TGATCGCCCT TCACTGCCTA GTGAGTCAAA TATCTCCTGC ATTGAGTACA GCCGAGTCCG AGAAGACAAG GTTCCACACA TTCTTGTTGA CGTAAGAACC AAGCTGCAAT TTGAGATGTG TGCTTTAGAG GAAGCGGTGC ATATTCCGCT GTCCTCTCTA TCCCAACAAC TGGATCAAAT AGAAAAGCTA TCTGGCGGCA CAAAACCTGT CTATTGTATT TGTCGGCGTG GAGTTGATTC AGTTGAAGCA ACTCGGATAC TTGACGCTGC AAAATTATCA CACCCAAACA TTCATTCGGC CAAGAACGTG GCCGGTGGGC TGGTAAGCTG GAGGAAAGAA GTAGACACTT CCTTTCCCAA GTACTAA
|
Protein sequence | MQSEFFSCSQ SSEQFLPTLD VLSELSADHV ERYSRQMITA DGFGVGGQKK LLSSAVLVIG AGGIGSTAIP YLAAAGVGRI GIVDFDTVDI SNLHRQVIHK TADIGMNKAV SACHAVRSLN PMCEVYSYET DLNHGNALPL VQNFDCVVDC SDNPKTRYLV NDACVLAEKP LISASAIGTE GQVTVYNFQG APCYRCLYPT PNAAGGCRTC VDAGVLGPVP GLVGILQATE TLKVLTGSGT TMKGHLLMYN AMDCSFLKIV KPPKQTKCRV CGPDATILSM EASNASLLHA QGPMQTIDRP SLPSESNISC IEYSRVREDK VPHILVDVRT KLQFEMCALE EAVHIPLSSL SQQLDQIEKL SGGTKPVYCI CRRGVDSVEA TRILDAAKLS HPNIHSAKNV AGGLVSWRKE VDTSFPKY
|
| |
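The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a field might be normalised and its GC content computed is given here; the helper names are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two display blocks of the gene sequence, used as a small example.
example = clean_sequence("ATGCAATCAG AATTCTTCTC")
print(len(example), gc_percent(example))  # 20 35.0
```

Applying the same two steps to the full Gene sequence field would give a value to compare against the GC content reported in the Gene Information table.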