Gene Information |
Locus tag | PHATRDRAFT_37932 |
Symbol | |
ID | 7202853 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 441817 |
End bp | 443322 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182068 |
Protein GI | 219123514 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
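The coordinate and length fields above are consistent with one another, and a short sketch can confirm this. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon. Neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length(441817, 443322)
print(length, protein_length(length))  # prints: 1506 501
```

Under these assumptions the computed values match the reported Gene Length (1506 bp) and Protein Length (501 aa).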
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.209727 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGGCA CATCTCCGAC CCCGACCAGG CTATCGTCTT CGCCCTCCAT CGTACCTTCC GTCATGCCAA CCGCAACGTG TCACGACGTC CAATCGTACC GCAGCCCTAT CAACAACTTA GAATGCAGCG ACCATAAAGG CACGGACTGT ATTCAGTGGC GGCACTTGGG TTTGAATGTA AACGAATTGG AAGACCTCGT CAATTTTTGC CCGGAGTCGT GCAATATTGA TTGTGGTGCT CTAAGTCGCT TTGAACTTCG CTTGACCTAT CGTTTGGAAA ATGTAATAAC ATTTTTGGGT CCGGAAACAA GTGAAACGAT AGAATTGGTT GGAGTGGAAT TTTTCACGAA ATACATCCAG TCTATTACCC CTCAAAGCCG CATTTTTGTG AATGAAGCAG AGCTCTTGCG ACAGCAGATC GAGCCGTATT TTCTAGGGCA GCGTGAACAT GCCCTTCGTA CTTCACAATC AGGCCCTTAC GTGGAGCTTC TGATAGAAAC GATGTTTCGT GGACTCACGG TGCACTTGAC GACGGATGCT CTGTTGGGGT ACCTTGAAGA AGCGATTTTG AGCACGGGCT TTACACAGGC GATTCAAGGA TCCGGTGATC CAACCTTGAC GCAAGCGATA GTTTCTGACC ACAAAAAAGG ACGGCGCTTT TCTCCGAACG CAGGATCAAC GGCAGGCAAT GATGACAATA GGCATGCGTC GGGAAGCGTT GTTGCTTCCA TGCTTATGGC CTTCGCCGCG ATATGTGCAG GAGTATGTCT TTTCGTTTGG CACAAACGAA AAGTCCGAAC GGAAAACGGA GATCGAGGTG TGAATGAGCC TAGTGCTGTT AGCCCCACTG AATCAGAGCG ATCCCAATTA GCCAACATAT TTTCTTTCGA AAGCGTCTCG AACGCGGTTG CCAATGCTAA GGGAGCAGTA CTAAATGTCA ATTTCTGTCA TCAAAAAAAG AATTCGACAA AGAATACCCC TAGCAGGTCT TCTACTAGTG AATCAGGCAG TGGCGAGACG GAAGACGAAC CGCACCCGTT GGCAGGCCTT ATTCCTCCGA TGATTGTCAT GAATCAAATT GACAGCAATG AAGATTCCAT CAGAGGCTCC AGAAATGCAG AAAAAATAGC CAATGTTGTA CCATCACACT ACATGGCCGC TTCTGCACAG CTTATAGCCT CGCTGAATGA CCGGAGGACG CCGCACCAAG TGCCGAACTT CTCTAAAGTA TTTGTCAGTG TAGCGGACTT CAACGCAAAA GAGAATGAAG GCGATCTTGA GAAGCAAGTG GCCTGGTTCA TTGCCCCGAA TAAAGGGAAC ACGAGCTTTC ATATCTTTTC AAGCGAAGAC AATGACGATG AAGACATTGA CGGGATTGAT TCTGTATCGT CTTATCCTGC AGAAGCAAGT GGAATTGAGA GCCCTAGTAT ACGGCAAACA ATGGACGATG GTACTGCCTT TCGAAAACTG ATACCATCAG CATCACCCGT GGCAAGCCTA AAATAA
|
Protein sequence | MNGTSPTPTR LSSSPSIVPS VMPTATCHDV QSYRSPINNL ECSDHKGTDC IQWRHLGLNV NELEDLVNFC PESCNIDCGA LSRFELRLTY RLENVITFLG PETSETIELV GVEFFTKYIQ SITPQSRIFV NEAELLRQQI EPYFLGQREH ALRTSQSGPY VELLIETMFR GLTVHLTTDA LLGYLEEAIL STGFTQAIQG SGDPTLTQAI VSDHKKGRRF SPNAGSTAGN DDNRHASGSV VASMLMAFAA ICAGVCLFVW HKRKVRTENG DRGVNEPSAV SPTESERSQL ANIFSFESVS NAVANAKGAV LNVNFCHQKK NSTKNTPSRS STSESGSGET EDEPHPLAGL IPPMIVMNQI DSNEDSIRGS RNAEKIANVV PSHYMAASAQ LIASLNDRRT PHQVPNFSKV FVSVADFNAK ENEGDLEKQV AWFIAPNKGN TSFHIFSSED NDDEDIDGID SVSSYPAEAS GIESPSIRQT MDDGTAFRKL IPSASPVASL K
|
| |
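The sequences above are displayed in space-separated blocks of 10 characters, so the whitespace has to be removed before any analysis. The sketch below shows one way to normalize such a string, compute its GC fraction, and run a basic check that it looks like a complete CDS. The helper names are hypothetical, and the stop codon set assumes the standard genetic code, since the Translation table field of this record is empty.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumption: standard genetic code


def normalize(seq: str) -> str:
    """Remove all whitespace (block separators, newlines) and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the normalized sequence."""
    seq = normalize(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3, it starts with ATG, and it ends in a stop codon."""
    seq = normalize(seq)
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# Toy inputs for illustration; paste the full Gene sequence field to check this record.
print(gc_fraction("ATGC GCTA"))               # prints: 0.5
print(looks_like_complete_cds("ATGAAA TAA"))  # prints: True
```

Applied to the full Gene sequence field, `gc_fraction` gives a value to compare with the reported GC content, and `len(normalize(...))` gives one to compare with the reported Gene Length.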