Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_44203 |
Symbol | |
ID | 7204117 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 1300825 |
End bp | 1302882 |
Gene Length | 2058 bp |
Protein Length | 658 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186501 |
Protein GI | 219113835 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGTCGA CAGCACTCTG CAACATGCGA CTGTTATATT TTACCGGACT ACTATTTGGA CCGACAGTAT TGACAGTGAA TGCCGAATGG CTGAAAATCA ATGACACAGT CAACACTATT ACATCCCATC AAGACTTCCG AACGACAATT TTTCGCTCCT TCGATCATGT TGTTTGTCGT ATTTTAGTGG AGCTTGTTGA CTACGATGTG GATCAAAAAG ATTCCATGCA AAATACGTGT GAAGTTATCG GCCCGCCTGA GCTTCTTGGG AGCAGCCGAT GGCGTGGGGG CGGCAGTCCT GGGCTTGTCA TTGAGCTAGA AGAGAAGCTG TGGGAACAAT TTAGTTTTGC TCCCGAGGAT TTGGTCCTGG ATTTGATGGA TGCAGTAGTT GGCGAAAGAT ACTTCTCAAA CGTTGGCTGG CTCCCTACCA GTATTTCGTT GTCAAGTCGT GGCAATGCGA CTATTCATCA CGGAGCGTTT CATCGACGGA GGCAGCTGGA GTACTTTGAT CGCCCACGTT CAGTCATGCT CGTGAGGGTG AGTACAGATG ACGAAAGCGT GAGTCTATCT CCATCGGAGA TTGCAGATTC GTTCTTCCAA CCCGGTCGCT TTTCTTTTGC TTCACAATAC GAGGCGTGTT CTGCGGGACA GGCTCGGTTT GTGCCATTCG ACGTCGATAA CGTAGTAAAG GAGGTTCGTG TATCTGGAAT CAGCAGTGCC TTCACTGGCA GGAGTCTGCT TGTAGCGGCA AACATAGCGA CTCAAACGTT ACTTGGCGTT GAGAGTTTAC CAGACGTAGC GGATCATGTC GTATTTTGCA TGCCGCCTGG AACGAGTGGA GAAAAATATC TGGCCGCTGC TGCTCTCAAT AGTTGGTATG CCGTAGCCAG CGACAGGTGG TGTGCGCAGC CGAACGCAAT CATCCATGAG GTATGTATTC CCAAGCTTAA TACTTGAATG AATGTTTGTG ATGCTTTCGT ATCTGACACT TTTCTTTTTG CTCCATGGAA GATCGGCCAC AATTTGGGAT TAGGTCATGC GGGAAAGGGC GACAATGCCT ATGCCGACGG GACTAGCCTA ATGGGGTATT CGGAGGTTCG CGCTGATGGC CCGTCCAAAT GCTTCAACGG ATACAATTTT TGGCTACTAG GTTGGTATCC GAATCACAGA ATCGATCTTG ACGCGTTGGC CTTAACGGAA TCGGTTCGAG TAGAGATTGC TGCATTTACG GATTTCGGTG TCATCGGTAA AAACGAAGTA ATTATTGCCA AAGCTGGCGG CCTTTATTTG ACCTACAATC GCGCCGAAAC GTTCAACAGG GAAACAGAAG ACTTCCTCGA CACGGTAACG ATTACGAAGC GCGTCGATGG AATACACACG AAACTATTGG CTGGCCTTAG CGAGACCAAC TCCCGGTATA TTTCTGGCGA CAAATTGATT TTCGAGGTAT GCAGCAAACA TGACGGAGGA AGTGTAGCAG CCGATAAGTT TGTCATTGGT ATCGGCAAAG AGATGCTCCC ATGCAAGTCT GCTATGATAT TGGGCCCAGA TGGAGTCGAC GCAACTTTGT CACCTCCTAC AAGAGCTCCA TCTATGACAC CGACGTTAGC ACCAATATCA ACACCGACAT CAGCACCAAT ATCAACACCG ACATCAGCAC CAACGTTCGC ACCAGCACCA ACGGGAAAAC CCGTGACTCC GCAACCATCG CTGCATGAAT ATACAGGTTT CCCAACACTA TCTACTGGCG GACCGTCTGA GCCCTTAGAC GACATCTCAG AGCCCTTTAA CCCGGAAAGG TCTCAAGTGC CTGTCGGAGG AGACGAGAGG GTAGCCGCGG AAGAAAGCGA CCGATTCCCT CTTATACCCG TGGTGTCTGG TATAGCTTGT GTCATTTTCT TTGTTTTCTT GACTATTTTC CTCATTCTGC TGCGCAGGAA GTGGTCGAGA CAGGCCCGCA GTCGGTGGCA GGTTTCAGTA TTAGGTAGTC CAAAAAAGCA ACACACTGAC AATGAATCTG AAACTTCGCA ATTTGGCCCT TCCGATGAAT CATGCTGA
|
Protein sequence | MASTALCNMR LLYFTGLLFG PTVLTVNAEW LKINDTVNTI TSHQDFRTTI FRSFDHVVCR ILVELVDYDV DQKDSMQNTC EVIGPPELLG SSRWRGGGSP GLVIELEEKL WEQFSFAPED LVLDLMDAVV GERYFSNVGW LPTSISLSSR GNATIHHGAF HRRRQLEYFD RPRSVMLVRV STDDESVSLS PSEIADSFFQ PGRFSFASQY EACSAGQARF VPFDVDNVVK EVRVSGISSA FTGRSLLVAA NIATQTLLGV ESLPDVADHV VFCMPPGTSG EKYLAAAALN SWYAVASDRW CAQPNAIIHE IGHNLGLGHA GKGDNAYADG TSLMGYSEVR ADGPSKCFNG YNFWLLGWYP NHRIDLDALA LTESVRVEIA AFTDFGVIGK NEVIIAKAGG LYLTYNRAET FNRETEDFLD TVTITKRVDG IHTKLLAGLS ETNSRYISGD KLIFEVCSKH DGGSVAADKF VIGIGKEMLP CKSAMILGPD GVDATLSPPT RAPSMTPTLA PISTPTSAPI STPTSAPTFA PAPTGKPVTP QPSLHEYTGF PTLSTGGPSE PLDDISEPFN PERSQVPVGG DERVAAEESD RFPLIPVVSG IACVIFFVFL TIFLILLRRK WSRQARSRWQ VSVLGSPKKQ HTDNESETSQ FGPSDESC
|
| |