Gene Information |
Locus tag | PHATR_44137 |
Symbol | |
ID | 7203889 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 1098281 |
End bp | 1100512 |
Gene Length | 2232 bp |
Protein Length | 537 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186466 |
Protein GI | 219113765 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.128731 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTCACCGCAC TGCCAAACTC GTGTGTGCAA AACCAACACC TATCAAGGCC AACACATGTT TATGTTCTCC TGTCACCATT TATCGGTTTC TGAGCAATCC TTTTTTTTAC AATATGCCTA GCGGCCAGGA TGGGCCGATC AAGGGTACTT CTAAATTGTG GGTATCCTTG TCGACGGTCA AGAAAACCCT TTTCGTCTTT GCCTGCCCCT GCCTGCTGGC CTGGTATTTT CATAGTCTCT ATTCAAACGC GTCAGAGAAT GCACTGAATC GTTCTACTGA TAATAAATTT CGAATCGGTG AATCCGCTTC TCCCGCTCGG AGTTCAGCCT CGGATCTATG CCGCAGAAGC ATTATCCAAA GCCTGGGAAT CGAAGCGGGT GAGTTTGGTG AAAAGGAAAT GGAATCGTTC GAGAATGTGT CCATTGCCTT ACATCTGAAT GGAGAATCGA AGCATTGCGG AGAAACGAGC TTAATCAAGC TCATAGAAAC TCTGTATCAA GCCGCAAACG AAATGGATCA ATGTTCATTC ATATTTGACA AGTATGTTGT CGAGGCCGTT TTATCCAGAG CATTTCAAAA TCTTGTCGGA CAAACGTGTT ATTCAAGTGA AAAGGATCGG ACTGAAGAGG ACGGACTGTA TGGGTTTTGC GATATGGGCA CCAACAAGAC TCCCATATTA GCCGATCACG ATGAGCTAGT TTCGATCGCA TATGGCGGAG TAAAATACCT GCCTTGCCAC TTTCATACTC GTCAAGGTCG CCGGATCCAA GGGGTACAAG ATCTCGTAAG TGGTTTGACA GACGAAGCTC CTTGTCCCGA TGCTCTTTGC CTCCATCTGT ATGCGGTTCA GGCAGGTCGT CACTTTGTTT TTGCTCCAGG AGCCGTCGGC GAAATTATAG AGCTGCCTCA CGTTATGGGC GGCAATCCCG AACAGCCTGT TTATCTAAGA ACGCTCTCGC TTTCTCCACG TGTATTCGAC ATTGTCAACT TTTTCCGCCT CGAAGAAAGC CAGGATCTTG TAGCTCGAGT TTTGAAAGAA GAACGCGAAC TATGGCGTCT GAAGCGATCG ACGACTGGAC CCGACTCCTC GAATGTTAAC AAACGCCGCA CGTCAGAGAG CGGATTTGAC ACAAGCAGTG AATCGGCGTT GAAGCTAAAG ATGCGTGGCT TTCAGGCTCT CGGGTTTGAT GATTATATCG AAACCTATGC TGACGGATTA CAAATGCTTC GCTACAATTT GACAACCGCA TACAACGCTC ATCACGACTT TTACGGTGAC ATTTCAGCTG TGACGGGACA TCACAACTTC AATTCTGCTG GAGTTGGTGC TGACCGTTTT GCGACAATTT TACTTTATAT GACGGAACAC GGTGAACGTG GCGGTGGGGA AACCGTCTTT ACTGAAGCTT GGCCTACTGA ACTCGCTCCA GGCAAGAGAG TTGAGCTCCC AGATGTAAGA GCACAGATCC GTCCTCTCGT CTGCGTCTGC ATATTGGGCA TGTTATTCTC ACTATATCAT GCTTTAATGT AGGCCATCGA GTCCTTGCGA GCCTCAGGTG ATGCGTCCAT GTTGAAGACA GGATCTTGGG AAGAAACTCT GGTATGATTA AGTGGTATGT TAAGTTAACC GCTGTGGACC TCTCTCTCAG CTTCTTTTAC TTCTTCTTTG CATGGTAGGT TGCGAAATGT CGGTCACGGC TGGCGGTGCA ACCGCACCCC GGACGGGCAG TTTTGTTCTA CTCGCAGCTC CCAAATGGAA AACCTGACTA TACGTCGCTA CACGGTGGCT GTCCAGTGTT 
GGCAGGCCAA AAATACGCAG GTATGTACTT GGGGAGAGGT AGGAACAAGT GCTTAATTAC TCGAGTCTAC CTACTGACAA GTTGAGATTG CTGCTTCACA TGGCAGCGAA TCTTTGGGTT TGGAATTCGC ATCGCCACGG ATACAGCGAA GCTCCTATTC ATCCGTAGAA ACACTGAAGT GAAGTAACAT TTGCTTCGCC CATTGAACCA ACTACACCAA TACCATCAAA TATTTCCATT TTGAATTGCC AACCCAAGCA GAAAACATGC ACATTCTATC CAAGGGAGGG TATGGTGTTA GTGTAGTGTG GTGGGAAAGG TGAAAAACTC CCAAAAGGGG ACGGAATGTG TTCAGTCTAA GTAGCATCAT TTGGCGCTTG TATATTGTTA AGTATTCAAA CTGTTTGAAC TAAACGCCCA TTTTTTTCCC GG
|
Protein sequence | MPSGQDGPIK GTSKLWVSLS TVKKTLFVFA CPCLLAWYFH SLYSNASENA LNRSTDNKFR IGESASPARS SASDLCRRSI IQSLGIEAGE FGEKEMESFE NVSIALHLNG ESKHCGETSL IKLIETLYQA ANEMDQCSFI FDKYVVEAVL SRAFQNLVGQ TCYSSEKDRT EEDGLYGFCD MGTNKTPILA DHDELVSIAY GGVKYLPCHF HTRQGRRIQG VQDLVSGLTD EAPCPDALCL HLYAVQAGRH FVFAPGAVGE IIELPHVMGG NPEQPVYLRT LSLSPRVFDI VNFFRLEESQ DLVARVLKEE RELWRLKRST TGPDSSNVNK RRTSESGFDT SSESALKLKM RGFQALGFDD YIETYADGLQ MLRYNLTTAY NAHHDFYGDI SAVTGHHNFN SAGVGADRFA TILLYMTEHG ERGGGETVFT EAWPTELAPG KRVELPDAIE SLRASGDASM LKTGSWEETL VAKCRSRLAV QPHPGRAVLF YSQLPNGKPD YTSLHGGCPV LAGQKYAANL WVWNSHRHGY SEAPIHP