Gene Information |
Locus tag | PHATRDRAFT_50274 |
Symbol | |
ID | 7199116 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011696 |
Strand | - |
Start bp | 162302 |
End bp | 164212 |
Gene Length | 1911 bp |
Protein Length | 574 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185221 |
Protein GI | 219130121 |
COG category | |
COG ID | |
TIGRFAM ID | |
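The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration and not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding region is one codon per residue plus one stop codon. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 162302
end_bp = 164212
protein_length_aa = 574

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: coding length is one codon per residue plus a single stop codon.
coding_length_bp = (protein_length_aa + 1) * 3

# Bases not accounted for by coding sequence; for a spliced gene these
# would be intronic if the assumptions above hold.
non_coding_bp = gene_length_bp - coding_length_bp

print(gene_length_bp, coding_length_bp, non_coding_bp)
```

Under these assumptions the computed span matches the listed Gene Length of 1911 bp, and the surplus over the coding length is consistent with the gene being marked as spliced.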
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCGTCC CGAACGCCAC ACCCAATTTC CTTCCGTACC GTGCCGTCCA GTTCCAACAA CTGATTCCTG TAGCCCTGTC AAGTTTCGCC TACAGCGAAG CGTACGTACA CGGTACGACG GTAGGGACCG CTTTCCAACG ATAACCCAAA ATGACGACGA CACGACGACA GATCGTTCGA GCTTGCAAGG CCCGAGGCTG GACGATCCAG CCCGCGGCTT TGGCTGGCGT TGAAGCATAC CTGGGATCAT CCCGGGACGG ATCTTTGGAA GATCTTCTCA ACGTCGTTGC CGAACGAATA CCGGGACAGA CTTTAACGGG AGACTTGTGG GACGACATTG CGGGAGAAGC GGCGGATACC GGTCGATCCG CGTCGACGAC GATGACGACG ACGGATGCGT GGGATGGTTT GCAAATTGTC AACGCATTCC AATCACCCAA ACTAGTATTC GAAGTCATGC GCAAAAACTT TGCCGTGGAC GAGAAGCCGC GCTCTTTGTT TGGTACTGCA GAAGATAAGG TGCGTCGATA CTCGAGACCG GTATAGGTGT TGGAAAGAGC AACTACGAGT GCACTGTCTC ACCAGTCTAC TGTCTCGCTC GTTACAATGT CTCTCTCTCT CTCAAACACA CATGTGCATT ATCGTGCGAC ATAGATCTCC ATGCTGGCCC AGCGCTACGC CATGGTGCAC CAACGCATTC TACGACACAA GCTCTTCCGA CCCGCAGATT TACACGCTCG CCAACAAAGT ATTCAACACA AGTTGACTCC GGTGGAATCG CTGCTCGGGA AGCATGACAA CCAAAGTCTG CTCATACTGG GAATTCTACT ACAGATTGAA GAAGGACAGT GGTACCTGGA AGATCCTACG GGACAGGTCC AAATATCCTT TCAAGACGCC AGCGCCGTGG ACGGATTCTT TGTGACCGAG CATTGCATTC TCTTGGTCGA GGGTATGTTC CGGGATGGTA CATTCTGCGT GCATCGTTTG GGTCATCCCC TCCTGGAGTC TCGCGAAACG TCCTTACAGA CCATTCGACA ACAAGTCTTT CATCCCAGTT TTCGCAAACC CGTCCTCACG GTCGGTAGCA TGACGAAAGA ATCCTCGTTG GTCGTTCTGT CGGACCTGCA CTTGGACCAA CCCCGCGTGC TACAGCAACT CGAAAGTTTG TTCGCCACCT ACGACAAGTA TGCTCCCGAC CGACTCCCCC TGTTTGTTCT CATGGGAAAT TTCTCGTCCA CCCCTCAGTC GCACCCTTCC CAGCTCACTC CGCTACTCGA CGAATTGGCA ACCCTCATTG GATGCTTCGC GAATCTGCGA GCCCACGCGC ACTTTTGTCT CGTGCCGGGT CCACACGACG GGGTTGGGTA CGTCCTACCC CTGCCCGCGC TACGGAAAAC GCACGCCCTG GATAAGATCG CACACGTACA TCTGGCTTCC AATCCCTGTC GTATCCAGTG GAGAGACCAA GATATCGTGG TCTTTCGTTA CGACTTACTC CACCTCTTTC AGCATCACCA GATACGTTTG CCCGGAACCG ACCGCGCCGC ATCGGACGAT GACGACCAAC ATGGCCGCCA ACCACATTGT CGCTTGCTCA AGACCATTCT GGATCAAGGT CACGTCACAC CGGTCGCGGG TGTTCCCATC TTTTGGAATT TGGATCACGC TCTTTCCCTG TATCCCTTAC CGACGGCACT GATACTGGGC GGGGACACCG CCACGCAAAA AGACGCCTTT CACGAAGTTT ACGGTGGAGT CCACGTGCTT CATCCCGGCG CTTGGGATGA CGGGTCGTAC 
GCGGTGTACA CGCCGGGACG CCCCGAGACA GCCATGGTGG CAGACGAGGA AGACGCGCCC CGCGTCGTCG AATTTGGTCG GGTGGGCGAC AGCGTTCCAG CCGAGGCCTG A
|
Protein sequence | MVVPNATPNF LPYRAVQFQQ LIPVALSSFA YSEAYVHGTT IVRACKARGW TIQPAALAGV EAYLGSSRDG SLEDLLNVVA ERIPGQTLTG DLWDDIAGEA ADTGRSASTT MTTTDAWDGL QIVNAFQSPK LVFEVMRKNF AVDEKPRSLF GTAEDKISML AQRYAMVHQR ILRHKLFRPA DLHARQQSIQ HKLTPVESLL GKHDNQSLLI LGILLQIEEG QWYLEDPTGQ VQISFQDASA VDGFFVTEHC ILLVEGMFRD GTFCVHRLGH PLLESRETSL QTIRQQVFHP SFRKPVLTVG SMTKESSLVV LSDLHLDQPR VLQQLESLFA TYDKYAPDRL PLFVLMGNFS STPQSHPSQL TPLLDELATL IGCFANLRAH AHFCLVPGPH DGVGYVLPLP ALRKTHALDK IAHVHLASNP CRIQWRDQDI VVFRYDLLHL FQHHQIRLPG TDRAASDDDD QHGRQPHCRL LKTILDQGHV TPVAGVPIFW NLDHALSLYP LPTALILGGD TATQKDAFHE VYGGVHVLHP GAWDDGSYAV YTPGRPETAM VADEEDAPRV VEFGRVGDSV PAEA
|
| |