Gene Information |
Locus tag | PHATRDRAFT_44751 |
Symbol | |
ID | 7199873 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | - |
Start bp | 129496 |
End bp | 131354 |
Gene Length | 1859 bp |
Protein Length | 376 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178936 |
Protein GI | 219116282 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00155872 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTTTTGATT GTGATCTCAA CCCTTTGTTG ATCTACTCAA CAAGCTGCAG CCAAGGGATC TGATCAATCC CTATCTCAGC TTGTCATTCT TTTGCATCCT TTCATCCTTG CTTTTTCGAC TGTTTGGATT GTTCCCTGCA AAAGGAAAGT TCTCTCTATA TTGAAGAACA CAAAACAACA ATGGAGGTCA CAGTTGAGGA AAACAATGAA CTACAAACAT TATAAGGGTA CATTGTACTC TTGGATCTTT TTGTGGTGGC TGTGAGGCGT TGAGTGTGTG GCGGAAGCGA CTTGGCTTTC AGCAGTAGGC GATTGAACTT TGGGGGAAAC GGTGTCCTTA CCAGGAGTGA GATCAACACT CTATGTGATA TCACGAGGTA ACCTGTTCCT AAAGAAGCAC CAATGCCCGT ACCGCTTGAC AGTGATCTCC ACCGAGCAAG TAGGTTGGAA ATCGGCAAAA ACAGTATCGT ATAAAACGAT CGCATCATCA GGATGCCCAA AAAATACTAC GCGGTGGCGG CGGGACGTCG GACAGGAATC TTGGATTCTT GGGTAGAATG TCAAGCACAG GTGCGTGCTT CTGGTCCTCC GCGCCACGAA TACTACTCAC CACTTCCTCC CACGCGTTGA TTCACACTAC GTGTCGTCCT TTCTTGTCTA TATCGAAAAG ACTAACGGGT TCAGCGGGGC GAAGTTTAAA AGTTTTCACG ACAAGCAGCA AGCGGAAGAC TACTTGCGCG CACACGGCCA GACTTCGGCT TCGCCGCCGT CGAACGCCGA CGTTCGTATT ATAACAGCGT TTAGGTCGGG ACCTCGCTCC CCGGTAGCAT CGCCAGCAAC AGTCAGTTGC GAAAAAGCCA TTGAGCATAT CCGCAGTGCG CATGCAGCGA GGCCAAATCC CGATGTTCGC ATCGTCACCG CTTTCACTTC CCGTCCGTGT AAACGAGACT CTCCCGAGTT TGTTTCCGCA GCCCGTCCGC GAAAATACGC CAAGACGGGG TATTCAGTCG GGAGTGAGTT AGACAATTCC GAGAATGCAA AGGCGGAGAC GCGGCATCGA TCTTTGTGCC CTGTAGGGAG CAGGCGGCTC AAGATTCACA TCAATTTTGA CGGCGGCTCA CGGGGCAATC CAGGGGTCGC CGGTGCCGGT GTTGCCGTCG TACTGACCGA CCTAGATTGG CGGATAGGTG ATGGTCTGTG CGAACGACTT AACGTGCACC TCAGATTTTT CGTAGGTACA GGTGCCACCA ATAATGAGGC CGAGTACAGT GGCGCGTTGT GGGCTTTAAC GGTGGCTCGG GAGGAAACAC GTCGTTTCGA GTCATTTTAC GATTGTGAAG CTCATGTACA GCTTGTTGTG CAAGGCGATT CCAAGTTGAT CATTCAGCAA CTAAAAGGGA ATTACACCTG CAAAAGCCCC AAGCTGAAGC CATACTACGA GAAAGCAATT CAGCTTTTAG ATGACTTCCA AAGCTTTGCA CAATTTCGAC TGTCACTGGA GCATGTCTAC CGAGAAAGCA ACAAACAGGC AGACGGTGCG TTTACTCGAC TGACTTTTAG AGCTATGGTG ATGCTTTTGC TTTCGCATCT CACTGTAATG GTGCACGTTT ATGCTCTGTG TGTTTTGTTC CTTGCTAATG TAACTCAGGA CTTGCAAACG AAGCAATGGA TGCACAGCGA AGCTGGTTGA CAACATCGCT GGATGGGCAT GAAATGCAGC ATGCTCTTAG CGATCGTTAT CGTGTGTGCT CTCACTGAAA ATAGACAGCA AGAGGCTCGT TCTTGGTTCT GCCGTTTTAC 
TACGACAAGA TGTAGCGTAC AAAGCATTCG GTTACAGGCT TTTGGGATAC ATCATATGA
|
Protein sequence | MPKKYYAVAA GRRTGILDSW VECQAQTNGF SGAKFKSFHD KQQAEDYLRA HGQTSASPPS NADVRIITAF RSGPRSPVAS PATVSCEKAI EHIRSAHAAR PNPDVRIVTA FTSRPCKRDS PEFVSAARPR KYAKTGYSVG SELDNSENAK AETRHRSLCP VGSRRLKIHI NFDGGSRGNP GVAGAGVAVV LTDLDWRIGD GLCERLNVHL RFFVGTGATN NEAEYSGALW ALTVAREETR RFESFYDCEA HVQLVVQGDS KLIIQQLKGN YTCKSPKLKP YYEKAIQLLD DFQSFAQFRL SLEHVYRESN KQADGLANEA MDAQRSWLTT SLDGHEMQHA LSDRYHSKRL VLGSAVLLRQ DVAYKAFGYR LLGYII
|