Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44708 |
Symbol | |
ID | 7197932 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 1343927 |
End bp | 1346871 |
Gene Length | 2945 bp |
Protein Length | 790 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178409 |
Protein GI | 219115227 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTACCGAAGA GTCAGGATGG GGGCAATGGG AAAGAAGGCT AGCCTATGTA TCTCAGTCTG GTGCAACATC AGTAGTAAAC GCGAGATTAG CCCTGGGGAG ACTGGCTGGA GACGCCGGTC TTGAAGGTTC TCTTTTTGTC AGCGTCGGCA AAAGAGCACG GAAAGAAGGC ATTTTCAACA TTGCGAATAA TTCATTTGCC CAGGCCGAGG CTGCAGTCGC CAATATCTCT GATTTCGGCT CTTTGCGCAG TTCTCTGATA CTGGAAGTCG CTAAGCTCAA GCGTGACATG GGAGAGACCA ACCTGGCCTT ACGAATACTC GAGCCCGCGG ATGTGGGAGA CCTGTTCGAT ACCAAGAGCG ACCAGCTACA AACGGAAGTC ATCCGCAGAG CGTCTCAGAT GCTGCAACCC TCGGGGAAGC TGCTAAACAA CGGAAGGTTT GATGCAAATG GCCTACAAGA GAAACGACTC GTTGACTTTT TTGTTGAATG TGCGCTGCAA TCGACGAGAT GGATGATGGA TGGTGGATTA AAAGCAGGCG CAGAAATAAT GTACCGATTT CGAACGATAC ATCGGATTGC ACCTTCCTTT GAAAAGGGTA CGTTCCACTA TGTCCATTCA GCTCGCTTTT GAAGGCTTTT TTTCTCATAC GCAATTATTT AGCCCACTTT CTATACGCCA AGTACCTCGA TTCTGTTCTA AAATCTCGAA TTTCAGCGTT GCAAAGAAGA CAAACAGGTC AACTGCCGGG GTTAGATGAG GATGACTTAC GGAGTCGAAC GATTGGAGCC GACGAAGCTT GCCAGCGAAA TGTTCTACGA ATAATGGAGC ACTATACACA GACATTGTGT CTCAATTCGA TACATGTCTA CAATGCATTA CCACGATTGC TCTCCCTTTG GTTCGATTTT ACGTCTATAC AAAGTAGTAT CGAGGACCGG GAAGTGCAAG GTAAGTTAAT GCGATAGATG TGCGACTCGT TCTCTTGTCT AACTTTACGC TCCGTTCCGT ACAGGCAACT TGAAGCAGAA TCAGGAGGAA GCCAATGTGT TTATGGGTAA TCATTTTAAA AAGATTCCTG CGCAAGCCTT CTATACTGCT CTTCCTCAGC TGGTATCGCG TATAGTTCAT GTAGATTCTG ACACAGCATC CGTCGTTCGA GGTATTTTGA AACGGGTTCT TACAAAGTTC CCGAAACAAG CCATGTGGCC GCTCGCTTGG TTGCGACACT CAAAAGCTTT AGAAAGGAGA AAAGTAGGAG ATAGCATATT TCAGGATGCC GAGAAGACGC TAGTGAAAGC TTCAAATCAA ACCCACTACA GAGTCTTGAT GGCGTCAAAG GGTCTCTTCA AATTTTTCCA AGAACTTGCA AAGTACAAGA ACTCGGATCT ATCCACTCAA TCAATAAATG TCAAGCCCTG GAAAGGCGAA GTAGATCTAA CTGAATTTAT TCCTCCCGTT CAAGCAGCGT TGTCAGCTTC ACTGGACTCA TCTGAGAGCG CATTTATGAG CGATCCGTTT CCTAGACAAG TTCCACGCAT GAGATTGTTC TCACAGCGCG TTTCTGTGAT GAGCTCGAAG GCCCGTCCCA AGAAATTGAA AGCCTATGTT GTTGCAGCTG ACTCTAGGCT GTCCTCAGCA TGTGCAAGTA ATGGAACGGA TCAAAATCTA CCCGACATTG GTGAGATTCA TTTTCTTGTC AAACAGGAGG CAAAAGGTGA CCTTCGCAAG GACGCACGTG TTCAGGAGCT AAATAATGTT ATCAACCGGC TAATGGCGAA TTCGAGGGAC TCAAAAGGAC ATACTACGCA CAATAGACGG CACGGACTAA GAACCTTTGC TGTCACTTGC TTATCAGAAG ACACAGGTCT GCTGGAATGG GTGCCCAATA CGTCTTCACT TCGAAGTCTC GTATCGGTAG CGTACAATCC ACAGGCAAAC GCGTTCTCTT CTCGGCGACG TGGATCCCGC CTAGTTGCAA TGAACGATCC AGTTTTGCGA GGAAATTTTG AGAAGAAATG CCAAGCAATG TATTTCTCAG ACGGAAACCT ACGGAAAGCC GCTACTTTGT TTGAAGAACT CTGTCTCAGA CAATACCCTC CTTTACTCTA TTGGTGGTTT GTACAAACGT ACTTGGATCC GCATTCCTGG TACGAAGCTC GAATCAGATT TACATTGAGT GCTGCTGCTT GGTCGGCTGT CGGGCACGTG ATTGGTCTCG GAGATCGACA TTCAGAGAAC ATTCTGGTCG ATGCATTGAA CGGGGAATGT GTCCATGTGG ACTTCGATTG GTACGTTGAT CAAGGCAGTT TCGTACTTTC TCATTAAGTT GCCGACAGCC TCATCATCTT CTTCAACTGA ATCAGCATTT TTGACAAAGG CCTACTGTTG CCTCGCCCAG AAGTTGTTCC GTTCCGTTTA ACTGCAAATA TGGTGGACGC CTTCGGGCCG ACCGGGGTGG ATGGCGTCTT TCGGAGTGGA TTAAAATCAG CCATGACTAC CCTTCGCGAC AATCGCGATA CACTACTGTC CGTCTTGGAG CCGTTTGTCA AGGATCCCGT GATTGATTGG AAAAGATACC GATCACATCA ACGCAACGAC GCGACACCGA CACAGGAACG TCCCGTAATG GAAATGAAGC GATCAATTAA CGTGATTGAT GAGCGTTTGC AGGGCATCTA CAATTTAGGA AATCCAAACG CGAAAAAAAT CCGAAGGACA GATGGGTTCA TCGATCAGGA AGACGACAAA ATAACCCAAA TGTTACCTTT GTCTGTCGAA GGGCAAGTGC ACAAAATGAT TGCCGAAGCA ACGAGTAGCG AAAACTTGGT TCAACTGTAT GTAGGATGGA TGCCGTGGGT ATAGCTCTTT TGGCAGCAAT CAAAGACGAG AGAAATCAGT TTTGCTTCAC TTAATCTACC TAGCCTAGAA GCACAGCAAA TTGTG
|
Protein sequence | MGETNLALRI LEPADVGDLF DTKSDQLQTE VIRRASQMLQ PSGKLLNNGR FDANGLQEKR LVDFFVECAL QSTRWMMDGG LKAGAEIMYR FRTIHRIAPS FEKAHFLYAK YLDSVLKSRI SALQRRQTGQ LPGLDEDDLR SRTIGADEAC QRNVLRIMEH YTQTLCLNSI HVYNALPRLL SLWFDFTSIQ SSIEDREVQG NLKQNQEEAN VFMGNHFKKI PAQAFYTALP QLVSRIVHVD SDTASVVRGI LKRVLTKFPK QAMWPLAWLR HSKALERRKV GDSIFQDAEK TLVKASNQTH YRVLMASKGL FKFFQELAKY KNSDLSTQSI NVKPWKGEVD LTEFIPPVQA ALSASLDSSE SAFMSDPFPR QVPRMRLFSQ RVSVMSSKAR PKKLKAYVVA ADSRLSSACA SNGTDQNLPD IGEIHFLVKQ EAKGDLRKDA RVQELNNVIN RLMANSRDSK GHTTHNRRHG LRTFAVTCLS EDTGLLEWVP NTSSLRSLVS VAYNPQANAF SSRRRGSRLV AMNDPVLRGN FEKKCQAMYF SDGNLRKAAT LFEELCLRQY PPLLYWWFVQ TYLDPHSWYE ARIRFTLSAA AWSAVGHVIG LGDRHSENIL VDALNGECVH VDFDCIFDKG LLLPRPEVVP FRLTANMVDA FGPTGVDGVF RSGLKSAMTT LRDNRDTLLS VLEPFVKDPV IDWKRYRSHQ RNDATPTQER PVMEMKRSIN VIDERLQGIY NLGNPNAKKI RRTDGFIDQE DDKITQMLPL SVEGQVHKMI AEATSSENLV QLYVGWMPWV
|
| |