Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43693 |
Symbol | |
ID | 7196996 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 1203493 |
End bp | 1206326 |
Gene Length | 2834 bp |
Protein Length | 871 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177780 |
Protein GI | 219112057 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.653175 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGACTT CCAGTCCAGA TCTTCTTGGC GCGTCCTCTC CTGCTAGCCT AAACGCTGGA ACTTTGGACG ACCGGCTAAC AGCTAGTCTG AACTCCCAGC AATTTACCGT CGCGGCCTAC TTGAATCTAG CACTGGCGTC ACAACGCGAG AACGCAAAAG ACGATCCTCC CGACGTCATC GCGCAACAGC GCATGGCCGA ACTTGCTTTG CAGCTTCAAC TGCAGACACA GTCGTGCCAC GAAGAAATCG GACGCATAGG GGCGGAGCTG CAGGCTATTT TGCCCCGTTG CGCGGCTGAT ATCGGACGGG TAGGTGTGGG GCTTGAAGGA TTGCGTCAGG ACGCGACATC CCTTTTGGAA ACCACTTCCG TGGATATGGA ACAAGACGTC TCGTCGTCTT TGGAAACACT CAGTACCTTA CATGCTCTGC AAGCCAATTT GACGCGCACG AAAGAAATTT TGACGGCCGC TGCAACTTGG GATTCCACCC TTTCGACTAT CGCACCGCTG CTCGCGCAAC AAAACTTGCC CGACGCCGTC AACGCGTTGG CGCAGCTCGA AAATGGTGCA CAAGCGCTTC AAGGCATGCC TGGTCTGGAA GATCGCGACA TCGCAGTTGC TAATGTTCGT CAACAAGTGT CGATTCTGTT ACAACCTCAG CTCCAGAATG CTCTCATCCA TATGCAAACG CGCTTAGGGC CTCTACAACA GTGCGTGCTT TTGTACTCCA AACTTGACAA AATCGACGCA CTCAAGGAAG ATTACGTGAA AACGCGGCCT ACGAGTCTCC ACAAGTCGTG GTTCGATTAT AGCCCTTCGT ACGGTGACGA TGTCGCTGAT CAGAACGCAA CGGCGTTTTT AGCTTGGCTG CCAACTTGGT TTGATGCAGT ATTGACGTTG ATTGGTGAAG AGCGACGACA AGCTCTGACA GTGTTTGGGC CAGAGAGTGT GTCGGAAATA GTAATGAAGG TATGTACAGG GGAAGCTGTT TACCGATGAG GCACGCTTGT ATTGAAACAT TTTTTGATTC TGGTTAGGTA TTTCGAGAAT GCTTCCGTCC AATTCTGCCT TCCTTTAAAA GTCGTTTGGA ATCAATCTAT TCTTCGGAGG AAACCGGTCC TTCAAAAGGC TCTTTACAGT CGGTATGCTC CATTTATGAG TCTACCCTAC AGTTCCTTTC TTTGGCGTAC GAAACGATTG CCGGCGGTTG GCTCGATTTG GTAGAGGGCG GAACGATAAA GGGCAATGGA TTATCAATTT ACAAGGAAAT GGGGTTCGTC TTTCGCCAGA TTGCGTCCCC CTTCGTTTCT TATCAACAAC GGCTTCCAAA TCTGGAAACA CGATATTCGA CTGCGACCAC CCAGACAATC ATACGAGAGA TGCACCAGGC CGTCTCGGAT GTGTCGAATG GAAAGGCAAC GCTGGAAACA TTGCAAACTG CTACGCAGCT TTTACAAGAG CTCTCAGGTG CATCATTCCC CTTGGCTGAA GGCGCCGTCG CCCGCTTTGA GTTGTTGAAC GGAGGCTACA ACAGCGCCTC TGCGCTGCAA GCTGTAGATA AGATTGTTGC AACTTACTGT GGCGAGCTTG CCATCGCTAT TAGAACCTTG TCCGCAACCA CGACGGCCGA TGAAACTGCG TTGGCTGTGA ATTTCGACGA GTCGCACGTT CTTTGTGCGC TTGAGGTTCT CAAGATTGCC GGCGCCTTTC GGAAGAATTT ACTTGATCTT GAAGTGAAAA CACGAGAACG TTTGACTGTG TTATCAAGCC GCATGTCGTC TTACATTTCC AAGGAGAAGG AATTGGAAGA AGTCCCTGCA ACGACAACTC GAAAATCGTC CGCTGCTGTC GTCCTATTGC CAGATTCCTT TTCAGCTGTT GAAATTGATT CCTTCTTGAC GAAGGCTGTT TGCTTCGACG AGGAAAATAA CGAAACAAAT GCTGCGTTAG TCATTTTGCA GCGTTTGGCG GAATCAGGAC CGACGTCTGT GCCGCTCTAC CCTGAAACCG AGGATGCTAC ACGACGTTTG GCAACTTCAT GCCACACGTT TGTCTTCGAT GTATGCGCAG CTGTTCCTCG TCTTCATTTG AAGGGAATGT CTTCCTTGCA AAGCTGGAAA GAAGCGAATG AACAGGATAT CAACTCGTAT GGCATTCTCC CCCAATCATA CATTACACAT GTCGGCGAGC ACATGTTAGC GCTTGTGCAG GCTTTGGAGC CATTCGCATC TGACTCTGAG GCACTGGGTT TGGCAAACGA AGTCATGGGT GGAGTCCACG GTGTTGCCAT GCAGCCATGG AGAGAGTTTC TCAGTGCGTC AGGGACCATG GGTTCAGAAG ATGTCATCAA GAGCCTGATG AACGGGAAAA ACCTCGACAA CTTCGTTGTA GCATCGGCTG CGCTTGGAGA GGAAGAAGGA ACCGAAGAGG AAGAAAGCGA GGCCAACAAG TTTTGTAACG TTTGGTTGGA TGTTGTATCA ACGGCTGTCA CTGGTCGTCT GTTAGAGCGG ATCATGCGAA TTCCTTCCTT GACTCCGAAA GGATGTGAGC ATTTGAATAC TGACTTGAAC TATCTCGTCA ACGTCTTTTC GGCGCTCGGC GTCCGCGGGC ACCCGCATCC ATTGCTTGGT CATTTGGCCG AACTTGCTTT GGTTGAGGCG GATGTTTTGG CGCATCGTAT TGACGGCCGG AATCGTGGTA GTTCGATTGA AGCTTCCCTA CGATCCGTAG AAGAACGACT TTCCGCAATG AAATCAAGAT ACTAAAAGTG ATGACTCGAA GCAAAACGAT AGTTATGGCC TTTTATACTA AAATAGAAAA ATCTAGATAG CAGC
|
Protein sequence | MATSSPDLLG ASSPASLNAG TLDDRLTASL NSQQFTVAAY LNLALASQRE NAKDDPPDVI AQQRMAELAL QLQLQTQSCH EEIGRIGAEL QAILPRCAAD IGRVGVGLEG LRQDATSLLE TTSVDMEQDV SSSLETLSTL HALQANLTRT KEILTAAATW DSTLSTIAPL LAQQNLPDAV NALAQLENGA QALQGMPGLE DRDIAVANVR QQVSILLQPQ LQNALIHMQT RLGPLQQCVL LYSKLDKIDA LKEDYVKTRP TSLHKSWFDY SPSYGDDVAD QNATAFLAWL PTWFDAVLTL IGEERRQALT VFGPESVSEI VMKVFRECFR PILPSFKSRL ESIYSSEETG PSKGSLQSVC SIYESTLQFL SLAYETIAGG WLDLVEGGTI KGNGLSIYKE MGFVFRQIAS PFVSYQQRLP NLETRYSTAT TQTIIREMHQ AVSDVSNGKA TLETLQTATQ LLQELSGASF PLAEGAVARF ELLNGGYNSA SALQAVDKIV ATYCGELAIA IRTLSATTTA DETALAVNFD ESHVLCALEV LKIAGAFRKN LLDLEVKTRE RLTVLSSRMS SYISKEKELE EVPATTTRKS SAAVVLLPDS FSAVEIDSFL TKAVCFDEEN NETNAALVIL QRLAESGPTS VPLYPETEDA TRRLATSCHT FVFDVCAAVP RLHLKGMSSL QSWKEANEQD INSYGILPQS YITHVGEHML ALVQALEPFA SDSEALGLAN EVMGGVHGVA MQPWREFLSA SGTMGSEDVI KSLMNGKNLD NFVVASAALG EEEGTEEEES EANKFCNVWL DVVSTAVTGR LLERIMRIPS LTPKGCEHLN TDLNYLVNVF SALGVRGHPH PLLGHLAELA LVEADKNDFP Q
|
| |