Gene Information |
Locus tag | PHATRDRAFT_39762 |
Symbol | |
ID | 7195340 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | + |
Start bp | 646549 |
End bp | 649544 |
Gene Length | 2996 bp |
Protein Length | 917 aa |
Translation table | |
GC content | 61% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183645 |
Protein GI | 219126817 |
COG category | |
COG ID | |
TIGRFAM ID | |
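The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the coding sequence includes one stop codon. Both are assumptions, not statements from this record. The function name `span_length` is hypothetical.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the Gene Information table.
START_BP = 646549
END_BP = 649544
PROTEIN_LENGTH_AA = 917

gene_length = span_length(START_BP, END_BP)

# Assumption: three bases per residue plus one stop codon.
coding_length = (PROTEIN_LENGTH_AA + 1) * 3

# Bases within the gene span that are not accounted for by the coding
# sequence; a non-zero value is consistent with "Is gene spliced | Yes".
non_coding_length = gene_length - coding_length

print(gene_length, coding_length, non_coding_length)
```

Under these assumptions the span works out to the 2996 bp listed as Gene Length, and the difference between the span and the coding length gives a rough estimate of how many bases fall in introns.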
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000929997 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTCCCCGA CCGCCGACTT CACCATTTCC GACTTTCCTC ACAAAGTCCT CGATCCCATC GCCACCGACA CCACCGCTCC CTCGTATGCG TCGCTTCTCC TGGCCCAACG CCAGCTCTCC GCCAACGCGT CCGCCATTCC CAGCCTTAAC GGCGGCGGGG CCCATGGTCA CATGGCCCTC ACGCTCACTG CCGAAGCGTA CGCCGAACTC TCCGACATCC CTTTTGTCAT CCCCGTTGCT CCCCCTGCCG ACCCTGAACC CGGCACCACG CAACCTCAAA TAACGGAGAA CAACCGACTC CACAAACGCG CTGTGGCCAT CCACAGCCTC TACGTGGCGG TCAACAACGC CCTTCGTCGC CAGATCCTCG ACGCTGTTCC TCGTGTCTAC GTTCGCGACC TGGAACACCC CCAGTTTGCT TACAGCCACG TTTCCTGTCG CGACCTCCTC GACCATCTCT GGCGCAACTT TGGTACCATC TCCGCTTCGG ACCTTAAGAA CAACATTCAG TCCATGTACA CCCCGTGGAA CCCAGCTGAC CCCATCGAGA CCATTTTTCA TCGCTTAACC GACGCCATCG CGTACTCGAC GGCGGGACGT GACCCCATCA CCGAGGCTGC CGCCGTTCGC GCCGGCTACG ATGTTCTCGA GCACTCCGGC CTTTTTCCTC GTGCCTGTGA AACCTGGCGC ACTGCCTTGC CGGCCACCCA TACGCTTGCC AATCTGCGCG CCGTCTTTAA GGTCGCCGAC ACTGACCGCA AGCGTACGGT TACCACCGGC TCCCTAGGCT ACGCCAACGT CCTTGCCACA GCTCCATTGG TTCTCCCGTC GCTTGCGCCC GACTCGCTCA GCCTTCCTTT TTCAGCCCTC TTGGTGTCAC ACTCCTCTGC TGCCCTCTCT GAGCGAACTT ATTGCTGGAC CCATGGGTCC AGCAATAACC GTCGGCACAC TAGTGCCACG TGCAAAAACA AGGCCCCTGG CCACCGCGAC GACGCGACGG CCACCAACAC CCTTGGCGGC TCCACCAAGG TTTGGACTGC CCCCAAGCCT CCTGAATAGG AAAGAGGGAC GGCTACGCCA ATGATTAAAA CTAGTAATAC CGATTCTCTC AATCATATTA CTAGTCTTAA CTCGTCTGTA GTCCCCTCCC CGCCTAGTAC CCACACCTCT GCCATTGCCG ACACCGGCTG CACAGGCCAC TACATTACGA TCAACTGCCC TCACACGCAC CGGCACCCAG CCAACCCCAG CCTCTCCGTC CGTGTCCCGA ATGGCTCTGT CCTCCGCTCC AGCCATGTTG CCACCCTGGA CCTCCCTGGT TTCTCCCCTG CCGCCTGCCA AGCCCACATT TTTCCTGGGC TCGCTTCCCA TCCGCTCCTC TCCATCGGTC AACTGTGCGA TGACGGCTGT ACGGCAACCT TCTCGGCCAC TCGCCTTGAC ATTCATCGCG ACGCCACCCT GCTGCTCTCT GGTGCCCGCT CCCCCCACAC TGGCCTCTGG CACCTTGATC TTACCCCTCC CAAGCCCCCT GCTACAGCCC ATGCTCTTGT TCCAACCACC CCCCTCGCCG ACCGCATTGC TTTTGTTCAC GCCTCGCTCT TCTCCCCGGC TCTCTCTACC TGGTGCCAGG CCCTCGACTC CGGCCATCTC GCGACTTTTC CAGACCTTTC CTCCCGCCAG GTCCGCAAGT ACCCACCCAG CTCCCCCGCG ATGATCAAAG GTCACCTCAA CCAACAACGC GCAAACCTGC GCTCCACCAA GCTTTCCCCT GTCTGTTCCC CTCTCTCGAC GGAACCCCCT 
GCCGTCGCTG TGCCCGACCT CGATCCTCCT GACGCCCACC CTGTTGCACG CACACACCAC GTCTTCGTTG CCCACCAACG GGTCACCGGG CAGATCTACA CCAACCAACC GGGCCGTTTC TTCACTCCCT CCAGTGCCGG ACACAACGAC ATGCTTGTCC TTTACGATTA CGATAGCAAC GCCATCCATG TTGAACTCAT GCGGAACAAG TCAGGACCCG AGATTCTTGC CGCCTACCAA CGTGCTCACA CCCTTTTTAC CCAGCGCGGC CTGCGTCCCC AACTTCAGCG CCTCGACAAC GAAGCCTCTA TAGCCCTCCA AGCCTTCATG ACCTTAGAGC AGGTCGACTT TCAGCTCGCA CCCCCCCCCC CCCCCATCTG CACCGTCGTA ATGCCGCCGA ACGGGCCATA CGCACCTTCA AGAACCACTT CATTGCTGGC CTCTGTACCA CAAACCCGGA TTTTCCCCTT CATCTTTGGG ACCGACTCCT CCCACAGGCC CTCATTACCC TCAATCTTCT TCGTCGCTCC CGCATCAATC CCAAGTTGTC CGCCCACGCA CAACCTTACG GTGCCTTTGA CTACAACCGC ACCCCGCTTG CTCCTCCCGG CACCCGCGTC TTAGTCCATG TCAAGCCCGC TGTTCGCGAA ACCTGGGCCC CCCATGCTGT CGAAGGTTGG TATCTCGGCC CCGCTCTCAA CCATTATCGC TGCCATCGCG TATGGATCAC GGAAACACGT GCCAAACGTG TTGCTGACAC CCTTTCCTGG TTCCCGACCC GCATTCCCAT GCCCGCCCTT TGTCCACCGA CCGCGCCCTG GCCGCCGCCC GTGACCTGGT CCATGCCCTC CAGAATCCTT CCCCGGCGTC TCCGTTCGCC CCCCTCGATG CCACCCAGCA CCAGGCACTC ACAGATCTTG CCACCCTCTT TGCCACTGTG GCCGCCCCAG CCGACGACAT CCCTGCACCC GCTCCCGTGC CTCCGGTCCG TCCCCCTGCC CCAGCAACTC CCCTTGCTCA GGTCCGTTTT GCCGTTCCTC TTGTCACGGC CGAACATGCC CCGGCACTTC CGAGGGTGCC CATTCCGGCC CCAGCACTTC CGAGGGTGCC CACCCTGGCC ACCTATCACT CTCGCACCGG CAACCCAGGC CGTCGCCGCC GCAAAGCACG CACACAACCG GCAACCCCAA CCCTAG
Protein sequence | MSPTADFTIS DFPHKVLDPI ATDTTAPSYA SLLLAQRQLS ANASAIPSLN GGGAHGHMAL TLTAEAYAEL SDIPFVIPVA PPADPEPGTT QPQITENNRL HKRAVAIHSL YVAVNNALRR QILDAVPRVY VRDLEHPQFA YSHVSCRDLL DHLWRNFGTI SASDLKNNIQ SMYTPWNPAD PIETIFHRLT DAIAYSTAGR DPITEAAAVR AGYDVLEHSG LFPRACETWR TALPATHTLA NLRAVFKVAD TDRKRTVTTG SLGYANVLAT APLVLPSLAP DSLSLPFSAL LVSHSSAALS ERTYCWTHGS SNNRRHTIPS PPSTHTSAIA DTGCTGHYIT INCPHTHRHP ANPSLSVRVP NGSVLRSSHV ATLDLPGFSP AACQAHIFPG LASHPLLSIG QLCDDGCTAT FSATRLDIHR DATLLLSGAR SPHTGLWHLD LTPPKPPATA HALVPTTPLA DRIAFVHASL FSPALSTWCQ ALDSGHLATF PDLSSRQVRK YPPSSPAMIK GHLNQQRANL RSTKLSPVCS PLSTEPPAVA VPDLDPPDAH PVARTHHVFV AHQRVTGQIY TNQPGRFFTP SSAGHNDMLV LYDYDSNAIH VELMRNKSGP EILAAYQRAH TLFTQRGLRP QLQRLDNEAS IALQAFMTLE QVDFQLAPPP PPICTVVMPP NGPYAPSRTT SLLASALITL NLLRRSRINP KLSAHAQPYG AFDYNRTPLA PPGTRVLVHV KPAVRETWAP HAVEGWYLGP ALNHYRCHRV WITETRAKRV ADTLSWFPTR IPMPALCPPT APWPPPVTWS MPSRILPRRL RSPPSMPPST RHSQILPPSL PLWPPQPTTS LHPLPCLRSV PLPQQLPLLR SVLPFLLSRP NMPRHFRGCP FRPQHFRGCP PWPPITLAPA TQAVAAAKHA HNRQPQP
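The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a block-formatted string could be cleaned up and its GC fraction computed is shown below. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence shown above.
example_blocks = "ATGTCCCCGA CCGCCGACTT"

print(clean_sequence(example_blocks))
print(f"{gc_fraction(example_blocks):.2f}")
```

Applied to the complete gene sequence, the same function gives a value that can be compared with the GC content field of the record (61%).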