Gene Information |
Locus tag | PHATRDRAFT_39978 |
Symbol | |
ID | 7195579 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | - |
Start bp | 503898 |
End bp | 505964 |
Gene Length | 2067 bp |
Protein Length | 688 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184008 |
Protein GI | 219127575 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAG AAAATAAGGA ATTTCGAGTC ACGCACAACC GTGAACGGTC CGCAACGCCG ACACCGACCA TGAACCCCTC TCATGGGCTG GGTTGGGCCG CGACAGACTC GCGTTGGCAA CGACCCCCGA CTCGTTCAGT CTTTGGAGTT GACACTGACA GTGCGGCCAT GGCGTCAAGA AAGCCTCTTG CACTGGGAAC TGCTTTTGTT GACGGCGGCT CCTTCGCGCA CCGTCGAGGA CTCCGACCCC GCTCGGAATC TCCGCGCCAT CATCACGCGC TACTGGCGGA ATCGTCGCCC TCGATGCACC GTCACGACGT CGACTATCGT ACCCACGACG ATAACGGTAA TCGCAACCGC AATTACGACA GTAACAACAA CAACAAGGTT CCAAGACCGG GAGATGGAGG ACAGACACAA TCTTCCTCCA GCCGCAACCT TAACAATCAA TCGATACGTC GATCCATTGT CGCACCAATC CCCGCCGTCT CTTTCAACGA CAACGCCGGG ATCTTCTACA ACGATTACGA TGGCGGTGAT GACGACAACC TGAGTAGAAG AAGTTACAAC ACAGCGACGA TGAACCTGCA CAAAACAACC CGCTTGCAAT CGTTTCTGCA TCTCCTTAAA GGCTACGTAG GCCCGGGCTG TCTCAGCCTA CCCTGGGCCG TCTCCCAGCT CGGCATTACG TCCGGTGTCA TTGCAACCTT TGTCATGGCT TACTGGAGCT CGTACAACTG CTGGACTGTT GTGCGCTTCA AACGCATCTG TCAGAATTCC AACCACTACG GTCCCTTGCC TTTGACGTAT CCGGACCTTG CTGGTTGGCT CTACGGACCC CGCTTCCAGC GTTTTACCAC AACTTGCATC TGCATTCAGC AACTCGCAAT TTGCACCGTC TTTCTCAGCT TTGTTGGTGC CAACTTGAGT GCCGTATTGG TGGCCGTTTG GTCCGTTCCG CTCACTCACG TGCAAGTCAT TTCGTGCTGC TTGCCCGCGG TCCTCGCTTT GTCCTTTCTG CCCAATCTCA AGGCACTGGC GCCGGCGACG GCGACCGGAG CGGCGTTTCT GGGCTTGGCT TTGCTCTGTT TGAGTACCGT CATTGGCCTC CAATGGAACG ATCGACCCCG GCACGAAGCT CTGTCCGTGG ATTGGACCAG TGTGCCCTTG GCTTTTTGTG CCATCTTGTA CAGTTACGAG GGCATTTGCC TCGTCCTTCC GGTGGAATCC AGTATGCAAC GGCCGGAACA CTTTCAAAGC ACCTTTGTGA CGGCCATGAT AGCTTCGGCT GTCGTCTTTG CCCTCGTGGC CTCATTCTGT GTGGCAGCTT TTGGGCCAGT GACGAACGGT TCCGTCACCG CCTTTTTGCT GGAAAAGTAT GCCGATCGGC GTCACTTGCA GGGATTGTTG CTAGCGGCCA ACGGATTCGT GAGTCTTTCC GTTCTGGTCA CGTATCCGTT GCAGCTATTT CCCGCTCTGG AGTTGGTGGG ACCCTGGTTT CGGCCTTGGG AGAGATGGGT GCAATCATGG GGATCGTCGA CGACCACATC AACCTCGACC AATATACAGA CCAACTTCAC ATCACTTACC AACACCGACG AGTCCGCATC GGACTCGCAC AATCAGTACA GCGCCGATGA CGTCCACGAC GAACGCATGG AACCGCTCGA TATTAGTCCC GTATCCAGTG TCGCCCGTTC GGCACTAGTG GAAGCGGCTC CGGAGGCCAG CCATTCCCCG GTTGCCAGAA TATCGCTAGT AATGCTCACA TACGTGGTCG CCGTGGCAGT TCCCAACGTA 
CAGATCCTCA TCTCGTTAGC GGGCGCCTTG GCCGGCTCGT CGACAGCCTT GCTCATTCCT CCCGCGCTCG AACTGGCGTA TTTGAAACAG TACGGCACGG AAAGTGATAC CATGTCGATA GGCATGGTTT CCCTGCGAGT ATACATTCTG TTGGCCTTGG GATTGATCTT CATGGGCATT GGGACTGGGG CGTCTTTGTT GGATATCTAC CGGGTCTATA CGCAAAGTGG CGAAGAAACG GGTTCCGACA GCGCGTCCTC CGTGTAA
|
Protein sequence | MKKENKEFRV THNRERSATP TPTMNPSHGL GWAATDSRWQ RPPTRSVFGV DTDSAAMASR KPLALGTAFV DGGSFAHRRG LRPRSESPRH HHALLAESSP SMHRHDVDYR THDDNGNRNR NYDSNNNNKV PRPGDGGQTQ SSSSRNLNNQ SIRRSIVAPI PAVSFNDNAG IFYNDYDGGD DDNLSRRSYN TATMNLHKTT RLQSFLHLLK GYVGPGCLSL PWAVSQLGIT SGVIATFVMA YWSSYNCWTV VRFKRICQNS NHYGPLPLTY PDLAGWLYGP RFQRFTTTCI CIQQLAICTV FLSFVGANLS AVLVAVWSVP LTHVQVISCC LPAVLALSFL PNLKALAPAT ATGAAFLGLA LLCLSTVIGL QWNDRPRHEA LSVDWTSVPL AFCAILYSYE GICLVLPVES SMQRPEHFQS TFVTAMIASA VVFALVASFC VAAFGPVTNG SVTAFLLEKY ADRRHLQGLL LAANGFVSLS VLVTYPLQLF PALELVGPWF RPWERWVQSW GSSTTTSTST NIQTNFTSLT NTDESASDSH NQYSADDVHD ERMEPLDISP VSSVARSALV EAAPEASHSP VARISLVMLT YVVAVAVPNV QILISLAGAL AGSSTALLIP PALELAYLKQ YGTESDTMSI GMVSLRVYIL LALGLIFMGI GTGASLLDIY RVYTQSGEET GSDSASSV
|
| |
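The coordinates and lengths listed under Gene Information can be cross-checked against each other. The sketch below is only an illustration, not part of the original record: the helper names `gene_length` and `expected_protein_length` are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon (the listed gene sequence ends in TAA).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


# Values copied from the record above (locus tag PHATRDRAFT_39978).
start_bp, end_bp = 503898, 505964

length_bp = gene_length(start_bp, end_bp)
protein_aa = expected_protein_length(length_bp)

print(f"Gene length: {length_bp} bp")      # record lists 2067 bp
print(f"Protein length: {protein_aa} aa")  # record lists 688 aa
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields. The strand (-) does not affect the length calculation; it only determines which strand of the replicon the coding sequence is read from.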