Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_56499 |
Symbol | |
ID | 7203300 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | + |
Start bp | 360164 |
End bp | 362800 |
Gene Length | 2637 bp |
Protein Length | 625 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182520 |
Protein GI | 219124458 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGACCC GAATCGTGTC GGCGACTGCA ATCCTTCCCC TCTTCCATAA CCTTGGTAGG AGAAGATTTG CAGAGCGCAA ACGGGTTGGT TTGATCTTGG CCCTTTTCTC GACGCCTCTA AGTAACTGTA ACGGTCGTCG ACGGGCCGTA TCCGCGTTCC TCTCTCCGAC CAACACATTA TCGTCGACCA AAACGCCATC TAAATCCTTT CTTTACTATT CTACAAAGAG TCGGACTTCC AGATCGGTCA CTGAATTGCT ACGATTACGT GCGTCGTCAC TGTGTTTGCG ATCCCCAAAG TCCACGAGGA CTCTTAGTGT CATGTCTTTG GGTGCCGTAT CGACCGAGTC GTCGGAACGA CTCGTCACTA CCGTAACGGA TCACGCCCGC ACCATGGCCG ACGCGGCGAT TCATTCCGTC GACCCCGTTA CGGCTGTGCG CGATCACGTC CGAAAATTGG TGGATCTATC ATCTACCGCT GCTGCCAACC ACACATCCAA ACCCGGTACG AAAGCTACAC TCTTGCACAT TGGGATCGAC CCACACAACA TGGTAAACCT CTCTCTGTCG GACTACGATC ATATCCTCGT TGTAGCCTTT GGAAAGGCGT CCTCTGCCAT GGCTACTGCC TTGTTGGAAC GTCTTACCGA GGGCCAACCA GCGACAAACC AATTGCCCTC GATCTCCGGC CTCGTGATTG TCAAGGACGG GCACGCCACA CCACAACAAC TCGAGATACT GCAACAATCA CGGTACAACA TTTCAGTCCG AGAAGCCTCC CATCCCGTTC CGGACCAACG GGGTGTGGAC GCTTCTCGCA AATTGCTCGA CCTGGTACAC ACATATGCTT CACCACGAAC CCTGGTATTC GCACTCTTGA GTGGCGGTGG GTCGGCCTTA TTTTGTGCCC CGCACGAATC GCTCACTCTG CTGGATTTGC AGCAAACAAA CCAGGCTTTG TTGCAGTCCG GTTGGTCCAT TACCGACATG AACGTGGTAC GCAAACGTCT CGAAACGGGC AAGGGTGGAC GCCTCGCGGC GGCGGCCCAT CCCGGAACCG TGGTAAGTCT AATATTGTCC GACGTATTGG GCGATCCACT CGACCTCATC GCCAGCGGTC CCACCGTACC GGACACCAGT ACCTGGTCCG ATGCTTGGGC CTTGGCTGAA ACGCTCCCCG AAAAGGCCTT GCCCGATGCT GTGCGACGAT TGATGCGTGC GGGGGTCGAC GGGCACTTGC CGGATTCACC CTCCCCGTCA CACGGCGTTT TTGCCCGAGC CGTGACGTGT CTCGTGGGCA ATAACGCCAA GGCCGTAACG GCAGCTGCCA CTACCGCGCA ACGCCTTGGA TATCACCCCG TAATTCTGGG GACGCGAACG GAAGGTGAGG CCCGGCAGGT TGCACGATGG CTAGTACAGC TCGCCCAACA CTTGGCTCTA CCCGAAACGC CATCCAAACA ATTTTCCTTG GCCTCATTGC CAGCGGCGTT GATTTGCGGC GGCGAAACGA CCGTCACTTT GCCCGAACAG AGTCAAAAGC ATGGGAAGGG AGGTCGCAAT CAAGAATTGG CTCTAGCCGC CGCTTTGGAA CTACAACGCG TGGGTTTGAA CAGTAAAAAC GATGTTGTAG TCGTGGTTGC CAGTGTGGGA ACGGACGGAA CGGATGGTCC CACGGATGCA GCCGGTGCCA TTGTGGATGG CCACACAGTG GACAGATTGC CTGGCGACGC ATTGCTCGCT TTAGAAACGC ACAATGCGTA TCCGTATTTG GCGCAAACGG ATGCAAATGG CCGGTCCCCT TTACTTAAGG TACGTCGCTC GGCTGGACAA ACATACGTTT TACTGGAACA TACTAGGAAT GCATCGACGG TATCTGGTGG AATTTATGGA CTAGTAATTC TAACTTGTTG GATTCAATTC TTTTACCGTG ATAGACCGGC CCGACGGGAA CCAATGTGGC TGACGTTTAT CTTGTTCTGA TTCAGAAAAG TCGCCTGAAG TGACATACGC GATTCTCATT GATGCATCCT TTCTGCGCAT AACATAAACG AAGATCACGA AGCTTCGTTT GTACATTCTG TTTCTTTCTT GGCTTCCATG GATTGTAATT TGCTCTGAAT ACCGTCAATC TCAACCTTGG CGACCAAAGC ATTGATGTTG TCACCCCTAC ACCGCCACAT CGAAAGGCAC ATTTTGCGAT GATACTGATC ATCATATCCC TCCAATACTA CAACGAACGA TCCGACAGCA ATCGGTCGGA TAGCGCTGCT GAATTCGTCC GAAAAGATAG CCAAGGAAGT GGGATCCGTC TGCAAGCAGT TCACAAAATC GTCCTTACTA GCAATGAATT TGCGAGCCGA CATGTGCGGG GCAATAAAGT GTACTCCTTC TTGCGCAACT CGATATTTGC AGTCGCATTC TTTATTGTTC CGGACAAAAG CTTTGAGACC GGAATTAATC ATGGTCACAC GGTCCTGAAT ACCCAGATCA ATCAACGCGC GAACTTGAGC CCCAATGTAG TAAATGACCT TGGCGTCACC TCCTGCTCGA GTCATGAATT GGTCGCGGCA GAACTCGGGA CCCGTCAAGC CGTAATACTC AACAATTGGA TCTAAAACGC CACCCTCTAC GGGCACG
|
Protein sequence | MRTRIVSATA ILPLFHNLGR RRFAERKRVG LILALFSTPL SNCNGRRRAV SAFLSPTNTL SSTKTPSKSF LYYSTKSRTS RSVTELLRLR ASSLCLRSPK STRTLSVMSL GAVSTESSER LVTTVTDHAR TMADAAIHSV DPVTAVRDHV RKLVDLSSTA AANHTSKPGT KATLLHIGID PHNMVNLSLS DYDHILVVAF GKASSAMATA LLERLTEGQP ATNQLPSISG LVIVKDGHAT PQQLEILQQS RYNISVREAS HPVPDQRGVD ASRKLLDLVH TYASPRTLVF ALLSGGGSAL FCAPHESLTL LDLQQTNQAL LQSGWSITDM NVVRKRLETG KGGRLAAAAH PGTVVSLILS DVLGDPLDLI ASGPTVPDTS TWSDAWALAE TLPEKALPDA VRRLMRAGVD GHLPDSPSPS HGVFARAVTC LVGNNAKAVT AAATTAQRLG YHPVILGTRT EGEARQVARW LVQLAQHLAL PETPSKQFSL ASLPAALICG GETTVTLPEQ SQKHGKGGRN QELALAAALE LQRVGLNSKN DVVVVVASVG TDGTDGPTDA AGAIVDGHTV DRLPGDALLA LETHNAYPYL AQTDANGRSP LLKTGPTGTN VADVYLVLIQ KSRLK
|
| |