Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45376 |
Symbol | |
ID | 7200001 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | + |
Start bp | 996423 |
End bp | 998238 |
Gene Length | 1816 bp |
Protein Length | 490 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179333 |
Protein GI | 219117077 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.88052 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AATCTTGTCA CATCTCCCGT GCCCCGTCTG GAACGAGACG AATCCAAACA ACGAACTCCA ATATTTCCAT ACGTAAATCT ATACAAAAAA GAAGACGAGA CAGTCCCATT CGATTCCACA CTCTCACGAC TTCTCTACGC CTTGTTCGAC TGCCGGAATG GGTCAACAAC CCGGAGGAAT TCTGCTCTTG CTCGTGTGGC TTTCCGTGTC CGTTCGTGGT AGCTTGGCCT TTCGACCGCC GACTCCCGTG CGTCCGTTTC CCACGCGGTG TAGCGTCGCG TCCAGTGACA ACAACGACAG TGCCGAGCGT TCCACAACCG CAGCCACGAG TACAGCGTCC GTGGTGAGTG ACAAACGTCA CCCAGACGAC AACACAACGA CATCGACGAC GACGAGGTTC CTTCGGATGC CATGGAGCTG GGATTGGATG CGAAACAACT CGCATTGCTC GAAGAAGAGC AACAAAAGTT CCTAGCCACG TGCAAAGTAC TCTGCGAACA ACGAAACTTT CCCCTCGAAA AGGTCAAGAA TGCCCGTGAT CTTTCCTCCG TACAGAATTC TCCCGTACAG CGCAATCGGA TCCTCCGTAT GGGTCGCGTC TCCGACGCGT CTCAAGACGA TATTCAGCTA TTGTTCCATC AACTCAATAT TACCACGCTC ATTGATACCC GTTCACCGAC CGAACTCAAA GACGATACTA CGCTGCTCCG GGAACAAGTC TTTGGAAACT TCACCAACAT GATGTGGCAG GAGCAAGGTC GTGGGAGAGA TGGATGTGTC AAGGAACTCG AACCGGGTCA ACAACCGGTG CGACCGCGAT CTCTCAAGCG CTTTTGGGAA AAGGACCAAG GCGTTGCTGC GGCTACTACT GCCATCAGCG AGACCGAAAT CTCCCCGTCG GTCTCCACCG AAATAATCGC TCAAGAGCAT CTGGAAGCCG AGCTCGAAAT CAGTGCCGAA GAGGACGTCT GTGGACTCGA TTGCGACGAA GATGACCCCG AAACGATGCA AGAATCGACC GTCCTGCGGA CCTACCAAGG CAACCGCAAA GAGCGTCATT TCGTTTCGCT CATGAACGAA TTCAAGTACG TCAAGGGAAC GGTCGGTAAA TTGCGCAAGC GTGACCTGGC CAAGGCCGCC CTGCGATCAC CCAGTGCCAT TTTTTCGAAA AAAGCCCGGA CAGCCGTCAA GAAGCCATTT CTGGACGAAA TCAACGACGG AGGTTTGCCC ATGCTGAACG AATTGCTGTT GCGCTTTGGC GCTCCCGGCA TCAAGTACGT GCTGGAACTG TGCGCGGATC GCACCCGGCA TCCAGTGGCA TTTTACTGCA CGGCCGGCAA GGATCGGACC GGTATGCTGA CGGCTATCAT TCTCGCCTTG TGTGGAACCA AAGCGGAGGA TATTGTGGAA GACTATTCCT TGTCGGCAAA TGTGTACGCG GAAATGAACG ATCATCAGGC CATGGTAGGA GCATTGTCAC AACGCAGCTT GGATCCCAAA ACTTTCTTGG GAGCACCACC GCAAGTTATG CGGGATACGC TCTTGGCGAT CGAAGAGAAT TACGGATCGG TGGAAGGATA CTGCACCTGG ATTGGATTTG GACCGGAAAA ACAACAAAAA TTAATTCAGG CGTGTACCCA ACCCGATGAC GAAGAATAGT AGATTGGCAG GGACCGCTCC GCGAATGATT TTTACATACA TCTCGGTCTC CGCGCTCATC CCGACATTCT TTGGTCGAGA GCTGGCAATC GACTCGCGTC AAGTACTGGA ACCCCTCTCG GCCAACCGAA ATTAAACACC ATACACACAA TACAAC
|
Protein sequence | MGQQPGGILL LLVWLSVSVR GSLAFRPPTP VRPFPTRCSV ASSDNNDSAE RSTTAATSTA RQHNDIDDDE VPSDAMELGL DAKQLALLEE EQQKFLATCK VLCEQRNFPL EKVKNARDLS SVQNSPVQRN RILRMGRVSD ASQDDIQLLF HQLNITTLID TRSPTELKDD TTLLREQVFG NFTNMMWQEQ GRGRDGCVKE LEPGQQPVRP RSLKRFWEKD QGVAAATTAI SETEISPSVS TEIIAQEHLE AELEISAEED VCGLDCDEDD PETMQESTVL RTYQGNRKER HFVSLMNEFK YVKGTVGKLR KRDLAKAALR SPSAIFSKKA RTAVKKPFLD EINDGGLPML NELLLRFGAP GIKYVLELCA DRTRHPVAFY CTAGKDRTGM LTAIILALCG TKAEDIVEDY SLSANVYAEM NDHQAMVGAL SQRSLDPKTF LGAPPQVMRD TLLAIEENYG SVEGYCTWIG FGPEKQQKLI QACTQPDDEE
|
| |