Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48545 |
Symbol | |
ID | 7194782 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | + |
Start bp | 172326 |
End bp | 174177 |
Gene Length | 1852 bp |
Protein Length | 471 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183043 |
Protein GI | 219125556 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0458637 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGTGCCATTC GCCGATGAGG AAAATTGGAG AGACCTGCTC ACTGTCCGTC AGTCACCACC TCTTGTCGCT CTTCCTCTTT TGTACTACGA AGTAAAAAGT GAAACGCAAC CAAAAAGGAA GCCGGATACG CAACAACAAA AAGACCCTAC ACACAACACT GGTGGCTATA CTTACAGTTG ATTCCTATTG CTGTCAATTG AGCTGATACA CTAGTCAACT TGACAATCCA GACAGAAAGA AGCATAGATC GAAGGCGCCA GTGCGCCATT CCAGCTGCGC TCTGGCAACA GGTTACGTGT CTCGCCACTC ATTGGACGAC GTTCCGTCTC GACAGCTCGT CGACTGCGGT TAACAGCCGA TAACCATCTA GCGCGGAATA CTTTTTCGAC AATGTCAATG CAAATTAGTG GGCGGATACT CACTTTCACG GTACTGGTGC TCATTCCTAC TGTACTACAA CTCTACGGCA GCTGGCGCAC CATTCACCAA CTACAACAAA CCCTTTGCCT CGATCCGGAA GAATTTGGTT GTGGTGGTGC AATCCGTGAG ATTGCCACCG CAATCGATAG TCAAGCAGAA TTGGATGTTT CAACGAAAGA ACACACCGAT CTCTACGGTT TTGCTGTGCC GACAAAGGAG GAGTCGACGT CGTCCGCAGG CGCCCACATT ACCTCGCCTC GGCAGTGGCC GACGTCGGAA TCCGTCAACG TGAACACCAA CCGCACAATA TCTCGGCCTA TCTCCGGCAG CCCACCTACT CTTTTCTGGC ATGTAGGACC CCACAAAACG TCCACTACCG CCATTCAGAG TTTTCTGGCA GCCCACAAAA AAGTATTGCT GGAGAAAGAC AACATCGTAT TCCCTTGGAT GCTGCCCGGT CATTTCCGAG GCACCAAAAA CACAGCCAAT CTCGCGTTTT GTTTGTCAGG ACGAAAGGGA CCATGGGAAA TGAATTGTCA GCGAGTGCTA CAGTCCTTTC AACACTTTGT CGCAAGTGCA CTGGCAAATT CGAAGAATAT CGTACTTTCA GCGGAGGAGT TTGCTTTTTG CCACGAACCG CAAATTCGGC AGTTTGTTCA AGACTACTTT GCCGACTGGG AGATTCAAGT AATTGTTTTC CATCGGCGAT TTGACGAGTG GCTAACCAGT TTGCATTTTG AAATGAATCG CGACTTACCA TTTCGTGAAC GTTCAAACCT TGTTGACTTT TTGGAAAACC CAGGCACCTT AGCAAGCTTC GAATTTCACT ACAGCCACAA AGTTCGTGAA CGGTACCAAT CGGTCACAGA TCACGATGTG ACGATTGTTA ACTTTCACAC TGCGAAAGAG GGCCGGAGTC TGGTAGAACA ATTTACTTGT GACGGATTGC AAGGAATGGC ACCGCACACA TGTCGGGCTG CTCAAAGGTT TGTCTCCCGC AAAATTAATG GGGCCCATTC TTTGGACTCC GGCTTTCTGT TGGCTGAAGC GTTGGAGCAG AACATGGTGC CCTTGCTTGA TTTCGCCAAC CGGACACTCT CGAGTGACGC AGAGACGTCA TTGCTGGCAA GGATCGATCG CAAGCTTGAA ACGAGTACAG ATCTCCCGGT TCGCTGTTTG TCAGAGACGA CGCAAGACCA CGTCTGGAAT CGGACCAAAG AGTGGTTCCC GTTTGACCTA GAAAGGTCAA AGTCGTTATC AAAAGCCGAA AATGTGTCTC GGTCGAGAAC ATGCTGCTTG GATGTTTCGC GAGTATGCGC ACTTGACGAC TGGAAGGCGT TTTTCCGAGG ACTGCCTGTG AGTAGGAAAA GTGAGCAAAT CCAGTAGTCT ATGTTACAAA TTATCCCTAA TAATCAGGAG ACTTTTTCTT GC
|
Protein sequence | MSMQISGRIL TFTVLVLIPT VLQLYGSWRT IHQLQQTLCL DPEEFGCGGA IREIATAIDS QAELDVSTKE HTDLYGFAVP TKEESTSSAG AHITSPRQWP TSESVNVNTN RTISRPISGS PPTLFWHVGP HKTSTTAIQS FLAAHKKVLL EKDNIVFPWM LPGHFRGTKN TANLAFCLSG RKGPWEMNCQ RVLQSFQHFV ASALANSKNI VLSAEEFAFC HEPQIRQFVQ DYFADWEIQV IVFHRRFDEW LTSLHFEMNR DLPFRERSNL VDFLENPGTL ASFEFHYSHK VRERYQSVTD HDVTIVNFHT AKEGRSLVEQ FTCDGLQGMA PHTCRAAQRF VSRKINGAHS LDSGFLLAEA LEQNMVPLLD FANRTLSSDA ETSLLARIDR KLETSTDLPV RCLSETTQDH VWNRTKEWFP FDLERSKSLS KAENVSRSRT CCLDVSRVCA LDDWKAFFRG LPVSRKSEQI Q
|
| |