Gene Information |
Locus tag | PHATRDRAFT_48544 |
Symbol | |
ID | 7194781 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | + |
Start bp | 169205 |
End bp | 170593 |
Gene Length | 1389 bp |
Protein Length | 462 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183042 |
Protein GI | 219125554 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.593609 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATCGCA ATTGTGGGGC ATCCACTTTT ACAGTACTGG TGCTCATCCC TGCCGTGCTC TATCTATACG GCAGCTTACG TAGCATTCAC CAACTCCAAC GAACCCTTCG CCTTCTCACG ATCGGTCGTG GTGACATCAA AGAGAACACT TTTGCGACTA ACAAGCAAAG AGACTCGCAC GTTTTAAAGA AAGAGCAATC TGCTCTTGGT GGGGTCGCTT TGCCAACCAA TGAGGAGTCG ACGTCATCCA CGGGTATACA CATTACCACA CCTCGGCAAT GGCAGGCATC GGATTCTTTT GACGAAAACA ACAACACGAC AGCTGTGCCC GTATCCGGCA ACCCACCTAC ACTTTTCTGG CATGTAGGAC CCCACAAAAC GTCCACTACC GCCATTCAGA GGTTTCTGGC GGCCAACAAA TATTTATTGC GGGAGAAAGA TAACATTGAG TCTCCTTGGA TGATGCCGGG ACATTTCCAT GGTGCCGAAA TTGTAGTCAA TGTAGCAAGG TGTCTGTCTG GACGAAAAGG ACCTAAGGAT ATGAGTTGCC AACAAATGTT GCAGTCCTTT CAACACTTTG TTGCAGGCGC ACGGGCAGAT TCGAAGAACA TCGTACTGTC AGCCGAGGGA TTTTCTTTCT TCAACGAACT ACAAATTCAG CAGTTTGTTC AAGACTACTT TGCTGGTTGG GAAATTCGAG TGATTGTATT TTTTCGCCGT TTTGACGATT GGCTAGCCAG TTTGCATTTT CAAAAGAATC GCCACACACC ATTTCGTGAA CGTTCAAACA TTGTTGACTT TTTGGAAGCC CCAAGCATCT TTTCCACTGT CGAATTTCAT TATAGCCACA AAGTTCGTGA ACGGTACCAA TCGGTCACGG ATCACAATGT GACGATTGTT AACTTTCACA CGGCGGATGA GGGCCGGAGT CTGATAGAAC AATTCGTTTG TGACGGGTTG CAAGGAATGG CACCGCACAC ATGTCGGGCT GCTCAAAGGT TTGTCTCCAT CAAGATCAAT AAGTCACATT CTTTGGACTC TGGCTTTCTG TTGGCTGAAG CATTGGAGCA GAACATGCTG CCCTTGCTTG ATTTCGCCAA CCGGACACTC TCGAGTGACG AAGAGACGTC ATTACTGGCA AGGATCGATC GCAAGCTTGA AACGAGTACA GATCTCCCGG TTCGCTGTTT GTCAGAGACG GCGCAAGGCC ACGTCTGGAA TCGGACCAAA GAGTGGTTCC CGTTTGACCT AGAAAGGTCA AAGTCGTTAT CAAAAGCCGA AAATGTGTCT CGGTCGAGAA CATGCTGCTT GGATGTTTCG CGAGTATGCG CACTTGACGA CTGGAAGGCG TTTTTCCGAG GACTGCCTGT GAGTAGGGAA ATAGACTGA
|
Protein sequence | MHRNCGASTF TVLVLIPAVL YLYGSLRSIH QLQRTLRLLT IGRGDIKENT FATNKQRDSH VLKKEQSALG GVALPTNEES TSSTGIHITT PRQWQASDSF DENNNTTAVP VSGNPPTLFW HVGPHKTSTT AIQRFLAANK YLLREKDNIE SPWMMPGHFH GAEIVVNVAR CLSGRKGPKD MSCQQMLQSF QHFVAGARAD SKNIVLSAEG FSFFNELQIQ QFVQDYFAGW EIRVIVFFRR FDDWLASLHF QKNRHTPFRE RSNIVDFLEA PSIFSTVEFH YSHKVRERYQ SVTDHNVTIV NFHTADEGRS LIEQFVCDGL QGMAPHTCRA AQRFVSIKIN KSHSLDSGFL LAEALEQNML PLLDFANRTL SSDEETSLLA RIDRKLETST DLPVRCLSET AQGHVWNRTK EWFPFDLERS KSLSKAENVS RSRTCCLDVS RVCALDDWKA FFRGLPVSRE ID
|
| |
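The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon, as the "Is gene spliced: No" field and the trailing TGA in the gene sequence suggest. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """GC percentage of a sequence; spaces between 10-base blocks are ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    length = gene_length(169205, 170593)
    print(length)                           # 1389, matches Gene Length
    print(expected_protein_length(length))  # 462, matches Protein Length
```

Passing the Gene sequence field to `gc_percent` gives a value that can be compared with the listed GC content of 48%.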