Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48206 |
Symbol | |
ID | 7203522 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | + |
Start bp | 509981 |
End bp | 512365 |
Gene Length | 2385 bp |
Protein Length | 794 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182548 |
Protein GI | 219124517 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.32747 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTATTC CCAATCTACT GCAGGGTCTC AAGTTTGCCG TCAAAAAGGG CAACATCCGA GACTATTCAG ATCAAGCCGT GGCGGTCGAC GCGTCTTCCT GGTTCCACAA GTCGGTCTAC GCCATTGCCG ATCACTACGT AGAAGTTCTC GAGCGTACGG GACGTGCCGA TGCCCGTTCC ATAGCCGCCG CTACACAATA CGTGAACAAA CGTTGTCACG AAATTCTCAC CTACGCCCGT ATCCGCAAAA TCTATCTCGT CATGGACGGG GCCCGCTGCC CGCTGAAGGT CGTTACCAAC GATGATCGCG AGCGACGACG GCAAGAGAAT TTGGCCGAAG CACGGGTCTT TCGTCAACAA AAGCGACCGG ACAAAATGTA CGAAAAATAC AAGGCCTGTA TCAAGGTCAA GGCGGACTTG GCTGCGGCCG TGGCACAAAA TATCGCTAGC GCGTTTCCTG GGAAAGTGGA GCTCGTTTGG GCACCTTACG AGGCGGATGC GCAACTAGTC AAGCTGGCAA TGAACGGCAC CGTACAGGCG ATCATTACTG AGGACTCCGA TGTCTTAGTG TACGCCGCAA CTTGTGAAAC GACGGTTTCG GTACTCTTTA AACTGGATCG GAACACGGGG AGCTGCGATA TAATTTCCAT GGCGTGGTTG CTAGATCCTA CCGAAACTTT GGTAAACCCC TCCAAAGCTA ACCCGAAAAA AGCGTCGGGG ATCGAACAGA TTGTGGATGC CTTTGTTAGC CGTCAGTTGC GCGATCCGGG ACGGGGAGTT CGTCTCTTTG TACAAGCGTG TATTCTGACA GGCTGCGACT ATTCGCCAAA TCAGCTCTCC GGTGTGGGAT TTGTGAACGC CTTTAAGCAC GTTCAGAGCG CCATGCACAA AGACTCGAAA GATCGGTTTC GACACGTACT AAAGATGCTT CCACGCAAAG CGAAGGACCA TCTGGATCCA GTCGTGTACG AAGAGCTTCT GGCGCAGAGT GAATCCGTCT TTTACTACCA CCCGGTCCGA GAGCCCGACG GCCGCGTGGT CTTTCTTCGA GAGCCGGACA CAGCCAATGA GCATTGGCCC TCTCTGGATC GATTCAACGG TAATCTTTCG TTTCTTGGAG AAATTCGAAA TGCATCCGAC GGGACGATGC AAGTGCTGCA TCCAAAGGAA CATGAAGTCG CATTTCCGCA GCCATCAGTC AATGCTCCCG CCGATCGAGT CTGTCGACCG GCGTCTTCTT TTTTCACGAA GAATCCAAAC ACTCGAGGTG GCAAACCTGT TAAGGTCTCC AACCCGTACC AGCAGGCCGA AAAGCGGCCG CGAACAGAGA ACCGAGCACC ACTGCAACCG AAAAGCCCAA ATGAGAAGAC TACCAAAAGG AGCCGCAAAC CATTTTCAAA ACTCTGTGCC AGCAAAGAGA ATACCGATCG TCTACAACAG CATTTTGGTA GCTCCAAAAA CGATGTCCGC TTTGTTTTGC CATCGTTTAC ATCCGAGGGA GCTCGAGTTC CCCCTCGCAC TCTTTTTTCG GCACTACGCA AACCGAAAAA GGTTACATCG GGGGCTGCTC CTTTTCAAAT CAACGATGAA AATAACGATG AGAATGCAAT CTGTAACAAA AATCCTATTC CCCAGAACTC TCCGATTTGC GTCCCACCTG CACTTCCAGA AGACACGCGA CAATTTCTGA AGTTGTCGGC CGAAGACTCT CAACGGCGGA AAGTATCGTA CTCGCCCCAA AACACAGAAG GAAAGGTAGC GGTCTGCAAA ACTTTCGTTC CCTCTCCACA ATTGGATAGT GAGGAAATGT ATCCGATCAA AAACCACGCC GAAAGCAAGT TTTTCCGTGA TGGGAAACGG TATGCTCGGC GAGTTACGCT AGAAGATTCT CCTTCACCCG ATTTTGTCGA GGAGCCAACG GACCATTGGC ATGTCGTCGG CAGCGCACAT GGCAACAGCT TGAATCTGGT TTCTGAAAAG TTAGCGCCTT CCTCCTTTGA CATGTATGAT GATTTCTTAT CCCCCAGCAA AGACACTGAT GGCGAGAATA TCACTGAGGA TTTGCCTCCC GAGACGACAT CCGTAGACCG AAAAAATCCT CTTTGTCGTC CGCTGGAGAC TTGCAAACCC AAGCACACGC TTCGTCCGGC TAAACACCTT CACTTTCGAT TCGGCGGAAA ATACCGGTCC ACTACGGCTC GGAGTAAGCT TGAAAAGGGG GCTTTGATGA AAGGATTTGC CCGACAGCGG CAGCTTGCCA CCGTTGACCT TTCTGTCACA TCGGTCTTGC AACGGGCACC GCTAAGTCCA AAACAGAAGA AGTTATCGCA ATTTGCTTTT CTCAATTCGC ATCGTTATTC AAAAGATGAA CGGAAAAACA TATAA
|
Protein sequence | MGIPNLLQGL KFAVKKGNIR DYSDQAVAVD ASSWFHKSVY AIADHYVEVL ERTGRADARS IAAATQYVNK RCHEILTYAR IRKIYLVMDG ARCPLKVVTN DDRERRRQEN LAEARVFRQQ KRPDKMYEKY KACIKVKADL AAAVAQNIAS AFPGKVELVW APYEADAQLV KLAMNGTVQA IITEDSDVLV YAATCETTVS VLFKLDRNTG SCDIISMAWL LDPTETLVNP SKANPKKASG IEQIVDAFVS RQLRDPGRGV RLFVQACILT GCDYSPNQLS GVGFVNAFKH VQSAMHKDSK DRFRHVLKML PRKAKDHLDP VVYEELLAQS ESVFYYHPVR EPDGRVVFLR EPDTANEHWP SLDRFNGNLS FLGEIRNASD GTMQVLHPKE HEVAFPQPSV NAPADRVCRP ASSFFTKNPN TRGGKPVKVS NPYQQAEKRP RTENRAPLQP KSPNEKTTKR SRKPFSKLCA SKENTDRLQQ HFGSSKNDVR FVLPSFTSEG ARVPPRTLFS ALRKPKKVTS GAAPFQINDE NNDENAICNK NPIPQNSPIC VPPALPEDTR QFLKLSAEDS QRRKVSYSPQ NTEGKVAVCK TFVPSPQLDS EEMYPIKNHA ESKFFRDGKR YARRVTLEDS PSPDFVEEPT DHWHVVGSAH GNSLNLVSEK LAPSSFDMYD DFLSPSKDTD GENITEDLPP ETTSVDRKNP LCRPLETCKP KHTLRPAKHL HFRFGGKYRS TTARSKLEKG ALMKGFARQR QLATVDLSVT SVLQRAPLSP KQKKLSQFAF LNSHRYSKDE RKNI
|
| |