Gene Information |
Locus tag | PHATRDRAFT_50022 |
Symbol | |
ID | 7198721 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | - |
Start bp | 170792 |
End bp | 172357 |
Gene Length | 1566 bp |
Protein Length | 362 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184907 |
Protein GI | 219129461 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.144811 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGCGT TCGGGGAATT CAAACGTCTT TTTGCGATTT GCGGTAAGTA ATTTTCGTTG CTCCGTAGGA AAATGGCGAG ACGAATTCGC CTTTCGCGTT GGCATTGGCG AGGGCGCGAG TCCGGCACAA CGTCCGTCCT TCTATCGTGT CGAAACGCTG GAAGCGGATC ACAGGAACGA CTCGTTTTTC GTCTATGTAA GTTTCCGTGC TCGATACACT ATCACCGGTA CACGACGCCA ACTGCAGTGA CGCCAATCGG ATGCGTTCTA CGTAGTACGC ACTGCGACAA GAAATGCTCC AATCCAAACC TCCCTACCTC CTCCACCCGA ATTTCCATTC GAGAATTCTG GTGCTTCTTT CCGACGCTCG GAGGCGGACC AAACGGCCGT AAGACTCACC ACAACAAAAC AACGCACAAT CGAATCTCTT GGTCCTTCCA CGAGGTTTCT TTGTGCACAA CAGTGAGTTC ATCAACGTAC CAGTAAATCG CAGTCCTTAT CCCCAAAGCC CGTGCCACTG CTGCCGTGCT TTTATCTCTA CGACTGTGGA GGATCTAACT AGAGGCAGTA GCTGACTGTG AGAACGCCAG CAATCCCCCC TTGCAGTAGT CCGCCATTGA CCTACGATTC GCTCGGTACT GCGTACATAC ATTCATCCAT CCAGACGTAC TTTTATACCA AAGCATTCTC TCGCACAACA CCGTTCCAAT GCCCGCCTTT CGACCCTTGG CCTCCACCCG CATGTTGCTT ACGCACGTGG GTGTTGGAAT GGGAGCAGCG TCCTTTTGGA GAGGCGCATG GTACGTGTTG GACGATCACC TTTTCCCAGA AAACGCCACA CACTCGGCGG CAGCCTCACT CGTGCTCGGC GTTGTGGGCA TGGGAGCTTC GCAGGGACTC GTAGCCCGTG CCGAAGCCTT GTCACAAAAG ACACCGAAAC GGAAACTGCC CGTGGCGGCG GCGCGTTTCG GGGCACTCTA TACCGTGGCC GTCTCGTGTG TGTTGGTCTG GCGGGGAACC TGGGTGGGTT GGGATTGCCT TTACGAACGC TTGCATCCCC ATCCCGATAC CAAGTCGACC GATCCCGGAC ACGCGACTCA CTCCGGAATG CTGTCGCACG TGGTGAGTGT CACGCTACTC CTCGCTACAG GTTTGTTTGC CTCTGTCTTG GCTCCGCCCG CAGCCGTAAG TGTCATTCGC GACTGGTCGA TCCACTCGGG GAGTCGAGCC TACTCCGGAC CGGCACAATC GGTTTTCAAC AAGCTTTTCC CATCGTCGTC GTCCTCATCA TCGGTTCAAA CGGCTGGAGG AAACGGCTTT AGTCCAAGCC GAGCATTCCT GTCGACAACC TCGAACCGAT TGTCCGCACG GGGTCAGCAT CCCCATCCGT CGAGTCTCCT GCGGACAGAG GGTTCACGAA CAAGCAAAGT GCACCGAACA ACTTACACAT CGAGTACGCG GTGAATGTAT GCGTTACCGC CGTCGCGGCC TAACTTTCTT TGTATCCCGA TCCACACGCC AACGCTACCA GCTAGTCGAA TGTAAACTAG AAAATACAAT TTTGTG
|
Protein sequence | MEAFGEFKRL FAICGKWRDE FAFRVGIGEG ASPAQRPSFY RVETLEADHR NDSFFVYKCS NPNLPTSSTR ISIREFWCFF PTLGGGPNGR KTHHNKTTHN RISWSFHEVS LCTTTYFYTK AFSRTTPFQC PPFDPWPPPA CCLRTWVLEW EQRPFGEAHA SLVLGVVGMG ASQGLVARAE ALSQKTPKRK LPVAAARFGA LYTVAVSCVL VWRGTWVGWD CLYERLHPHP DTKSTDPGHA THSGMLSHVV SVTLLLATGL FASVLAPPAA VSVIRDWSIH SGSRAYSGPA QSVFNKLFPS SSSSSSVQTA GGNGFSPSRA FLSTTSNRLS ARGQHPHPSS LLRTEGSRTS KVHRTTYTSS TR