Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48106 |
Symbol | |
ID | 7203271 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | - |
Start bp | 220225 |
End bp | 223236 |
Gene Length | 3012 bp |
Protein Length | 419 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182640 |
Protein GI | 219124709 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTGAGAGTG GCCAACATCA GTGGCAAAGT CTATACCCAG GCGTTCGATT TCTGAAAGAT ACATTTGTCG ATTCGCCTGA GCGAGAGAAT GGATTATCTG GGCTCTTTCA TCTGCTGCTG CATTCCCTCG AAAAGCAGCT GTGAGGAAGC GCCATCCGAA CCGGAGAAAG CTCAGTCTCT CCTCATGTGT CGCAGTTGGA CCCAAACAGC GAAATACATG AAGGGCAAAG TAGGGCGGAG CGACGCATAA AGTATCTATA CGTGGTCTCT CCGTTTCTCC ACTCATTGCG TTTCCCTTCT GTGTCAATAA ACCCTTCAAA TGTTCCAGTT GCATCGTGTA CCACTCGCGG GAAGTGTCCG AAAGGACTTT CCAGAGAACT GATACCCTCG AAAATATGTG CATTGAAGCT GCGGGTACAA TGCTGTTGGC GTTTCTGGCC GTTGCTGCGA CCCACTTAGG CTGCACTCGC GAGTCTTGGG GAATAGACCT ACCCATGGGG TCTCCTTCCA GAACAGTTAT TTCGCAATTG TTAATTGAAC GTAGGCGTTT GGTTAACGCG TGTGCAACAG ATACACCGGC TATCCCTCCA CCTATGATGC CGACTTTCAC TTTGGGATCT TTTTTGAAAT TTCCAACACA GGTCGTTTGA GATTCCGACT GCTCCCCATG CGAAGTAAAA TCCTTTTTGA GACTGGGGCT GTTCAAGTAA ACACAAGAAG CTGGTAGTTT GGCAGTGCAC ATCGGTTGTC TTATTCTTAG GCCAAACGGA TCGGTAAGTG GCAAGAAGGC CACCTGCGAC AGTACACTCA AATAATGGAA GCTTTTCATG ACGTTGCAAT AGTGCTCAGA GTTCATTTTG CTCTGAATGT CGGTTGCATG CTCGGCTTAG GCACCTCGGT GGCATGTGCT CTATAAGAAA AGCTCCTTCG TCTGCCACTC AAATCCTATC AACCGTGAAG AATACCTTTC TGGGTAGTTC CTTGTTGCGA TCACAGAGTG GATTCAGAAG AACTCGACCG ACGATTTTTA CCTGTTGTTG ACGCATCAAG AGCTCAGTCC TACGCATCAT GTGACTGAGT AGGGCATGGT AGACGATACA AACAAATCAA TCGTGACCAT CAAATGATTA TTACATGTGT TCGCATTTTA CATTTGAGTT CGCATCCTGG AAAGAAAGAG AAGTCCATCG CGTGAACCTT GACTGTGAGT TCCCAATTGA TTCTGTTGCC GCCATCGACG TCATCAATAT TTTGCTTCCA TAATCTTTGG CAATTGACTC AAAAGCTGCA GAACTCTGTC AGTCAACATT TCCTCGAGGA TCGTCATAGG CGACACCTAT TGTACAAGCG AGATCTCACC AGAGGCTTCT CAACTACAGA ATAACTGTAA AACTGCGACT TCGAAATAAA AAGAATACAA AACCGGCCGT CTTCTTACAC CAATGGTCTC TCCTTTGATT GAAAGACTTA CAACCTGCGT TGTGGGTGGA GGGAACTCGG CCCACGTTCT GGTCCCATTC CTATCGGAAG CTAGACACTC AGTAAACTTA CTGACCCGCC GCCCTCAGGA CTGGAATCAC GATTCCATAA CTTGTCAGCT AACAGACGGA ACTACGGGTC AAGTAGCTGC GACTCATGTG GGCATGCTGG CCGCTTGTTC CGCAAATCCG GCCGATGTCG TTCCCAACGC CGACATTGTC ATTCTCTGTA TGCCGGTGCA TTCCTACCGA GAGGCTTTGG ACCGCATCGC CCCTTATTTG AGCCGCAGCA AATCTCACGT TTATGTGGGA ACGGTATGTT CATCAAGGGT AACTAGGCGA TTTCCTGATC ACTTATATTC ACACCTTCCT CGTCCTTCCA GATGTACGGC CAAGCTGGAT TCAATTGGAT GGTCCATGCC ATGGAACGAG AGTTCGGCTT GACTAATATT GTCGCATTTG CCTGTGGAAG TATCCCCTGG GTTTGTCGGT ACGTCAATTT GTTTATGCGG CCAGCGATGG CTCTTACATC CAAAGATCTC CTCGCCTCTC AACATAATCT CTTCGTTGAC AGTACCGTGA AGTATGGAGA GCTGGTGGCC AACTATGGAG GGAAACACGT GAATGTGGCA GCCGTTACGC CCCACAGCCA ATTCGACAAG TTGAACCGTG TCCTTTTACA GAACCTGAGC GTAAGGCCGC TCGGTATGGG TGCATTCCGG CAAGCCGAAA GTTTTCTCTC CCTCACACTG TCCGTCGACA ATCAAATCAT TCATCCCGCC CGGTGCTACG GCTTATGGAA AAAGTATGGA GGATACTGGA AAGACGAGGC CCACGTACCG TACTTTTATC GTGATTTCGA CCACACATCC GAGACCATTC TGCAAGCGCT AGATCGCGAC TATGCTGCCG TTAGGAGTGC TGTTCGGAGG AGCTTCCCGT CGAAACAATT TCCGTACATG TTAGACTATT TCTCGTTGGA AAAGCTAAAT CACAATTCGT CCCATGCCGG AATCTTGGCA ACCTTTCGGG ACAGTCCACA ATTGGTGTGC ATCAAGACAC CAACCGTTCC CAGTGGCTCT GGCTCACAGA ATCAGTCAGC AGCGAGAGTT TTGGATACGA ATTGTCGCTT CTTTACTGAC GATATACCAT ACGGCCTGCT CGTGGCGAAA CGTTTAGCGG AGCTGTTGGA GCAACCGGTA CCTATGATTG ATGAAGTATT GTTGTGGGCC CAAACCTTGC GCGGAGAACA TTTTGTGCAC GAGAAGGATG GGAGCGTGAA CTTGGAATTT TGCTTGCAGA GACAAGGCAA GCTCGCCGTT TGTGGAATCC CCGAAACCTA CGGCATTACA AAAATTGAAG ACATGTTGGA TTGATGATAT ATCTGCTTTG AGAACGAAAG CGTCTGCTGG AGCCCTCTTT GTGTTCTCTA CCAGTAACTA TTCTCGAGTA TTGCACGTTT CACCGGGTAG TGAGTTTGTT CTTTGAGGCT TGGTCTACGG AAAGTACAAA GTCCAACATG ATATAGGACC TA
|
Protein sequence | MVSPLIERLT TCVVGGGNSA HVLVPFLSEA RHSVNLLTRR PQDWNHDSIT CQLTDGTTGQ VAATHVGMLA ACSANPADVV PNADIVILCM PVHSYREALD RIAPYLSRSK SHVYVGTMYG QAGFNWMVHA MEREFGLTNI VAFACGSIPW VCRTVKYGEL VANYGGKHVN VAAVTPHSQF DKLNRVLLQN LSVRPLGMGA FRQAESFLSL TLSVDNQIIH PARCYGLWKK YGGYWKDEAH VPYFYRDFDH TSETILQALD RDYAAVRSAV RRSFPSKQFP YMLDYFSLEK LNHNSSHAGI LATFRDSPQL VCIKTPTVPS GSGSQNQSAA RVLDTNCRFF TDDIPYGLLV AKRLAELLEQ PVPMIDEVLL WAQTLRGEHF VHEKDGSVNL EFCLQRQGKL AVCGIPETYG ITKIEDMLD
|
| |