Gene Information |
Locus tag | PHATRDRAFT_48157 |
Symbol | |
ID | 7203497 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | - |
Start bp | 377108 |
End bp | 378965 |
Gene Length | 1858 bp |
Protein Length | 491 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182672 |
Protein GI | 219124776 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
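The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, which is consistent with the listed gene length; the helper name `feature_length` is illustrative and is not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
gene_length = feature_length(377108, 378965)
print(gene_length)  # 1858, matching the "Gene Length" field

# A 491 aa protein needs 491 codons plus one stop codon.
min_coding_bp = (491 + 1) * 3
print(min_coding_bp)  # 1476

# Bases in the gene span that are not accounted for by coding sequence.
print(gene_length - min_coding_bp)  # 382
```

Because the record marks the gene as spliced, the 382 bp difference is plausibly intronic or otherwise non-coding sequence within the gene span, although the record itself does not list exon boundaries.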
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAAAACAAAC CGCACGAACA ACGCAACATA CTCACCTACA CGAATTCAGA ATTGGGAAAC GGGCAATAGA GTCGGTTGAT CTGACATATA TCTGATCCTT ACCGTTTGAT ACTCCGCTCA CACACACTCT ACTGATTAAG TCCAATCTCG GTTGCACCCT AGAATGAGCT ACCGTCTGGC GCATCTCTCT TCCTCTCGAT TCACTTCCCC GTCGTTGGTA GCACAAAGAC GAGTACGTGG GCTCGAGGCA GCTAATTCAA GCTGGGCAAC CCATCAATCA GTGAAAGTGT GGTTTTCCTC GAACTCTGTA TCTTCATTGT CACTTTACAT CCACCGTCCA GTGAATTCAC GATGCACTGA AGTAATCGTC GCCAGCGCGC TGGCTCTCTT AACTACTAGC GTCTTAGTCT TGGCGTTTCC CACAACTACT GCTACCTCTG AAGATCCCCA TAACCACACT ATTCCTGCAC CAAACAGTCC TGCTCAACTC GTTGTCTCTT CAATTTGGTC TGAACCCACA AAGTTGGTAT CGCACGGCGG CGGAGGTAAC GATGGGGACG GCAAAGGGAC GAACGGTATC TTGGGCTCGC TCGTGTTCGC CGATGTCGCG CACTCGATCG GCGAGCATTG CCCCTCGAAA TCACCGAACC GGAAGGCTTT GTCTATATCT GCTGCTGCCA CTAGTGCTAG TACCACCGAA AATCCAGCGA GTTCTATAGA AACAGCTACC GTAGTACAAT TGGGTGGCAA CTCTGTTACG AGAGGAAGTG TGCGTCAGAA ACCGTACGAT GTAAGTGAAT TTGCGTCCCA GAGAATGGCT TTTCGTGTAT TCATTTATTC ACTGAGTTGC TGACCTCTTC GTTATTCTGT GCGGTTGTTT CTTGTCCGGA CCCTGCAGGT TTCGGTCCGC GCACTTCAAG GAGGCCGCAT GACCATGGAA GACGAATACG TAGTAGCCAA CGGCGGCCGT TTTGCTGGTG TCTTTGACGG ACACGGGGGT GGAGGAGTCA GCCAGCGGCT ACGGGTTAAT TTGTACAACA AAACTTGCGC CGCCCTCGCA CGCAAACAAC ACGAATTGAC CGATGCGAGT TCGGTGCTTT CGCACGTGGC AGCCTTACGG GATGCCTTCG ACGAAATGGA GCAGGATGTT CTGGAAGATG ATGGCTTGCA ATATCAAGGA AGCACAGCTG TGGTGGTGGT CGTACATGAA TCGGAAGAAG GAAAGCGAAC TTTGCTGTCG GCTAACGTTG GCGATAGTCG GGCTATCTTG TCGCGTAATC AAAACGCCGT TGATCTCACG CGAGACCACA AACCAAATGA TGATCGCGAG AAGGCGCGTA TCTTGGCCAT GGGCGAAACA ATTGAATGGG ATCTTATAAG CAAGGTGCAT CGAGTCCGAA ATCTGAGTCT TAGTCGAGCG ATCGGCGATC GGTACGCCAA ACCCATCGTA TCTGGGCAAG TCGAAATTCA ACACTATCCT GTGCAGGAAC AAGACGATGA ATTCTTTTTA CTCGCTTCGG ATGGGTTGTG GGATGTCATG ACGAGTCAGG ATGTTATCTC TTATGTGCAT AGACAGATGG AACAGGAATT AGATCGAGAG AGCTTACACA AGGATGATCG CGAGAACTAC AAACTGGTAC TCCGGAGGAA TATGGCGAAG TTCGTCGCCC GCGAAGCGAT GCGCCGTGGA TCAGCCGACA ATGTTTGCGT TCTCATGGTG TGGCTGAATG ATATGGGGTT GCGATGAATT TAGATTGTGA ACCTCCTTAT TTATGCACTT GCATGTAAAC TGAGTGAGTG 
CTGCTATGCT TTTGCGATCA CTAGTAATAA TATTCGATGT ATACTAAAGA GAGCGATC
|
Protein sequence | MSYRLAHLSS SRFTSPSLVA QRRVRGLEAA NSSWATHQSV KVWFSSNSVS SLSLYIHRPV NSRCTEVIVA SALALLTTSV LVLAFPTTTA TSEDPHNHTI PAPNSPAQLV VSSIWSEPTK LVSHGGGGND GDGKGTNGIL GSLVFADVAH SIGEHCPSKS PNRKALSISA AATSASTTEN PASSIETATV VQLGGNSVTR GSVRQKPYDV SVRALQGGRM TMEDEYVVAN GGRFAGVFDG HGGGGVSQRL RVNLYNKTCA ALARKQHELT DASSVLSHVA ALRDAFDEME QDVLEDDGLQ YQGSTAVVVV VHESEEGKRT LLSANVGDSR AILSRNQNAV DLTRDHKPND DREKARILAM GETIEWDLIS KVHRVRNLSL SRAIGDRYAK PIVSGQVEIQ HYPVQEQDDE FFLLASDGLW DVMTSQDVIS YVHRQMEQEL DRESLHKDDR ENYKLVLRRN MAKFVAREAM RRGSADNVCV LMVWLNDMGL R
|
| |
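The sequences above are printed in space-separated blocks of ten characters, so they need light cleaning before any length or composition check. The following is a minimal sketch, assuming the sequence text has already been copied out of the record as a string; the function names `clean_sequence` and `gc_percent` are hypothetical helpers, and the sample strings are short fragments taken from the record rather than the full sequences.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Two ten-base blocks from the gene sequence (contains the ATG start codon).
dna_fragment = clean_sequence("AGAATGAGCT ACCGTCTGGC")
print(len(dna_fragment))         # 20
print(gc_percent(dna_fragment))  # 55.0

# First two ten-residue blocks of the protein sequence.
protein_fragment = clean_sequence("MSYRLAHLSS SRFTSPSLVA")
print(len(protein_fragment))     # 20
```

Applied to the full gene and protein sequences, `len(clean_sequence(...))` can be compared against the "Gene Length" and "Protein Length" fields, and `gc_percent` against the rounded "GC content" field.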