Gene Information |
Locus tag | PHATRDRAFT_44671 |
Symbol | |
ID | 7197655 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 1223315 |
End bp | 1225557 |
Gene Length | 2243 bp |
Protein Length | 428 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178388 |
Protein GI | 219115185 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
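The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, which is consistent with the listed gene length of 2243 bp. It also assumes the coding sequence includes one stop codon. The function names are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


length = gene_length(1223315, 1225557)   # 2243, matches "Gene Length"
cds = coding_length(428)                 # 1287 nt for 428 aa plus a stop codon
non_coding = length - cds                # 956 nt within the span that is not coding

print(length, cds, non_coding)
```

Under these assumptions, roughly 956 bp of the 2243 bp span does not encode protein, which fits the record's statement that the gene is spliced. The record does not say how that remainder divides between introns and any other non-coding sequence.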
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.364916 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCTACGGCTA TTGTTGGTAC GCCATTTCCT ACTACCCACC TCGCTAATAA CCCAAACTTG CTGCCGCTTA CCCGTAATGC ACATTGAGAA AGGATCATTT TTGAATCCCA ATCTCTCTGT CAAAGCGTTC GGTACGAATC CTCGAGGCGG GAGGCGGTAA CAATTGTTTC TGTAGTCCGA AACGGTCAAC TTCCGGGAGG CGACAAGTGC CAAATAGAGA GACGGGGCTT CTCGTACCTC CTGTTGGTGA CCAAGACTGG TCGACGGTAC CTTTCGTGAG AGACCGAGTC TCGGGGGGAA ACCTTTTACG TTCGACATGA GGGTTACGCT TCTGGTAATC ACCATTGGTA TCGTGGTTTG TGCGTTCGGT AGCGTTGTCT CTCCCGTGGC GGCCTTCACA AGCTTGTCGC AGGGCAGCAC TCCGCGGCTG TCCCAATCGC CGCAAAGCCA ACCTTTCCCG CAGCGTCGTT GCGACACGAC CTTCTTACGG ACAACCCGCA CGACAGCGGA AACACCGTCG ACATTACGTC CGTTGCCTAC GACCTCCAGC CATCCAGTAT TCGCCGTCGT CCAGGACTTT CAACTCAAAA AGCCTTGGAA AGACGGTACC ACGCCCCTGC TCCAAACCCG TACCTGCAAT ATTGTTCTAG GCCAACTTAC GGATTCGCCG CGCCCGGACG CGGCCGGCGC TGCGGAACGC ATACTCCGGT ACATGCTGCA GAACGCGCAC GCCCCCGGGG CGCCCGACGT GGTCAGTTTG AATTCCGTCC TGGCGTGCTA CAGCAAGCTG GGCAGCGATA AGGATGCCCG AGGCCGACAG CAACAACGTG ATGCTGCGGA GCACTCGTAC AGGCTTTTGC GGGAATGGCA AGAACTGTAT AGCCAAGAAA TGGTCGCCGT CAACGCGGAT CGGATATCCT ACAATACCGT TTTGGCCGCC TTTTGTCGCT CCGGGCAAAC CAACCGGGCT GAGCAAGTCT TTGCCGAACT GCAGCGCCTT GGCCAAGCGA CTGACAACAA CGATGATGAC GATCAAAACA ACGCCGCTTA TCGCCCCGAC GTTGTCTCCT ACGCCACGCT CATTCGAGCA TACGCCATTC AGGGACAAGC CACTCCGGCG GAAGCATTGT TTCAACAAAT GAAACGACAG GGGCCACCAC CGTCCGTCGA ATGCTACAAC AACGTGCTCT ACGCCTATAG CAAATCCAGT CGTCCGGACC GCGCGGTCGA ATTTCTACAG TGGTGGTTGG ACGATGATGC GATCGACCTT GACGAGTCCG TTCAACCGGA CGTACGATCG TACAATATCG TGTTGCATGC ACTCGCACAA TCCAGTGACT TGCAGTCGGT TCACCGTGCG GAACGCTTAC TGGACAGCAT GCCCCAGAGA GATTCCGTCT CGTATACGAC CTATGTTGCC GCCTGCTGTC GTCTGTCCGG TCGTTCAGTC CTGGACGCCG TCCAGAGGGC GCTGGACAAG GCGTACGCCG ACGAACACGT GCACGTGGAC GCGGCCTTTC TTTCCAACAT TCTATACTCC TTAGCTACCT GTGACGAAAA GGACATGCCC GTGTTTGCTG AAGAGCTGGT GACCACCATG ACCACACAAC ACGGCGTCGA CGACAGCGTA CTGGCCGTCT ACAACGCACT CATTCACTGC TGGGCCAAGT CGGGAGAACG TGACGCCGTT CCGCGGGTAC TCGCCATTCT CCGGTATCTA GAAGCGCAGA CCACGGTCCA GCCCGATATC AAAACCTACA CCAATGTACT GGACTGTCTA GCTAAGTCTC GAGACCGGCA 
GAGTCATGTG GAAGCGGAGG CGTTGCTGCA GCGTATGGAA GAGTTGGGGC CGCCGCCAAA TGTACAAGCC TACACGTCTC TCATACAGAA TTTCTCGCGT TCCCGACTAC CGTACAAAGC CGTCAAGGCG TCGGAAATTT TGCAGCGCAT GAAGGCTTCG GCCAATCCGC TGGCGCGACC CAATGTAGTG ACATACAACG CCGTACTGAA TGCCGCCGAG CACACGGATA CCTCCGACAA GGTGGCCACG GAAGAAGCGT TAAAAGTTGC GTGCTTGACT TTTGACGAAA TTCGATCGTC CACGGTACGA CCGAACCACG TGACATACGG GACCTTCCTG GGCGTCCTTG CCAATCTCAT GCCGATCGAT TCCCGTCACG AAATTGTCAG CCTCGTTTTC CGAAGGTGCT GTCTGGAAGG ACAGGTCAGT CGGTTCGTTC TGA
|
Protein sequence | MRVTLLVITI GIVVCAFGSV VSPVAAFTSL SQGSTPRLSQ SPQSQPFPQR RCDTTFLRTT RTTAETPSTL RPLPTTSSHP VFAVVQDFQL KKPWKDGTTP LLQTRTCNIV LGQLTDSPRP DAAGAAERIL RYMLQNAHAP GAPDVVSLNS VLACYSKLGS DKDARGRQQQ RDAAEHSALD KAYADEHVHV DAAFLSNILY SLATCDEKDM PVFAEELVTT MTTQHGVDDS VLAVYNALIH CWAKSGERDA VPRVLAILRY LEAQTTVQPD IKTYTNVLDC LAKSRDRQSH VEAEALLQRM EELGPPPNVQ AYTSLIQNFS RSRLPYKAVK ASEILQRMKA SANPLARPNV VTYNAVLNAA EHTDTSDKVA TEEALKVACL TFDEIRSSTV RPNHVTYGTF LGVLANLMPI DSRHEIVSLV FRRSVGSF
|
| |
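The GC content field can in principle be recomputed from the gene sequence. The sketch below is one way to do that for a sequence printed in space-separated blocks of ten bases, as it is here. The helper name `gc_content` is hypothetical, and the record does not state how its 56% figure was rounded or whether it was computed over exactly this span.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string.

    Whitespace (such as the 10-base block separators used in this record)
    is ignored, and the comparison is case-insensitive.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First two 10-base blocks of the gene sequence in this record.
print(gc_content("CCTACGGCTA TTGTTGGTAC"))  # 50.0
```

Passing the full gene sequence to the same function gives a value to compare against the listed 56%.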