Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44270 |
Symbol | |
ID | 7197946 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 81356 |
End bp | 83279 |
Gene Length | 1924 bp |
Protein Length | 628 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178424 |
Protein GI | 219115257 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000597333 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAACA CTTGGTCGTC ACTCCTATTC ATCTCTCTCT TGTCCTGCTC AGTGGATGGA GAATCACAGT CTGGGTTGAT TGAGAAACAA ATCTTGACCA AGGAATCTAA GCCATTGGAA TGTGGTTTGT ATCTGGCCCC ATCTACAATC CCGGGCGCCG GGCTCGGTCT GTTTGTTGGC CATCAGGCTT GGGAAGAAGG GACCAAGGCT GGCTACGGTG ACATCTGCAT CCCTATTGTC AACCCACATT TGCATCGCAA TTATTCCAAG CCATTCTTTC ATCCCCTTGG GGACTACACA TGGAGCCATA TTGACGTCGG TATGACTTTA GAAGCCTATC CGGCGCTCCA TCTCGAAGCG TTCTGTCCCG GCATTGACGC CTTGGTCAAT TCGCACCAGT TCGTCCTGAA CTTGGGCTCG CCGAATAAAA GTCACGTGGC GCAGTACCTT CCCGCAGGGA TATTTCGCAC CAGCCCGAAC GCTGGTGCTG TTTCACCCTA CCGTAACTCC ACCACCCGGG TATCCCGACG CATTCCACCT GGTGGAGAGC TGTTCAAGTC CTACGGACCC ACTTGGTTCC CGAGCCGCGC CAAGGCTTTG GGCGATTCCT TTGCTTTTCC AGAAGACTTT AGACGTGGGC AAGCATTGTT AAATAGAATG GCCCCCCTCG TAGATCGGAT AAGCTCCACA GTCTCGGTGT CACCACTTTT TACAGACTTG TTCGACATAG TGCAGGCAGT AACAGGTACT TTTCAGTCGC GCATTCTTAA AACGCTACCA CGAGAGCCTC AACATGCGCA TGATGCGGGT GATGACGAAT CTGGTCTTCT TCGATTTCAC GAGGATATGT CGCGGCATTC TTTGGAATAT CTCACTTCTT ACGGACAGTG TGTGGATCAT GTCAGACCGG TGCCGTTGTC CCCTAAACAT CCCGAAGCGG GGCAAGGAGC TGTCGCAGCT CGCGATCTCC CACAGGGAAC CATCGTATCA ACGTCTCCAT TACACGTTTT TCCGGACTTG GACTACTTTA ACATGTACGT ACTCAAACAA AATGACAAGG GTACATGGGA ACGCGAAAGC GATCACGTAC ATACCAAGCA ACTGCTGCTC AACTATTGCT TCGGCCATCC CAACACCACT ATGACCTTGT GTCCATATGG TCCGGGAGTT AACTACATCA ATCATGATCG CAATCCCAAT GTCCGCATTC GCTGGTCTCT CAACCCATGG CACAACGCTA CAGCGGTTGC CGAGCGAACA GTAGAAAACT CGTTCGACTT CCAATTGAAG CTGGCATTTG ACTACGTTGC GCTACGAAAT ATTTCTGAGG GCGAAGAAAT TCTATTGGAT TACGGAGATG CCTGGGAGAC AGCCTGGCAA GAGCATAAAA GAACATACCA ACCGATTGAT GGGAAAATCA TCCAGCCCGA CTATGTTTAC AATCATCAGC TTCACATTCC CCTTCGTACA CGATCCGAAC AGGAGTTTGA TCCTTATCCG GAGCATCTCC ACACACGCTG CCATGTAGCT TTGCTTCAGC CGACATATCG ACAAAGTCAA TACAATTGGT TGGAAACACC CGAGTCTCCT TTGTTAGTAA ACTACGGACT ACCGTGCCAT GTGCTTCAGC GATACGCTTC CGAGATGGAT CCTAGTGAAT ACGCTTACAC GGTAGAGTTG GATTTGGAGA TACTTTCGGA CGAGCACAAG GCTGACCTTT CAATTCCGAA AAATGATCTC GTTCTGGAAC GACACCAAGT TCCGCGACGA GGTGTAGCCC AGTTTACTCG GCCTAACTTG TCCGACATGC ATTTGTCGAA CACGTTTCGC CATTCAATCG GTTTGCCGGA CAGCGTCTTG CCTCTACAAT GGAAAAATGC AATCTAAATT TTTTTATAGT ATCACCATAG ACACAAGCTG CTGT
|
Protein sequence | MKNTWSSLLF ISLLSCSVDG ESQSGLIEKQ ILTKESKPLE CGLYLAPSTI PGAGLGLFVG HQAWEEGTKA GYGDICIPIV NPHLHRNYSK PFFHPLGDYT WSHIDVGMTL EAYPALHLEA FCPGIDALVN SHQFVLNLGS PNKSHVAQYL PAGIFRTSPN AGAVSPYRNS TTRVSRRIPP GGELFKSYGP TWFPSRAKAL GDSFAFPEDF RRGQALLNRM APLVDRISST VSVSPLFTDL FDIVQAVTGT FQSRILKTLP REPQHAHDAG DDESGLLRFH EDMSRHSLEY LTSYGQCVDH VRPVPLSPKH PEAGQGAVAA RDLPQGTIVS TSPLHVFPDL DYFNMYVLKQ NDKGTWERES DHVHTKQLLL NYCFGHPNTT MTLCPYGPGV NYINHDRNPN VRIRWSLNPW HNATAVAERT VENSFDFQLK LAFDYVALRN ISEGEEILLD YGDAWETAWQ EHKRTYQPID GKIIQPDYVY NHQLHIPLRT RSEQEFDPYP EHLHTRCHVA LLQPTYRQSQ YNWLETPESP LLVNYGLPCH VLQRYASEMD PSEYAYTVEL DLEILSDEHK ADLSIPKNDL VLERHQVPRR GVAQFTRPNL SDMHLSNTFR HSIGLPDSVL PLQWKNAI
|
| |