Gene Information |
Locus tag | PHATRDRAFT_49216 |
Symbol | |
ID | 7195521 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | - |
Start bp | 251904 |
End bp | 253988 |
Gene Length | 2085 bp |
Protein Length | 694 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183956 |
Protein GI | 219127467 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.127578 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTCC TTAATCAAAC AATTTCCGCG TTACTTCTTT CCTCGAGTTT GCGTATCTTA GAAGCTGCCG AAGCCGAATT CAATGCTCTC ATTGCCATGC TGGAGGCTGA TGTGTTGGAA CTCGCCAGAA AAGTGGCGCT TCTCTACGAG AACCGCTGCA GCGAAGATGT TTTCACAGGG TGCGCTCGTA GCAATTACCA CGAGTGCATG TCGCTGTCCC CCAACCAGAC ATGTCCCGGT GGTAAAGATT ATAACGTCGC AAAGTGTGGC GATGGCATAT CCTGCAGCGG GCTCTTGGAC TTTTCCGTAT CAAATGTACG CTTGAACAGT AAGTTGGTGG ATTTCAATAG CGGCAACCCA CTCGATCCTA ACGTCATTGA TACAGTCTGC TTCACCCAGC AGCTGGACGA GTTCTTTATC AAGAAAAGGG AAGAGCGGAA GCCATACTGG GACAAGCTTG GGTTGCAAAC TCCACAGATG TATTTCGGGT CCCAAAATGG AGCCTTTCGA ATCTATCCGG CCAGGCATTC AGAAGAGTGC GGCCAATACG ATCCGACAGT CCGTGCCTGG AAAATTGCCG CCGATAGCGG TCCCAAGAAC GTCGTGCTTG TCCTTGATAC CAGTTCCAGT ATGGGAAATT ACAATCGTCT CGGACTTCTC CAGGATGCTG CCATACGAAT CGTAGAAACA CTGTCAGTTG GTGATCGCAT TGCCATTGTC CAGTTCTCTT CCCAAGCGAA GCCGTTCGAG AGCAAAGGAC AGACTTTTTT CTGGGCCACC AAAGAAAACA AGATTGCCCT GAAAACGTAC GTTGAAGACC TTGAGTTAAA TGAAGGAACG AACACTTTGG ACGCATTCAA TAAAACCTTC GCTGTCCTGG ACGACTCCAT TGATCAAGAA CTGCACAACG AATGTATAAC GGCAGTACTG TTCTTGACCG ACGGTGTAGT GTCGCCAGTG ATGAATGAAA CGAAGAGCGA AACAGAAACA AAAATCCTGG ACCTAGTCAC TGCCGGAATT TCCAATTTGG AAGCCCGAAC AAAGCAACCG GTATTTTTGT TCACTTTCAG TGTCTCCGAT AACAACTCTG TTCATGAGTT TCCCAAACGA CTTGCTTGCT CCACTGGCGA AAACGGCTTC TGGTCCAAAA TTGTCGATGC AGACAAGAGC TTTGACTCTC TCACCAGTTA CTACCGTTTG CTGGCCATCG CAATGAGCAG TGTCGAAAAC AGGAATTTTA CGGCATGGGT AGAGCCATAT AATTATGCTT TCAGCAATAT TCTCGGGACG ACAGTTTCGG CTCCCGTTTA CGACCAATCA GGGGTAGCGA TCGGCGTAGT GGGTGTCGAT ACTACAAACG CAGCGTTAGA CACTGTGCTC GGAGTCCCCA ACGGAAGCCA GGAGAGCATT CAGCGTATCG TGAGAAGGTC GTTCGCCAAG TGCCCAAACT TCAACCTCAC TTTGTGCCAG CGCGAGAGTT ACCGCCGCAG TGGTAGCGCT AAAGACGACG CGCTTTGCAC GTCAAGTTGC ACTGCTGATG ACTTTGTCGT GGTCAAACAA GAACCATGTG AGCGTACCAC GGCACGACCA AGTGAGCTCT TCGTTGAGAA CAAGAATCTC CAGGACGTAT CCTACGCGGA GCGCGGATGC TGCATTGTTG GAGAGAGTGA TGCTGCCTCG CCGGGGCAGT GCAAAGCCGT TGCCAACAAC GAAGAACCTT CAAATGATAC CAGCAATACA ACTGGAGTTG ATACCGGAAA CACAACTGGA GCTGATATCG GCAATACAAA TGGAGTTGAT 
ACCAGTAATG AAACAGAAGA TGATACCGAC GAAGACAAAG AGTGGTTAAA AGTCTTGATC TACGTTCTCG TAGGAGTGGG TTTTTCCCTT CTGGCGGCGG GCTTGATCTG GGTTGTTGGT CGTTGCACCA AGCGCTCCTT TTTTGATAGC CACAGGTCGG CTAGAAATAC GGACATAAGA GATTTAAGCT GGAATCGTCT CAAACCCGTG CAACCTATCA ATACAGCAAC TAACCCAAAT TATGTTTCCC CCGTTGGCTC AGCGCCGCCA TTCGATGACT TGTGA
|
Protein sequence | MKLLNQTISA LLLSSSLRIL EAAEAEFNAL IAMLEADVLE LARKVALLYE NRCSEDVFTG CARSNYHECM SLSPNQTCPG GKDYNVAKCG DGISCSGLLD FSVSNVRLNS KLVDFNSGNP LDPNVIDTVC FTQQLDEFFI KKREERKPYW DKLGLQTPQM YFGSQNGAFR IYPARHSEEC GQYDPTVRAW KIAADSGPKN VVLVLDTSSS MGNYNRLGLL QDAAIRIVET LSVGDRIAIV QFSSQAKPFE SKGQTFFWAT KENKIALKTY VEDLELNEGT NTLDAFNKTF AVLDDSIDQE LHNECITAVL FLTDGVVSPV MNETKSETET KILDLVTAGI SNLEARTKQP VFLFTFSVSD NNSVHEFPKR LACSTGENGF WSKIVDADKS FDSLTSYYRL LAIAMSSVEN RNFTAWVEPY NYAFSNILGT TVSAPVYDQS GVAIGVVGVD TTNAALDTVL GVPNGSQESI QRIVRRSFAK CPNFNLTLCQ RESYRRSGSA KDDALCTSSC TADDFVVVKQ EPCERTTARP SELFVENKNL QDVSYAERGC CIVGESDAAS PGQCKAVANN EEPSNDTSNT TGVDTGNTTG ADIGNTNGVD TSNETEDDTD EDKEWLKVLI YVLVGVGFSL LAAGLIWVVG RCTKRSFFDS HRSARNTDIR DLSWNRLKPV QPINTATNPN YVSPVGSAPP FDDL
|
| |
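The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The sketch below is a minimal illustration, not part of the original record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene sequence includes a single terminal stop codon (the listed sequence ends in TGA and the gene is marked as not spliced). The helper names `span_length` and `coding_codons` are hypothetical.

```python
# Values copied from the Gene Information table in this record.
START_BP = 251904
END_BP = 253988
GENE_LENGTH_BP = 2085
PROTEIN_LENGTH_AA = 694


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(cds_length_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks compare computed values against the fields listed in the table.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the interval 251904 to 253988 spans 2085 bp, and 2085 bp corresponds to 695 codons, one of which is the stop codon, leaving the 694 aa listed as Protein Length.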