Gene Information |
Locus tag | PHATRDRAFT_47492 |
Symbol | |
ID | 7202596 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 766577 |
End bp | 768928 |
Gene Length | 2352 bp |
Protein Length | 640 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181625 |
Protein GI | 219122591 |
COG category | |
COG ID | |
TIGRFAM ID | |
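The Gene Length field above agrees with the Start bp and End bp fields if the coordinates are read as 1-based and inclusive, which is an assumption here and is not stated in the record. A minimal Python sketch of that arithmetic follows, together with a helper for GC percentage. The function names `gene_length` and `gc_percent` are illustrative only and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percent of G and C bases; whitespace between 10-base groups is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Start bp and End bp as listed in the record.
print(gene_length(766577, 768928))  # prints 2352
```

To check a GC figure, pass the gene sequence string to `gc_percent` exactly as displayed, spaces included. The result depends on which span of sequence is supplied, so it may not match the listed value to the percent.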
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00475484 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GTAGCAGCAA CCCTGTCTCG CGACGAACTC TATTTCACCT GTCATTCACT GTAGTCATTC ACACGGCTTT GGTCTTTTTG TGCAACCAAC GAAAAACGAA CCAAGTGGAG GCGGGAAGAA GGAGACCTTT TACATACCGT TGGGCCGGGC CATTGAGACC AGGGATTGAC TGACTGCGAG TGTTATTCTC TCTCGTTTTC TCACACACAC GCACACATAC ATACACACAC AATCACACAC ACATCCACAG TCCATCATGG CGCTCGGTAC ACGATCCAAC CAGACACTAG TGGTTGGTTT GGCGGCTGCG GCGACCGCTA GTCTCTTGTT GTACTACGTC GTGCAAGCCC GGACCCGACC AACTTCGGAT TCCTCGCCCC GCGACACCAC CGATCGCAAG TCTCGATCGG TAGAATTCAC CACACCGTCA CCGAAAAAGA CCTCCGGGCC TTTAACGAAG GCGGTCGGGA AAGGAGAAGC AGCGGACGGA TCCGAGGACC AGACACCCAT TGTGAAGAAC ACGGCGAAAG ACGCGGAAAA AGAAGTAAAC GCCAAAATTG AACTCTTGGA CAAGAAGGGG AAGGATCTCT TCAAGAACAA ACAGGTACGT AACAAGGAGG AGTAAGAAGA AAATAGAGGC CAGATCTTTC CAAGTGGGAC GAGACAACTC CTCGATACAA CCAGTGCAAT CCTCCCACTC ACCCCCGTTC CCCAACACTA TTTCCTTTGG CAGTACCTCG ATGCCGCGGA AACATTCACG GAAGCGCTCA CGCTCATTGA AACCAGCGAT ACGAGCGGTT CCGTCCAGAA CACATCGTCC TCTTTGCATC GCCAATTGAT TACCTTGCTC AACAATCGCT CCGCCATGTA CGAAAAAGGC AACCAGCCCG AATTGGCCCT GGAAGATTGC ACGCAAATAC TCGACCAGGA TGTGCATCAC GCCAAAGCCC GCACGCGGAA ACTCCGCGTT CTGGAAAGTC TCGGTCGTTG GCACGATGCA CTCGTCGAGG TCTGCGCCGT GCAGCTTTTG TTTATGCGCA AACACCGGGA CAGCATGCGA CTCGGCTTGA AGGTACCGCC GCCTCCCGTT CCCGAATCCA AAATGCAAGA AATTCTCACC AACGTTGTGC CGTTGGAAAT GGAACCCTAC ATCCAAGCGC TCAACGAAAA AACCACGCGA CCACTGCCAT CGGGCTACAC GATCCTGCAA TTGCTCCGCT CCTTTACGTC CTACAATAGC TGGATGGCGC AAGCGGCCAA AGACGGCAAC GTCGCGAACA TTGACAAGGA GTTGGTGGAA GGCGTGGATG CGGCGAGTAA AGCACAGCGC GTGCACGTCC TACTGAAACG CGGACGCCGG CACGTGTACG ACCGGGCTTT TGAAAACGCC AGTGACGATT TCGAACAAGC CTACGCTCTG GCCGAAACCA ATGAGGTGCA ACTATTGTTG GAAGGAGACG ACTACGCTCG CGTGTTGGAA TGGACGGGCA TGGTCAAGCA CTGGCGATAC AAGCTGGACG AAGCCTCGGC TTGTTACGAA AAGTGTGCCG ATCTAGAGCC GACCAATGCC TTGGTGCTGG TAAAGAATGC CGGAGTCAAA ATGGACGGTT CCCACCAAGA CGAAGCCATG AAACTGTTCG ACACTGCCCT GGGACTCGAT CCGAAAAACG CCGACGCGCT CTTGCACCGT GCTAATCTGC GACTGTTGCA AACCAAGCCA GACGAAGCCA AGGAAGATCT GGAAGCCTGC ATTGCGGTGC GACCCGACCA TATTATGGCC CGTCTCCGAC TGGCGTCCAT 
TTTGGCCGCG ACGAACGAAG CGGCCAAGGC GAAAAAGCAC TTGGACGCGG CCGAAAAAGT GGAGCCCAAA TCGTCCGAAG TGCAATCGTA CCGAGGCGAG CTACACTTTA CGCAAGGAGA ATTCGACCAG GCCCGTGCGC AGTTTGAAAA GGCCATTGCG CTGGACCCCA CCAATCCCAC TCCGTACGTG AACGCCGCCA TGGCTATTTT GCAGACGCCA CCGCCGCCGG GACAAATGCC GGATGCGCAG GAGGTAATTC GTCTGTTGGA AGAAGCCATT CGTGTCGATC CGTCGTTTAC GTTGGCGTAC ACGCATCTCG GCAACGTGAA ACTTGGAACC GCCACGGAAC TTTCCAGTGC TCGTGAAGTG GTGACGCTGT ACGACCAGGC ACTGGCCAAC TGCCGATCGG AAGAAGAAAT CAAGGAACTG TGCAGTATGC GCATTCTAGC GGTAGCTCAA GTAGAGGCAG CCAGCATGCT GAAAATGGAA TCCTTCAACA TGCAATAGAG CGTACGAGAA CAGTCATTAA GCAACTACTC GACGAGGAGA CT
Protein sequence | MALGTRSNQT LVVGLAAAAT ASLLLYYVVQ ARTRPTSDSS PRDTTDRKSR SVEFTTPSPK KTSGPLTKAV GKGEAADGSE DQTPIVKNTA KDAEKEVNAK IELLDKKGKD LFKNKQYLDA AETFTEALTL IETSDTSGSV QNTSSSLHRQ LITLLNNRSA MYEKGNQPEL ALEDCTQILD QDVHHAKART RKLRVLESLG RWHDALVEVC AVQLLFMRKH RDSMRLGLKV PPPPVPESKM QEILTNVVPL EMEPYIQALN EKTTRPLPSG YTILQLLRSF TSYNSWMAQA AKDGNVANID KELVEGVDAA SKAQRVHVLL KRGRRHVYDR AFENASDDFE QAYALAETNE VQLLLEGDDY ARVLEWTGMV KHWRYKLDEA SACYEKCADL EPTNALVLVK NAGVKMDGSH QDEAMKLFDT ALGLDPKNAD ALLHRANLRL LQTKPDEAKE DLEACIAVRP DHIMARLRLA SILAATNEAA KAKKHLDAAE KVEPKSSEVQ SYRGELHFTQ GEFDQARAQF EKAIALDPTN PTPYVNAAMA ILQTPPPPGQ MPDAQEVIRL LEEAIRVDPS FTLAYTHLGN VKLGTATELS SAREVVTLYD QALANCRSEE EIKELCSMRI LAVAQVEAAS MLKMESFNMQ