Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44770 |
Symbol | |
ID | 7199737 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | + |
Start bp | 193950 |
End bp | 196020 |
Gene Length | 2071 bp |
Protein Length | 466 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178723 |
Protein GI | 219115856 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.294873 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AACACAGTCC GTGTCCTCCG TGTTTTTTGT GCGTTGATTG GTGCACAGTC AAGAAGGACG AAAGTGAAAC TACTTCCCTC TTCCCATCGT ACCGTTTTCG GTGCGTACAA ACGCGAAGAG ATCGACGACA AATCTTTGCC ATTCCTTCGC GTTGTGCGAC TAACATTCTC TGGCGGATTT TCTCACTGCA TCCCTGCTCG ATTCCTGTTT TTCCTTATCA TACCTGTAGT GGCGTCCATT GTACAATGAA ATTCTCGCAC TCCGTTTTGT TGAGCTTTCT CGCGACGACG GTGTTATCTG CGACGCCGTC CGGTGCCTTT GCGCCGCATC AGTCCACGAC CGTCAAGTCT TTAGCACGGA ACATGGTGGC TTCACTGGAA CCCCAAGCTC CGCCGGCACG CGAAGCACCC GGCGCCGGAT ATTTGCCCGA CTGGGAAGAT CGACCCGGCA AGACTCCAGC CGAGTTCATG CAGTCCGATC TCACCAAACC GGATAGGAGC GCCATGTGGG AATGCCCTTT GACTCGATGG AACTCGGAGG GGTACGTTCT CTTGTGTTTC ATCAATGAAA ACCACGTGCG CTCCCCGTCA CGGTGTCGTC AGCACAAACA TTAAAACCAT CCCGCTTTAC GTACAAACAC CTAGTCCTCA CGCTGCCGTT TGTCTATCTC GTTCACAGTA TCGACATTGA ACAAGCGCAA AAGGAAGCGG CCAAAATGCC GCACTGTCCG GCCGAAATCC GTGCCTCGAA CGCGGACAAT GTCATGGGTC GAGACTACTT TGCCACTAAC AAAGAGAAAA TTCGGGCCGA TTTGCTACAA CACGGAGCTG TCTGGTTGCG GGGGTTTGAC CTCATGAAGG ATGTACAGGG GCACCGAGCC ATGTACGAAG CACTCGAGCT GGAGCCCTGT CTGGATCCCT TGCACTCTTC CGGATTGCGC AAATTTGCCT CGGAACGGGA TGCCCTGTAC GAAGAGGTAC GCTTAAACTC TACGATGCTC TTTTGGAACA GCCGCGATCC TTCACTTACC CATACTTTCT TCTTGCTCCA TCTGTATTTG ATACTTGATC TGCTTCTGCT GTCCGTGATC GGGTGCGCAG GTCAATAAAC CGTCGTTGCG TGGACATTAC ATTGGCTTGC ACTGCGAGTC GACAACGAAA CGCACGGCGG CGTACGCCGC ATTTGTTTGC TTCCAAAAGG CCACCGAGGG CGGCGGCCGT TTTCTAGTGG CCGACGGTGC GGCCATTTTA GCCGAACTCG ACACAGCCTT GCTCAAAAAA CTCTACGCAC GCGAAATCCG TATTTCCGTC AGCAACCTGG ACATTCCTCC AGCATTCCCG GGGTTCCTTA AGGAAGGTAT CAAAGGTTTA GTGGACGCCG CCGTAGCTCC CAAATTCGAT ATGGACTTGG AAATGATGTA CGAAGCTGAC GGCAAACCCG GTCGTTTACA GGCCATTGAA ATGGCGGAAT CGCCCATTAA TCGCCACCCA GTAACGGGTT TGCCGGTCTG GTTCAACAAC GCTCACAACC ATGCTCGCAA ACTGCGCGAC CGCCGTCCCT GCGGAGTTCC CGAAGTGGGC ATGACGGAAG TCTTTTACGC CGATACCATG GAACCGCTAA GTTTGGAGGA TTGTCAGGAA ATCAAGCGCG CCAGTGAAAA ACACATTACG GCCTTGAGTA TGGAGCCGGG TGACGTGTTG CTGGTGGACA ATTACCGTGC CTTGCACGGA CGCGACGTCT TTCAGGGCGA TCGGTTCCAC GCCGTGACCT GGTTCACGTG GGACGAAAAC GAAGCCTGGC GTGGAGAAGA GCGTCGCCAA GTGGAAAAGA ATGGATTGAA CAAGGCGATC AATAGCATGA TGGACTTTTT ACCCAAGGAC TTTGAGTCGA ATCAGAGCAG CAAGTAAGGA GCGCGGTACC AAGGATTGAC CATTGATTCG ACCAAAATCG CAGACACAGA GTTACTCTTC TGCATTTTCT ACACATATAC TTTGATTAGA ATTGGATTTG CAACCTTGGT TGAGAATGCC AGCTAGGTGA ACCGATTAGG CAATAGCGGA CTACCTATTG T
|
Protein sequence | MKFSHSVLLS FLATTVLSAT PSGAFAPHQS TTVKSLARNM VASLEPQAPP AREAPGAGYL PDWEDRPGKT PAEFMQSDLT KPDRSAMWEC PLTRWNSEGI DIEQAQKEAA KMPHCPAEIR ASNADNVMGR DYFATNKEKI RADLLQHGAV WLRGFDLMKD VQGHRAMYEA LELEPCLDPL HSSGLRKFAS ERDALYEEVN KPSLRGHYIG LHCESTTKRT AAYAAFVCFQ KATEGGGRFL VADGAAILAE LDTALLKKLY AREIRISVSN LDIPPAFPGF LKEGIKGLVD AAVAPKFDMD LEMMYEADGK PGRLQAIEMA ESPINRHPVT GLPVWFNNAH NHARKLRDRR PCGVPEVGMT EVFYADTMEP LSLEDCQEIK RASEKHITAL SMEPGDVLLV DNYRALHGRD VFQGDRFHAV TWFTWDENEA WRGEERRQVE KNGLNKAINS MMDFLPKDFE SNQSSK
|
| |