Gene Information |
Locus tag | PHATRDRAFT_54375 |
Symbol | AP1mu |
ID | 7200276 |
Type | CDS |
Is gene spliced | Yes |
Is pseudogene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | + |
Start bp | 653168 |
End bp | 654688 |
Gene Length | 1521 bp |
Protein Length | 439 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179260 |
Protein GI | 219116931 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
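The Gene Length row agrees with the Start bp and End bp rows if the coordinates are read as 1-based and inclusive. That convention is an assumption about this record, not something the record states. A minimal Python sketch of the arithmetic follows; the function name `gene_length` is hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


# Start bp and End bp taken from the record above.
print(gene_length(653168, 654688))  # prints 1521
```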
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.907036 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTGCTT CGGCCGTCTT CATTACGGAC TTGCAAGGCA AGAACATTAT CAGTCGAAAC TACCGCGGCG ATGTACCAAT GCAGAAGGCT CTGGAACGTT TTCAAACGTA CTTGCTAGAA ACCACCGACG AATCCAAGAA ACCGGTCTTT CATGTCGACA GCAATGGCGA TTGTTTAACG GAAGATAATG TCGGTGCAAC AGGAGTTGGC GGGGAAGCGT ACATCTACAT TGCGGTATGT ACAATGCCAC TCGACGTTCG ATCCAAACAA GTCTCCGGAA GACCAACTTT TTGTGACGAG CTATCGTCCT GCGTCTTTCC AACTCTTCTG CTCACTACTT TGTTTTCGTT TTGATTGCCT TTCCCAGCTG TCCAACCTGT ACTTGTGCGC CGTGACGACA CGAAATTCCA ACGTTGCTTT GATTCTGACT TTTCTCTATC GACTATCCCA AGTCTTCAAG GACTACTTCG GCACTTTGGA AGAAGAAAGC ATTCGAGACA ACTTTGTGAT CATATACGAG CTACTGGACG AAACCATGGA TCACGGTTTA CCGCAAGCGT TGGACAGTAT GATTCTGCGT TCGTTCATCA CCCAGGGTGC CAACCGGATG TCGGAAGACG CACGAAACAA GCCCCCGGTA GCTCTCACCA ACGCAGTTTC CTGGCGCGCC GAAGGAATCA AACACAAAAA GAATGAAATC TTCTTGGATG TCGTGGAGAA GCTAAACTTG CTGGTTTCGG CCAACGGTAC AGTCTTGCAT TCGGAGATCC TTGGCGCCGT CAAGATGAGG TCCTTTCTTT CCGGCATGCC CGAGCTTAAA CTAGGTCTCA ACGATAAACT CATGTTTGAA GCCACGGGCC GAGCGAATCA AGCGAAAGGA AAGGCCGTGG AACTCGAAGA CATCAAGTTC CATCAATGCG TGCGCCTCGC CCGCTTTGAA AACGACCGCA CCATTTCCTT CATCCCTCCC GACGGTGAGT TTGATCTCAT GACGTACCGC CTAAATACGC ACGTCAAACC GTTGATTTGG GTCGAGGCTG TCGTCGAGCC GCACAAAGGA AGTCGTATCG AGTACATGAT CAAGACGCGA TCCCAGTTTA AATCGCGATC CGTCGCCAAC AACGTGGAGA TCATCATTCC CGTCCCTCCG GACGTCGATT CGCCTTCTTT CAAGTGTTCG GTCGGCAGCG TTTCGTACTT GCCCGACAAA GATTCCGCTG TTTGGACAAT CAAACAGTTC CACGGCGGTC GTGAATATTT GATGCGGGCG CATTTTGGTT TGCCAAGTAT TTCAGCATCA GATATTGACC CCGAAGCGAA AAAGAAGGGA GATAACGCTT GGAAGGCGCC GATTCGGGTA CAGTTTGAAA TTCCATACTT TACTGTTTCC GGCATTCAAG TCCGGTATTT GAAGATCATC GAACGAAGCG GCTACCAGGC GTTACCATGG GTTCGTTACA TTACCGCCAA TGGTGACTAC CAGTTAAGAA TGGCATAACA CTTCCAACAT TATCTTCTAT A
|
Protein sequence | MVASAVFITD LQGKNIISRN YRGDVPMQKA LERFQTYLLE TTDESKKPVF HVDSNGDSYI YIALSNLYLC AVTTRNSNVA LILTFLYRLS QVFKDYFGTL EEESIRDNFV IIYELLDETM DHGLPQALDS MILRSFITQG ANRMSEDARN KPPVALTNAV SWRAEGIKHK KNEIFLDVVE KLNLLVSANG TVLHSEILGA VKMRSFLSGM PELKLGLNDK LMFEATGRAN QAKGKAVELE DIKFHQCVRL ARFENDRTIS FIPPDGEFDL MTYRLNTHVK PLIWVEAVVE PHKGSRIEYM IKTRSQFKSR SVANNVEIII PVPPDVDSPS FKCSVGSVSY LPDKDSAVWT IKQFHGGREY LMRAHFGLPS ISASDIDPEA KKKGDNAWKA PIRVQFEIPY FTVSGIQVRY LKIIERSGYQ ALPWVRYITA NGDYQLRMA
|
| |
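The sequences above are displayed in space-separated blocks of ten characters, so the spaces need to be removed before any length or composition check. The sketch below shows one way to do that and to compute a GC percentage in Python. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the record does not say how its own GC content figure was calculated or rounded, so the result is not guaranteed to match the table value exactly.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First three blocks of the gene sequence, used only as a short example.
blocks = "ATGGTTGCTT CGGCCGTCTT CATTACGGAC"
seq = clean_sequence(blocks)
print(len(seq), round(gc_percent(seq), 1))
```

Pasting the full Gene sequence field into `blocks` gives the figures for the whole displayed sequence.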