Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49601 |
Symbol | |
ID | 7198211 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | + |
Start bp | 165315 |
End bp | 167231 |
Gene Length | 1917 bp |
Protein Length | 465 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | alanine glyoxylate aminotransferase |
Protein accession | XP_002184315 |
Protein GI | 219128218 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000836807 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTTCCTACC GTACCAACAT CCAAATATCG ATCGACACCT ACATACATAC ATCTTATCAA TCTGTATTGC ATTGTTGTTG GATTGACAGT CAAGGTTGAT TCACAGTCAA GATTGATATC ATCATACTGA TCATTCGTTT TGTATTGATC AATCTTTCGA CGAACGACTG TGACGTTGAT TGATTCATTC CTTGTTCGAC TTGACGACTA CTTTTACCAC AGTATGTTTC GATCGGTTGC TTCTTTGGCG TTGCGCGGTA GTATTGGTAC TGGACGAGGC GTTGCACAAA GCCCGCGCGT CGTACCGTTC GGATCGGCCG TCACGGTACG CCATTCCAGT AACAGTCACA CGAACAGTAG TAGTCACACG CCGGAACGCT TGCGGTACAA TGTCATTCCC AAGTCAGACT TTGGAGCCTT TAAGGAATAT TCCGTCATTC ACACGGACCG GTCCTTGAAT CTCATGAGTG ATCCCTTTCA ACGCGTCATG CGGGATCTTA ACGAACTACT CAAGGTCACC TACAATGCTG ATAAGGTTGT CATCCTACCC GGGTACGTAC AGTTGTGAAT GTTGTGTAGG TTTAGGGTGT GTATATCTGT GTGTCTAGTT GTGTGTATTT GTGTAGTGTT GTTCCAGGTT GGTGGTGTGC AAAGTGAACT CTGGGACCGT GACTGATTAT CGCACCATTG CGACCCCAAC CCTCACTTAT ATATCATTCC CCACTGCAGA TCCGGTACGT TTGGCATGGA AGCCGTAGCT CGTCAGTTTG CACAGAACGA ACACGTCATG GTCATTCGCA ACGGTTGGTT CTCCTACCGC TGGACCGAGA TTTTCGAAAT GGGGTCGTCC GAACCCGGGG TGGAAGCCGG AGGTGTCGGG GCCGGCATTC CCACGTCCCA CACCGTCCTC AAGGCCCAAC CCGTACCCGT CCCTGGGAAC GACACGGGCA GCAGCAACAC CAAAACGACA CACTTTGCCC CGCATCCCAT TCAAGACGTG GTCTCACGGA TTCATCAGGA ACGACCCGCC GTCTTGTTCG CTCCCCACGT CGAAACATCC ACCGGTATGA TGCTTCCCGA CGAATACATA CAAAAGGCCG CCCAAGCCAT GCACGACATT GGTGGACTCT TTGTCCTCGA TTGCATCGCC TCCGGAACCG TATGGGTCGA CATGAAGGCG CTTGGGGTGG ACGTGCTCAT CTCCGCACCG CAAAAGGGAT GGACCGGTCC ACCCTGCGCC GCACTCGTCA TGATGAGCGA CCGAGCCGTC GCCCGCATGT CACAAACCTC CGAAACATCC TTCAGTATGA GTCTGAAACG ATGGGCCGCA CTCATGGACA CGTACGAAAA GGGCGGATTC GCCTACCACA CCACCATGCC CACGGATGCC CTCCGCGATT TTCACGAAAT ATCCGTCGAA ACCTTGCGTT TCGGTCTACC CGAACTCAAG ACGGCACAAT TGAATTTGGG ATGGTGGGCT CGTGGTACCC TCGATCGCAA GGGTCTCGTC TCGGTAGCCG CACCCGGATT CCAAGCACCC GGTGTGCTCG TTTACTACAG TCCCTCGCAA ACCGACAATC CCGTCATGAT GAGCTCCTTT AAGGCGCAGG GACTCCAAAT TGCCATGGGC GTGCCTTGGA AAATTGACGA ACCGGAAGGC CTCAAAACCT TCCGCATTGG ACTCTTTGGA TTGGACAAAC TGGGCAAACC CGACGAGACG ATTCGTGTCA TGGAAGAAGC CTTGGATCAG GTTTTGGACA GTGTGGGGCA CACGGCCAAG AGCAAAAAAG TGGCCTAGAC ACATGCAGGT GTGGGTCGCG TCTCCACTCC ATGTTTTCCA TTCCCACGGT GTGCACCCTC TTGTCTCCTA CTATGTATGA ACTACCATAA GCACTAGACG TTTCTGCTTT TTTTCTA
|
Protein sequence | MFRSVASLAL RGSIGTGRGV AQSPRVVPFG SAVTVRHSSN SHTNSSSHTP ERLRYNVIPK SDFGAFKEYS VIHTDRSLNL MSDPFQRVMR DLNELLKVTY NADKVVILPG SGTFGMEAVA RQFAQNEHVM VIRNGWFSYR WTEIFEMGSS EPGVEAGGVG AGIPTSHTVL KAQPVPVPGN DTGSSNTKTT HFAPHPIQDV VSRIHQERPA VLFAPHVETS TGMMLPDEYI QKAAQAMHDI GGLFVLDCIA SGTVWVDMKA LGVDVLISAP QKGWTGPPCA ALVMMSDRAV ARMSQTSETS FSMSLKRWAA LMDTYEKGGF AYHTTMPTDA LRDFHEISVE TLRFGLPELK TAQLNLGWWA RGTLDRKGLV SVAAPGFQAP GVLVYYSPSQ TDNPVMMSSF KAQGLQIAMG VPWKIDEPEG LKTFRIGLFG LDKLGKPDET IRVMEEALDQ VLDSVGHTAK SKKVA
|
| |