Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48875 |
Symbol | LLA1 |
ID | 7194955 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | - |
Start bp | 544510 |
End bp | 547002 |
Gene Length | 2493 bp |
Protein Length | 579 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | lupus la protein |
Protein accession | XP_002183506 |
Protein GI | 219126525 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.407327 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAAATCAAGA ATAGACACAT CCGCAACGTT GACCTGCACA TCCCGCCAAA CGAGATCGTG TGGCGTAGAT TGCCGCTCAC TGTCAATCCA ACAACGAGCC ACACAGTCCT TGTTTCTCGG TCGTACGGCA ATCGAATCTA GGATACATTC ACACGTAGAC GTAGACGTGC ACGTACACGT AAATCTTCAC GTCTTCTTCG ATACTCTTAC AGGGTGTCGG AAAGATCCGG TTCTTCCATC TACGAATATA TTCTTGGGAA CACTCCTTTA GTGTCAGTCT TTTTACAGAA AGCCACCTCT TGTCCTTGTT TCATCAGTCA TTCGCCCAGT AATATCAACA TGACAGAAGC TACCATGTCG AATCTTTTAG AGGCGGCTCG AGAAACGACT GCACTGGAGT GCATTAGTCG AGAAAATACC AAAGAAACAC AACTCAACTC TTCCACATCT TCGTCATCGT CGTCCCTGTC ATCCGCGTCC GAATCGTTGA CTAGGACAAC AATCCCAGCG ATGTTGAATC AAACCAAAAT AAGACAAGGA TCGACGGAAG GCGAGGTTGA CACGGACCCA ACGGGAACGG CCAGCCATGC TGCTGTACAC GATGAAACTG CCAGTCGCCT CCTTGCCTTG CAAGCAGACG CAATGGCGAG CGAACGGGAG CGCATGCTTC TTGAACAAGA AGGCCTGGTT TTGGGGATTC TCGCCAAGCA GCTGGAATAC TACTTTTCCC AAACGAATCT GGCCAAGGAT ACCTACCTGC AAACCCTCCG GTCCCTCAAC GACGGATGCG TGCCCGTCAC AATTCTCGCC AATTTTACCA AGGTTCAAGC CTTGCTACCG GGACGAACGG AACTCGGACG AATTCACGCC ATTCGTCAAG CAGTAGAATG TTTCAGCCCC AACGCCTTGC GTCTGTTTGT CATCGACTCG GTCAACAGCA AAATTGTTAC GGACCAGGAG GATTCGGTCG ATGAATTGAC GACAAGTGCT ACGACATTTA TAGTCGCAGT GGGTACCTGG GATCAACAGC CTTTGCCAGC TGTTTCCGTA GTATCTACTG CGAATGTTGC CTCAGTGAGC GCATCCCACG CTACGATCAT TGTGCGTGAT GTTCCGGAGC ATGTATCCGA AGAAGATGTC CGATTAATCT TTGCACTTCC GGATGGCCCG TCCATTGTTT CGGTACGCCA GGACGTTGCT CATTGCTGGT ACGCATCCTT GTTGAATCCA CCGGTCGCTT GAATATTGTA TCTTTGCAAA CGGTGCTTAG GCTCACCACT TACTTTCCTC AACTACAGGT TCGTGACCCT GGATATCGAA TCCCCGGGTG CCGACGGCGA CGATGTTACG ATGAAGGTCA TGATGCACTT ACAGGCCCAA CTTTTGGGTG GAGAACCCGT CAAGGCTCGA CGAAAGGCGA GTGTAGCCTC CAACCTTTCC ATCGACCCTA TTCCGTTTTG GTTACCCCCA ATTCCGCTTA AACGAAAGAA GAAAAAGAAG CGATCCAAAA AGAAGAAAAA GAATTCCAGT ACCTCCGGCA GTGCAAATTC CAACAGCGAC GGTACACACA ACAATATCAA CGCAGCTGCG GCGACCAGAA ATCAAAACCA GAATATGCCG GGAAAAGGCA CAGAAACTGC CTCATCATTT CACAGCGCAT TCGCAATCAA GAGTAGCATT CCGTTGGTCA GCCCGCCAAC CTTGGGGGAA GACAACTTTC CAACGCTTCA AGACAAAAAG GTTGAATGGG AGACTCCACC GACGGCAGGA CTCGAAGACG ATGAAAAGTA CGACAAGCAC GAAGACGATG ATGCTGACGA CGAGGAAGAC AAAGAGGACC CTAAATCGGT CAAGGCCTTG TCGGATGTTG CGTCAACAGC GACGACGACT TCGTCTAGTA CGGAGTCGAC CCCCCACGGC AAGAAACTCT GGGGGACTGT GGGAGGCTAC GCCGCGGCCC TCATGAAGCA GGCCGTTGTA CCACCACCAG TTAGTGAGAC AAAAGTTGCT ATTCTTCCTG TCAGTACGGA ATCGAAAGCG GGGCATTTTT CTGCTCTGTC CCACAAGGCT ACGGCTCCAG CTCCCGTTGT GACTGTTTCT ACTCCAAAGT GGGGCGGAAT TCGGTCATTT GCCGATGTCT TACGCCAGGA GGAAGCGCAG CAATCATAAG GACAATGCTT GGGTTGCGTT GTGGCGCACT ATTGGGCGAA CAAGCCTGCT CTCGACGAGA CGCATTCACT GTGTCGTCTT TAACGAGTGT CTACCCCCAC ACGCCCTTGT GCATTGGACA AATCACTTTG CTTCTCGTAG GGAGCGGCCA CTTTTGCCGC AGTACGTACG CAACAACCAC AAAACTCACA GCTTCACATC TTGATTACCA CCTGTTTACA CTTAGCAAAA CGAAGAGCTA CTATGATGTT GACAAGCCCT TACACTCTAT CATGCAGCAA CCTAATGCGA CTAATATTAC TAAAATGGAA ACGATACGCT ATT
|
Protein sequence | MTEATMSNLL EAARETTALE CISRENTKET QLNSSTSSSS SSLSSASESL TRTTIPAMLN QTKIRQGSTE GEVDTDPTGT ASHAAVHDET ASRLLALQAD AMASERERML LEQEGLVLGI LAKQLEYYFS QTNLAKDTYL QTLRSLNDGC VPVTILANFT KVQALLPGRT ELGRIHAIRQ AVECFSPNAL RLFVIDSVNS KIVTDQEDSV DELTTSATTF IVAVGTWDQQ PLPAVSVVST ANVASVSASH ATIIVRDVPE HVSEEDVRLI FALPDGPSIV SVRQDVAHCW FVTLDIESPG ADGDDVTMKV MMHLQAQLLG GEPVKARRKA SVASNLSIDP IPFWLPPIPL KRKKKKKRSK KKKKNSSTSG SANSNSDGTH NNINAAAATR NQNQNMPGKG TETASSFHSA FAIKSSIPLV SPPTLGEDNF PTLQDKKVEW ETPPTAGLED DEKYDKHEDD DADDEEDKED PKSVKALSDV ASTATTTSSS TESTPHGKKL WGTVGGYAAA LMKQAVVPPP VSETKVAILP VSTESKAGHF SALSHKATAP APVVTVSTPK WGGIRSFADV LRQEEAQQS
|
| |