Gene Information |
Locus tag | PHATRDRAFT_48612 |
Symbol | |
ID | 7194822 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | - |
Start bp | 368714 |
End bp | 370134 |
Gene Length | 1421 bp |
Protein Length | 374 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183213 |
Protein GI | 219125911 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000111344 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGCTTTGCAT CGAGTACACA GTTCCTTCTA TCCCGTCATT TCAAATTGCA GTATAGAGGA TGTGGCGTGT AAGTCTGATG GTTCGACTCA GCTAGGAACT CCCAGAAAGC CGCAAAACAA AACAATGGCC GAAAACAATA GCAGCAGTGG CATTAAAGTC TTTCTTCATA TAAATGGATC TCCCCAGCAA TCTGAAAAAT ATTCTACCAA TGCCATTCGA ATCGATAGAA ACAGCCGCTC TGATGGCCGC GTCGAGTTGA ACGTTTTGTT TCCTTCTATG GTTGGTACCT CAGAAGCACA GCTTGCAGCC GCACTTATTT CCGACGCGAT TGAGCAGCTC GGTAGAAATG TGGACGTAGA AATGATCCAA CTTAGTGGGC TTCGTTTCTC AAAAGGAGGG GTCGATGGAC TCAAAGCCGC TTTGTCTATC CATGCTTTCT CAGTGAAGCA CGAAGTCTTG ACAGACATCG ATTTTTGCAA AAAGCTGCCC GGCGACCAAG ACGCACTTGC TGCTTTTTCG TCAATCTTCG CTGCTTTTAA GCTGAGTGGA CTAAATTTGA GTGATAACAC AATTGCGGGG TGTGTTTGGC GCTTTTGGGG GTCACAATCG AACTTGAAAG TCCTAATGCT ATAAAACGTT GAAATGGACA TAGTAAGTTT TGAATCGCTC GAGTGCAACT GGAACTGGGG CGTTTCTTTA GGAGACATCA ATATTGAAAT GGGAAATGAG AAATTGCATG GGTCGGGCGG AAGCTGCAAT CGTAAGCAAA GCCTTACGTA AGTGCTTGAA CCTTTTCAGT GTGCGTTGTG TCAATCGTGT TGTAGAATAC GAAACGGCAC TACCATGGCA AGGACTGAGA GACATGGCTC ATGATACACT GCATAGTAGC GGTCGCTGTC TTGGACATCT AGCAATGGAA GGCTGCAATC TTGCGTCTGA CAACATTGCA TCGCTTGGGC TTTGCGGTGC GTTGAATGAT TTTCCTAGAA TGCGCACTCT CAATCTAGTA CGCGTGGGTC TTGATGATCA AATAATAAAG TTTGTTTCTC TCAGTTTGGG GAGTACTGGG GACTGCCTCG AATGGCTTGA TCTGACTGGG AACCAGATCG GGACTGTTGG TGCGAAGTCG CTGGCGTGTC TAGCCCGTAC ACCTAAGATT GCGAAGCGTC TTCAGTATTT GTCACTGGAA AATAACCAAA TTGACAAGGC TGGGGCAGTC GAATTGCTGG ACGCATTTGG ACGCATTGGA TCAAACAAAT TTGATCTAAA CTTAAAGGGC AATCCATGTG ACATGGGTGC AGTTGCACTT GAGATTGCAG CCAGTCAATG CTATACGGAG CAGCAAAATA TTGATCTACT CAAAGGAAGG GATGTTTTAC GTAACGACAT CCAACAAGCC CAGAAGAACG CACAGAACTA A
|
Protein sequence | MAENNSSSGI KVFLHINGSP QQSEKYSTNA IRIDRNSRSD GRVELNVLFP SMVGTSEAQL AAALISDAIE QLGRNVDVEM IQLSGLRFSK GGVDGLKAAL SIHAFSVKHE VLTDIDFCKK LPGDQDALAA FSSIFAAFKL SGLNLSDNTI AGCVWRFWGS QSNLKETSIL KWEMRNCMGR AEAAIVSKAL QYETALPWQG LRDMAHDTLH SSGRCLGHLA MEGCNLASDN IASLGLCVRV GLDDQIIKFV SLSLGSTGDC LEWLDLTGNQ IGTVGAKSLA CLARTPKIAK RLQYLSLENN QIDKAGAVEL LDAFGRIGSN KFDLNLKGNP CDMGAVALEI AASQCYTEQQ NIDLLKGRDV LRNDIQQAQK NAQN
|