Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_1199 |
Symbol | |
ID | 7202278 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | - |
Start bp | 792245 |
End bp | 794162 |
Gene Length | 1918 bp |
Protein Length | 486 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181805 |
Protein GI | 219122964 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTACCCCAAG ATGTTGTGGA CGATATTCTC CAATCTTTGA TTCGACATTC AGCGTTGAAT GCGACCACAC TTCGCATTCT ACGAAACTGT GAATTGGGTG TGCTTTCGCT TTCGGGATGT CGCGGTGTCA CGGACGAATG GCTAGAGGCG CTATCCGCAG AGAGCTCTGA CTCGCCGCCG CACCTTGTTC CTTCGGAAAA TTGCGATACG GCTGACAGCA TGGATCTCGA AGGCGCCAGA AAGCCTTATG AAGCCTTGTC AAGCAATGAA CATTCCGAAA GGTGTGGACA GACTTTTTCG TCTTGCTCCT CTAGCTCGTA TGCTTCGGCA AGATCAACTT CATATCCTTT TTTGGAAGAA GACTTTAATA TTACCAATTC TCCCTCTATC CCTAACAGTG TACTTGGCAA CAAAAAGCGA ACCGCTTTGA TGTGGCATCC TTGCGCGGCT TCGAGCGCAT TGACCAATAC GACGCTTCTG GACCTTCGTG GTTCCCAACG GCTAACCGAC CGTGGTCTAA TGCAGCTTCA TGACCTCGGC CGACTGGAAG TTGCCAAGCT TGACAACTGT CATTCCGTTG TTGGCAGAGG CCTTGTTGTA CTCTCCTCGT CTCCTCGTTT GCACACGTTG TCGCTGACAA ATTGCCGGAG ATTAACGGAC GAAGCGATCG TCAACATCTC GCACCTGCAA TCCCTCCAGG CTCTATCATT AGACGGATGT CGCTGCATAA CGGACTTCTC GTTGGCTGCC TTGGCCGATA TGTACAACCT TAGGAAGCTA GGACTCAGTC AATGCGACTT GATTACCAAC GAGGGTTTGA AAGCGCTCGA GCACCTGCAA CGCTTACAAG AGATTAGTCT GGGATGGTGT CGTCAAGTGT CAGATGCCGG AATCCAGACA TTGACCGCAC AACCAGGACG CTCTTCAAAC CTGCAAATTC TTCGCCTTGC TCGATGTCCC ATCACAGACG AAGGCGTCCA ATACCTGGGT GGTTTGGCGG CGCTCGAGGA GCTCGATTTG AACGGATGTT GTCGGATTAG CAGCGCACCT CTCGGCAAGG CATTGGAAAA AATGCTTTGC CTTACAGTAT TGGATGTGTC GCATTGTCCC GGGATACTGT GAGTGTTCTG CTCGAAATCA TTGATGAATT TTGGTAGGTT TCGGATCTCA TCGCTCTTCT CCTGTTGCTG CCTTGGACAG ACGTTCGGAC TGGCAAGGCA AAATCAGGAA CGTTAAAACA CTGGAGCTGT GTTATTCCGC CGTCAAGGAC ATTCATTTGA CAAAGCTAGT GAACCTTCCT ATGCTGGAAG AGTTGAATCT AGATTCTTGT CCCATTGGCG ACCTAGCGAT CCAGCACTTT GCGAACCATA ACGTTCTTCC CAACCTTGTG TCTTTGGATT TAGCCGACAG TGACATTAGC GATCTTGGCA TGGTCCAAAT TGCCAAGTTT ACGAAATTGA AGCGCCTTTC TCTCTTTTAT TGCAGCATTA GTAATAGAGG ACTGCGACAC TTGTCCATCT TGACCGAGCT GCGGGTACTG AACCTTGACA GTCGCGATAT CTCCGACGAC GGGTTGCGCC ACCTGCAGCA TTTGAAACAA CTCAAGTCTC TGGACATTTT TTCGGGCCGA GTGACGGATC TCGGCTGCAC TTACCTTTCC AAAATCAAGA CACTTGAATC TCTGGAGCTC TGCGGTGGTG GAGTCCGGGA TGCGGGTTGT GCATCGTTGG CCAAGCTCGA GAATCTGACG AGTCTCAATT TGTCGCAGAA CGAGCGGATC ACCAATCGCG GAGCGGCGGC ACTGGCTGCG CTCTCGAAAT TGAAAGCTCT CAACTTGAGT CACACACGAG TCAACGCATC CGCTTTGCGC TACTTTAGCG GCCTCATGAA TTTACAGTCG CTGGCATTGT ACGGTTGC
|
Protein sequence | LPQDVVDDIL QSLIRHSALN ATTLRILRNC ELGVLSLSGC RGVTDEWLEA LSAESSDSPP HLRTALMWHP CAASSALTNT TLLDLRGSQR LTDRGLMQLH DLGRLEVAKL DNCHSVVGRG LVVLSSSPRL HTLSLTNCRR LTDEAIVNIS HLQSLQALSL DGCRCITDFS LAALADMYNL RKLGLSQCDL ITNEGLKALE HLQRLQEISL GWCRQVSDAG IQTLTAQPGR SSNLQILRLA RCPITDEGVQ YLGKIRNVKT LELCYSAVKD IHLTKLVNLP MLEELNLDSC PIGDLAIQHF ANHNVLPNLV SLDLADSDIS DLGMVQIAKF TKLKRLSLFY CSISNRGLRH LSILTELRVL NLDSRDISDD GLRHLQHLKQ LKSLDIFSGR VTDLGCTYLS KIKTLESLEL CGGGVRDAGC ASLAKLENLT SLNLSQNERI TNRGAAALAA LSKLKALNLS HTRVNASALR YFSGLMNLQS LALYGC
|
| |