Gene Information |
Locus tag | PHATRDRAFT_48755 |
Symbol | |
ID | 7195034 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | - |
Start bp | 162137 |
End bp | 163917 |
Gene Length | 1781 bp |
Protein Length | 586 aa |
Translation table | |
GC content | 60% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183432 |
Protein GI | 219126370 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.00414186 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCAATA GCGATTCGAG CAGTCGGTCT CCGGCGTCAC TCGCTGACCG GGATATCGAA GATCTCGGTC GGCGCTTTCC CTTTGGCGAT ACCGAACTCC AAGCCCTCTA CGCCGCGTAC GGAGCAATGG CGTATCCCGG GTCGGCAGTT CACCGGGTGT CCTTTTGGTC GGACTGGGCA CGAGCGATTC GCCGTCCGGG TCGTGACGCA GCGGAGGCGA CCGTGGCAAC GGACACGGCG TTGTTGCTCC GGGTGTGCGA AACCAAAATC CTCACGCCCG ATCTGGGCAC CCGGCTCTAC CGAGCCGCCT TTTGGGTACC GGGCGACGTG CTCTGGTACG GTGGTACCGG ACAATCCCAA AATCCGAAAT CTCTTGGGGA ACAAGTGAAC GCCGTGCCGC TGGCCAAGGA CGGTTCCCCC GAACCAACTA CATCGGACGA GTATACCCGT AAAGCTCGTC TCGAGCGAAC CTTTGAAGGG TTGACCTTGT CGTCCCGCAA AGGCTCCGGA CCCGCCGTCA AGGTTCTCTT TGACGCGCTC GCCCTTGATC CCCACGCACC CGCCACTCTC ACTACCGTAC CGACACGTAT ACGGGCTTTC GATTTCGTCA CGGCGGGCTA CCGCTTGGCC ATGGCCACGG CGTTTTTGGC TGCGGCCGCG CACGACAATG ACGACGCCGA CGACATGGCC GGTTTCCTCC CCGAGGCCGA TTCTTCCGGA CGCGATGCGG TTGCCTTGCG AGCGTTGGCG CAGTCTCTGA CGGAGCGAGC CCGTATGCGA GAAGCTCGGC CGGGAATGTC GATCGAGTCC CCGCACGAGG ACCATACCGA TGATCACGTG GAATTGGAAG ATGTTTACGA CTGGATCGAC GCCGTCGCTC CCTTATTCGC CTCCATTTTG CCGACGTTTT GGCACCAAAT TCTGTTTCCT CATCAAGCCT ATCCCCCTTC GCGGACCGCA TTTTCCTTCC CCCGCGTTCC CTGCGATTCC GTCTTTTTCG AATCCACCTC CAGTCCAACG CTCTTTACGT TGGGCTGTCT ATCCAAGTCC CTCACTGGTG TTTACTACCG ACTCTACACT TCCGCCAGCG ACGGCCTCTC CTTCAATCGT TTACAAAACG CGCTCTTGGG ATATTCCGGA CCGACGCTGT TGTTGATCCG GACCACGGGT GGCGCCATCC TCGGTGCCTT CACCGCTTCG GCCTGGAAGG AATCCCGCGA CTTTTACGGC AACACGGACT GTTTTTTGTT TTCCGCGGCC CCCGTGACGG CCGTCTACCG CCCCACGGGC ACGGGTCGTA ACTTTATGTA CTGCAACTCC TTCGCTCGCT CACGTGGGTA CGACCAACAA GCACACGGGA TCGGTTTTGG CGGTACCGTC GACGAACCGC GATTATTTCT GTCGGAATCC TTCGATGCGT GTCGTGCCGG AGCACAGGAC TGCACGTTTG CCAACGGATC GCTCCTACCC CGGACCAGTT CCGGAGCGCC GCAGACAAAT TTCGAACTAG ACGCGGTGGA AGTCTGGGGC GTCGGAGGGG ACGACGTGGT CGACGCGGCG TTGGGCCAAC GGCAAAAGGC GCGGGCTCTC CGGGAAGAAG GGATCCGGCG AGCGCGCAAG GTGGACAAGG CGCAATTCTT GGACGACTTC CGATCCGGCT TGATGGATTC CAAAGCCTTT CAACATCGAC AGCAAATGCG GGGTCGGGCC GATGTGGATT GCGAAGAACG AGCGACCAAA CAGTACGAGT ACGAAAAGTA AATTAGAAGA TGTTGCTTCG G
|
Protein sequence | MGNSDSSSRS PASLADRDIE DLGRRFPFGD TELQALYAAY GAMAYPGSAV HRVSFWSDWA RAIRRPGRDA AEATVATDTA LLLRVCETKI LTPDLGTRLY RAAFWVPGDV LWYGGTGQSQ NPKSLGEQVN AVPLAKDGSP EPTTSDEYTR KARLERTFEG LTLSSRKGSG PAVKVLFDAL ALDPHAPATL TTVPTRIRAF DFVTAGYRLA MATAFLAAAA HDNDDADDMA GFLPEADSSG RDAVALRALA QSLTERARMR EARPGMSIES PHEDHTDDHV ELEDVYDWID AVAPLFASIL PTFWHQILFP HQAYPPSRTA FSFPRVPCDS VFFESTSSPT LFTLGCLSKS LTGVYYRLYT SASDGLSFNR LQNALLGYSG PTLLLIRTTG GAILGAFTAS AWKESRDFYG NTDCFLFSAA PVTAVYRPTG TGRNFMYCNS FARSRGYDQQ AHGIGFGGTV DEPRLFLSES FDACRAGAQD CTFANGSLLP RTSSGAPQTN FELDAVEVWG VGGDDVVDAA LGQRQKARAL REEGIRRARK VDKAQFLDDF RSGLMDSKAF QHRQQMRGRA DVDCEERATK QYEYEK
|
| |
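The length fields in this record can be cross-checked against each other. The sketch below is a minimal illustration, not part of the source database. It assumes the Start bp and End bp coordinates are 1-based and inclusive, and that the coding region is one codon per residue plus a single stop codon. The helper name `strip_groups` and all variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 162137
end_bp = 163917
reported_gene_length_bp = 1781
protein_length_aa = 586


def strip_groups(seq: str) -> str:
    """Remove the whitespace that separates the 10-character blocks."""
    return "".join(seq.split())


# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: coding region = three bases per residue plus one stop codon.
cds_length_bp = 3 * (protein_length_aa + 1)

# Bases in the reported gene span that lie outside the coding region.
extra_bp = gene_length_bp - cds_length_bp

print(gene_length_bp == reported_gene_length_bp)
print(cds_length_bp, extra_bp)

# The first two blocks of the gene sequence, as displayed in the record.
print(len(strip_groups("ATGGGCAATA GCGATTCGAG")))
```

Under these assumptions the coordinate span matches the reported Gene Length, and the span is slightly longer than the coding region implied by the Protein Length.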