Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43764 |
Symbol | |
ID | 7197041 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 1411496 |
End bp | 1412632 |
Gene Length | 1137 bp |
Protein Length | 365 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178137 |
Protein GI | 219112771 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGATTC GATCGAAGCA AACCACCGAA TTCCATTTCG AAAGGGTATT GCTCTATGGA AAAAGAAAGA AGAGTCTTCC ACCCAAAAGT TGTCTTGTTT TCAAGGTGAC GGACATGACT TGTTTTAATT CTGTCTCACA ACCGACGAGT TGCTGGATTG TTCAGGTTTT TCTTGTCCCT ATTTGTGTTG CAGACGTATC AGATGCCATG AGTAGAACTT GTCGAACAGC GATTCTGGTC ACGGGCTCAA CTGATGGTAT AGGATTAACC ACGGCAAAAA ATGTAGCCAG TCGTGGGTAT GATGTTATTA TCCATGGTCG CGAGACCACT CGCCTGGACC AAGCACGTAA AGCAGTGCAC GCTTATGCAG TGCAGCACTC GAGCGATCCA GGCCGTCTCT TCCCACTGCC GCCAGCAGAT TTCTCCAGTA TTCGGGAATG TCACGGCTTT GCGCACGATG TCCGCAATCT TTGCAAGGAG AAAGATCTGC AGCTCTGCGT ACTCATGAAT AATGCTGGCG TGTATTCGGA AAAGCACATC ATTACAGAAG ACGGACTCGA GCAGACATTC GCCGTAAACG TTGTGGCGCC GTTTATTGTC ACATCGCTCT TGTTACCGAC GTTGCTCCAG CAAAAGAGTA GAATAGTTAT TGCTTCGTCA ATTTCTCAGT GTGGCAAGAT TCGAAGCTGG AAAGACCTTC ATTACCAGAA TCGGTCATAT AGTGCGGATG CTTCATACAG TGAATCAAAG CTTTTAGATG CTATGCTTTC TATGGAAATG GCAGATCGGC TTCAAAAAGC TGGATTCGGA ACCAATCAAA TCACTTGCAA TTGCCTCGAC CCGGGGACCG TTAATACAAA GATGCTGTTG GCGGGTTGGG GGGCATGTGG AATTCATGTC GAGGACGCAC TCGACCAGAC GTGGCTATGT ACTTCCGAAG AAGTAGAGAA TGAAACGGGG AAATATTTCG TGTACCAGAA GGATCGCTCT GCTCAGTCGT CTGCTTACGA TCAAAACGAA CGCAACAAAA TGTGGGCGCT ATTGTCAGAG CTTGCTCCAG AAGCAGCAGC AATGTGGACA TTTGACTGGT ATAAATGACT ACGTTGATTT TATAATAAGT AAAATGGTTT GGATTGG
|
Protein sequence | MSIRSKQTTE FHFERVLLYG KRKKSLPPKS CLVFKVTDMT CFNSVSQPTS CWIVQVFLVP ICVADVSDAM SRTCRTAILV TGSTDGIGLT TAKNVASRGY DVIIHGRETT RLDQARKAVH AYAVQHSSDP GRLFPLPPAD FSSIRECHGF AHDVRNLCKE KDLQLCVLMN NAGVYSEKHI ITEDGLEQTF AVNVVAPFIV TSLLLPTLLQ QKSRIVIASS ISQCGKIRSW KDLHYQNRSY SADASYSESK LLDAMLSMEM ADRLQKAGFG TNQITCNCLD PGTVNTKMLL AGWGACGIHV EDALDQTWLC TSEEVENETG KYFVYQKDRS AQSSAYDQNE RNKMWALLSE LAPEAAAMWT FDWYK
|
| |