Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44287 |
Symbol | |
ID | 7198004 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 129526 |
End bp | 130620 |
Gene Length | 1095 bp |
Protein Length | 364 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178433 |
Protein GI | 219115275 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGAGTA GGGCCGCAAA AGGAACCACT GGAGGGTTTT CGTTCGACTT TTTGTCGTCA GATGATAGAG CACCCTTTCC TGAGTCAAAC CACACGTCTC ATTCCACGGA GCTGTGCCAA GATACTGACA GTGAACGTCG TCCTTTGGTG TGGATCAAAA ATGTCACAGA GCTTCTTTTG GATCGATCGC AGGAAGAAAT TGTGTTTGAC GAGATTCCTT GGCCATCGAA TGATAAGAGC GATGATGATG ACACGGTCGC AAACGAAGTG CTGCATACCT TGAAATGTTT GGCACCTGTA CGTCGAGTAG ATCATCATTC GTCGTCATTC GTTGATCAGA GGGAAACTAC TGGCATCAAC TTAGGCTTTG AGAGTCAGAT AGACACCTGG CAGAACACAG ACATAGAGCC GGGTGTTTAC GAAGGCGGCA TGAAAGTGTG GGAATGTAGT ATCGACCTAG TTCGCTACCT TGCAACTCAG GAGATTCGAC TGGATCCGAA CCAATTCGCA ATCGAGCTCG GATGTGGCCA TGGTTTGCCG GCGTGCTATT TACTACGGGA AAGCTTACGG GCATCCCGCA GAGCAGATTT CAATGACGAT GAGGCTTTTA AAATCATATT TACTGACTAC AACGACTATG TGCTCAAAGA CGTGACTATT TCAAACATGT TCATCAACAT TGTTCAGCAA GTATCGAATG AAACCATCAA AGCGTCCGAT GCCGACCTTA AGCGCGTGGG CGAAAGTGTT CTCCTCGGTG CCGGGGATTG GATGAACTTG TCGCGGCAGT TGACAAACGC AGATGCAGGG GATCTGCCAC TACCCAAGGA TGGCCATTTC GATTTAATTT TGGCAGCTGA GACGCTTTAT TCAGAGATAA CTGCACGTGA GACTGCACAA TGGTTTAGTC GACACCTGAA ACCTAACTCC GGCGTTGGTC TGGTGGCGAG TAAGCGATAT TACTTTGGCG TCGGTGGTGG CGTCGATACT TTTCGGATGA CGGCGCAGTC GCTCGATTTG CTGGTGGAAA CGGTAAAAAT ATATGACAAC GGCTCTAGCA ACATTCGGGA ACTGCTGCGT GTGCAAAAGG TATAA
|
Protein sequence | MASRAAKGTT GGFSFDFLSS DDRAPFPESN HTSHSTELCQ DTDSERRPLV WIKNVTELLL DRSQEEIVFD EIPWPSNDKS DDDDTVANEV LHTLKCLAPV RRVDHHSSSF VDQRETTGIN LGFESQIDTW QNTDIEPGVY EGGMKVWECS IDLVRYLATQ EIRLDPNQFA IELGCGHGLP ACYLLRESLR ASRRADFNDD EAFKIIFTDY NDYVLKDVTI SNMFINIVQQ VSNETIKASD ADLKRVGESV LLGAGDWMNL SRQLTNADAG DLPLPKDGHF DLILAAETLY SEITARETAQ WFSRHLKPNS GVGLVASKRY YFGVGGGVDT FRMTAQSLDL LVETVKIYDN GSSNIRELLR VQKV
|
| |