Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_9547 |
Symbol | RFC4 |
ID | 7196349 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 860881 |
End bp | 862190 |
Gene Length | 1310 bp |
Protein Length | 350 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176669 |
Protein GI | 219109832 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.606232 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGACG AATTAGCCAC CCAGCCTACT CTGGTCCAGG ATAACAGGCC CTGGGTGGAA CGGTATCGAC CGAAATCGTT ACAGGAAGTG TCGCATCAAA CCGAAGTCGT CGCGACACTC CAGAACGCGG TTACCACCGG CCGCTTGCCA CATCTGTTAC TTTATGGGCC TCCCGGCAGT GGAAAGGTAC AAACTCAGAC TCGATTTGAT ACCATTTCGT CTTTGTTATT ATATCTTTTC GTTCTTCTCA CGTTCGCAAC CAATCAAACA GACTTCCGTA GCTTTGGCGT TGTGTCGTCA ACTCTGGCAT CCTTCGCAAT GGCGTCGCCG TGTATTAGAG TTAAACGCAA GTGATGAGCG TGGGATAAGC GTAGTGCGCA ACAAAATCAA GCATTTTGCT AGTTTGACCG TGGCGAAAGG AAACAGTAAC GTTTCGTCCA CCAAGAGCTT CTTTCTCAAG AAGAAAGACG CTGGTAGCGA TGAGATGGAG ACAGACGATA TGGAGAACTA TCCCAATCCT CCTTTCAAGA TCATCATTCT TGATGAGGCC GATACAGTCA CACCCGACGC ACAAGCTGCT CTACGACGAA TCATTGTAAG TCGTCTTTTA TGATGAAGCT TGTTGTGCTT TGTTCGACAA AACATACTGA GACGGGGCTA ACGTTTTGTG CTACATATTC TCTTTACTTG CGTTTCAGGA AGCCCATTCC AAGATAACTC GCTTCATTCT AATTTGCAAC TACGTCACGC GGGTAATAGA ACCCCTAGCC TCGCGTTGCG CTAAATTTCG ATTCCAATCC CTACCTCCGA GCAGTATGAA AGCCCGACTG GAATGGATCG CCAACGAACA AAATTGCTCC GAATCCGAGA AAGACTTGTT GGACGACATT TTAGAATATG CGGACGGTGA CATGCGTCAG GCCGTGACAA CTTTGCAGTC GGTCCACTCG TTGGCAGCTG GTGGTGCCAA GGTGGACAAG GCTGCCTTGG CCGAAATCGC CGGTCTACCA CCACCAGCTA TTGTGGATAT GCTTTGGACA GCTCTCCTCT CTAATTCCTT CGACACGATG GAGAAAGTGG TTGAAACTCT AAGTGCGGAA GGATTTTCGG CTCAGTTGCT ACTGTCAGCT CTCGTGCCCA AGCTCGTCAC TGACCAGGAC TTGAACGAAC TGTCCAAGGC CGAACTAGCA ATTCGAATCG CTGAAGCAGA GAAGAATATG ATTGAGGGAG GAGATGAGCA ATTGCAGCTG TTGACTGTTT GTAGCTTGGC TGTGAGCTGC TTTGAACAAT CCAAGACAAT TAATCAATAG
|
Protein sequence | MSDELATQPT LVQDNRPWVE RYRPKSLQEV SHQTEVVATL QNAVTTGRLP HLLLYGPPGS GKTSVALALC RQLWHPSQWR RRVLELNASD ERGISVVRNK IKHFASLTVA KGNNDMENYP NPPFKIIILD EADTVTPDAQ AALRRIIEAH SKITRFILIC NYVTRVIEPL ASRCAKFRFQ SLPPSSMKAR LEWIANEQNC SESEKDLLDD ILEYADGDMR QAVTTLQSVH SLAAGGAKVD KAALAEIAGL PPPAIVDMLW TALLSNSFDT MEKVVETLSA EGFSAQLLLS ALVPKLVTDQ DLNELSKAEL AIRIAEAEKN MIEGGDEQLQ LLTVCSLAVS CFEQSKTINQ
|
| |