Gene PHATRDRAFT_9547 details

Gene Information

Locus tag: PHATRDRAFT_9547
Symbol: RFC4
ID: 7196349
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 860881
End bp: 862190
Gene Length: 1310 bp
Protein Length: 350 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002176669
Protein GI: 219109832
COG category:
COG ID:
TIGRFAM ID:
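
The reported gene length agrees with the start and end coordinates if both endpoints are counted as part of the feature. A minimal sketch of that arithmetic follows; the function name is hypothetical and the inclusive-endpoint convention is an assumption, not something the record states.

```python
def inclusive_span_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming both endpoints are inclusive."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(inclusive_span_length(860881, 862190))  # 1310
```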


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.606232
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTGACG AATTAGCCAC CCAGCCTACT CTGGTCCAGG ATAACAGGCC CTGGGTGGAA 
CGGTATCGAC CGAAATCGTT ACAGGAAGTG TCGCATCAAA CCGAAGTCGT CGCGACACTC
CAGAACGCGG TTACCACCGG CCGCTTGCCA CATCTGTTAC TTTATGGGCC TCCCGGCAGT
GGAAAGGTAC AAACTCAGAC TCGATTTGAT ACCATTTCGT CTTTGTTATT ATATCTTTTC
GTTCTTCTCA CGTTCGCAAC CAATCAAACA GACTTCCGTA GCTTTGGCGT TGTGTCGTCA
ACTCTGGCAT CCTTCGCAAT GGCGTCGCCG TGTATTAGAG TTAAACGCAA GTGATGAGCG
TGGGATAAGC GTAGTGCGCA ACAAAATCAA GCATTTTGCT AGTTTGACCG TGGCGAAAGG
AAACAGTAAC GTTTCGTCCA CCAAGAGCTT CTTTCTCAAG AAGAAAGACG CTGGTAGCGA
TGAGATGGAG ACAGACGATA TGGAGAACTA TCCCAATCCT CCTTTCAAGA TCATCATTCT
TGATGAGGCC GATACAGTCA CACCCGACGC ACAAGCTGCT CTACGACGAA TCATTGTAAG
TCGTCTTTTA TGATGAAGCT TGTTGTGCTT TGTTCGACAA AACATACTGA GACGGGGCTA
ACGTTTTGTG CTACATATTC TCTTTACTTG CGTTTCAGGA AGCCCATTCC AAGATAACTC
GCTTCATTCT AATTTGCAAC TACGTCACGC GGGTAATAGA ACCCCTAGCC TCGCGTTGCG
CTAAATTTCG ATTCCAATCC CTACCTCCGA GCAGTATGAA AGCCCGACTG GAATGGATCG
CCAACGAACA AAATTGCTCC GAATCCGAGA AAGACTTGTT GGACGACATT TTAGAATATG
CGGACGGTGA CATGCGTCAG GCCGTGACAA CTTTGCAGTC GGTCCACTCG TTGGCAGCTG
GTGGTGCCAA GGTGGACAAG GCTGCCTTGG CCGAAATCGC CGGTCTACCA CCACCAGCTA
TTGTGGATAT GCTTTGGACA GCTCTCCTCT CTAATTCCTT CGACACGATG GAGAAAGTGG
TTGAAACTCT AAGTGCGGAA GGATTTTCGG CTCAGTTGCT ACTGTCAGCT CTCGTGCCCA
AGCTCGTCAC TGACCAGGAC TTGAACGAAC TGTCCAAGGC CGAACTAGCA ATTCGAATCG
CTGAAGCAGA GAAGAATATG ATTGAGGGAG GAGATGAGCA ATTGCAGCTG TTGACTGTTT
GTAGCTTGGC TGTGAGCTGC TTTGAACAAT CCAAGACAAT TAATCAATAG
 
Protein sequence
MSDELATQPT LVQDNRPWVE RYRPKSLQEV SHQTEVVATL QNAVTTGRLP HLLLYGPPGS 
GKTSVALALC RQLWHPSQWR RRVLELNASD ERGISVVRNK IKHFASLTVA KGNNDMENYP
NPPFKIIILD EADTVTPDAQ AALRRIIEAH SKITRFILIC NYVTRVIEPL ASRCAKFRFQ
SLPPSSMKAR LEWIANEQNC SESEKDLLDD ILEYADGDMR QAVTTLQSVH SLAAGGAKVD
KAALAEIAGL PPPAIVDMLW TALLSNSFDT MEKVVETLSA EGFSAQLLLS ALVPKLVTDQ
DLNELSKAEL AIRIAEAEKN MIEGGDEQLQ LLTVCSLAVS CFEQSKTINQ
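
The sequences above are printed in space-separated groups of ten residues. The sketch below shows one way to strip that grouping and compute length and GC percentage for a nucleotide block. Both helper names are hypothetical, and the GC calculation (G plus C over total length) is an assumed definition that may differ from how the record's GC content value was derived.

```python
def clean_sequence(block: str) -> str:
    """Remove all whitespace from a grouped sequence block and upper-case it."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in an already cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First line of the gene sequence block above, including its trailing space.
first_line = "ATGTCTGACG AATTAGCCAC CCAGCCTACT CTGGTCCAGG ATAACAGGCC CTGGGTGGAA "
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence block through `clean_sequence` and checking `len(...)` against the reported gene length is a quick way to confirm that no lines were lost in extraction.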