Gene PHATRDRAFT_20837 details

Gene Information

Locus tag: PHATRDRAFT_20837
Symbol: ARP
ID: 7201813
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011678
Strand:
Start bp: 429055
End bp: 430756
Gene Length: 1702 bp
Protein Length: 463 aa
Translation table:
GC content: 58%
IMG OID:
Product: actin related protein
Protein accession: XP_002180828
Protein GI: 219120167
COG category:
COG ID:
TIGRFAM ID:
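
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene span includes the stop codon; neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 429055
end_bp = 430756
protein_length_aa = 463

# Assumption: inclusive 1-based coordinates, so length = end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Coding bases implied by the protein: one codon per residue plus one stop codon.
coding_bp = (protein_length_aa + 1) * 3

# Bases in the gene span not accounted for by coding sequence
# (introns, untranslated regions, or both).
non_coding_bp = gene_length_bp - coding_bp

print(gene_length_bp, coding_bp, non_coding_bp)  # 1702 1392 310
```

Under these assumptions the span reproduces the listed Gene Length of 1702 bp, and the 310 bp not explained by coding sequence is consistent with the record marking the gene as spliced.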


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGTAGAGT ATCGCACTCG GCGCTTCTCA CATTCTCTCG CAGCGAATAG TAATTTCCTC 
CCCTAGCGCT GACTGGAAAT AGAGTCTCGA CTCGTAAGGA AAGAAGTCCC TGTCACGCTG
TGAGTGCTCC CCCCGAGCCA ACATGTACTG TGGTGACGAA ACGGGATCCT TCGTCGGCGA
CGTCGGTTCC CATACCAGTC GGTTCGGTTA CGGCGGCGAG GACTGTCCCA AATATGTGGT
GCCGTCGTAC GTCGCTCGGA ACAAATCGCC GGACGACCGC GCGCGACGCT CTCCCGTACC
GAATGCGCCC CACCATCCCC GTTGGGCCGA GGCGGAACTC GCCAGCGCCC TGCGACAAGC
GCGAACAGAC GATAATTCGC ATCAACCGTT GGTCGATCCC GTGGCGTACC TAGCGCAGGG
TGATTCCGTA CAAGATTGGG ATGCTTACGA ACAGCTCTGG CAGTCGGCTT TTGACGTCAT
GCACGTGAGG GAGCGATACA AACACACCAA AGGAGGAGGG AATGTTCGCA AAGAAAAGAA
CAGTAACAGT ATAATTGCCA GCGACAATAC TGCAATTGGC GCATCTGGTG TCACGAGCAC
GACCATCCGC GACACGATCT CGCAAGACAG CCGCATCGTC CACCCTGTTC TGGCGTCCAC
ACCAGGATGC ACCTACAGTG TTGGAGTCGG AGCCAAAGCC ATGGCGTCGG CTCGTCGCCG
CGATCTCGTC CACCGCGTCG AATGTCTCAT GGAAAGTCTC GACTGCCCCG CGGCCTTTTT
GGCGCCAACT CCAATGCTGA GCGCCTTTGC CTACGGTCGT CAAACCGCAC TGGTCGTGGA
CGTGGGAGCG GGAGGTTGCG TCGCCACGCC CGTCGTGGAC GGGCTCTTGT TGACGCAGGC
GCAACGACGC AACGGTAGGG GTGGGGACTG GCTGGGGAAC GTCACCTGGC AAGCCTTGCT
GGAACAGCGG ACCATTGTCC GTCCACGCTA TCAACAACAC GCCAGTTTTA AACCCGACGA
GTCGGCAGCG AAAAACGGGA TCTTTCATCG GTGGGCCATG CAAGATCTCA TGTACGAATT
TAGAACGTCC GGTAACGTTG CGGTACCGGC ATGGTGGTAC GACGAAACAG TACCGTTTTG
CAAGAGCCCA GCTACCGAAG CCGGTGACGA AATAGTGATC GACCCCATTT CTCCTGGAGG
GTCCGAGTCC ATTACGTACG AACTCCCCGA CGGTACGCTG GTGGACTTGA CCAATCGAGT
TGGCCGAGAC TTGTGTCGCG TTCCAGAATT GCTCTTTACC GATCAAGTAC CCTTCGTCTC
GGCCGATCAG ATCAGCAACT CGAGCGTGCT TATGGAACAC GAGTCACTAA CGAACTTGCC
CCTCCACAAG CTCGTGCACG AGTCCCTGGC GGCTGTAGGT GACGTCGACG TCCGCAAGGA
CTTGGCGGCC AATATTGTTT TGACGGGGGC GTCCTCCCTA TTGCCCAATA TGGAGCAGCG
ACTATCTTTG GAAACGTCCC GGATGACGTC GAGCGCATAC AAGTGCAAAG TGCTGGCCTC
TCGACACGCC GTCGAACGGT CCTGTGCTGC GTGGATTGGT GGTAGCATCC TCAGCAGTCT
CGGTAGTTTT CAGCAATTAT GGCTGAGCCG GACCGAGTAC GAAGAATACG GCGCGACGCT
GGCGATTCAG CGCTTTCCTT AA
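
The gene sequence is printed in blocks of 10 bases, 60 per line, so the whitespace has to be removed before the sequence can be compared with the Gene Length and GC content fields. The following is a minimal sketch; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the final line of the sequence is used to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks, and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Final line of the gene sequence, exactly as printed in the record.
last_line = "GGCGATTCAG CGCTTTCCTT AA"
seq = clean_sequence(last_line)

print(len(seq), gc_fraction(seq))  # 22 0.5
```

Passing the complete sequence text through the same two functions gives the values to compare against the record's length and GC content fields.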
 
Protein sequence
MYCGDETGSF VGDVGSHTSR FGYGGEDCPK YVVPSYVARN KSPDDRARRS PVPNAPHHPR 
WAEAELASAL RQARTDDNSH QPLVDPVAYL AQGDSVQDWD AYEQLWHRIV HPVLASTPGC
TYSVGVGAKA MASARRRDLV HRVECLMESL DCPAAFLAPT PMLSAFAYGR QTALVVDVGA
GGCVATPVVD GLLLTQAQRR NGRGGDWLGN VTWQALLEQR TIVRPRYQQH ASFKPDESAA
KNGIFHRWAM QDLMYEFRTS GNVAVPAWWY DETVPFCKSP ATEAGDEIVI DPISPGGSES
ITYELPDGTL VDLTNRVGRD LCRVPELLFT DQVPFVSADQ ISNSSVLMEH ESLTNLPLHK
LVHESLAAVG DVDVRKDLAA NIVLTGASSL LPNMEQRLSL ETSRMTSSAY KCKVLASRHA
VERSCAAWIG GSILSSLGSF QQLWLSRTEY EEYGATLAIQ RFP
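
The start of the protein can be traced back to the gene sequence, which contains the run `ATGTACTGTG GTGACGAAAC GGGATCCTTC` (shown here without the original block boundaries). The sketch below translates that run, assuming the standard genetic code (NCBI translation table 1); the record leaves its Translation table field blank, so this is an assumption. `CODON_TABLE` and `translate` are illustrative names.

```python
from itertools import product

# Standard genetic code (NCBI table 1), listed in TCAG order for each codon position.
# Assumption: the record does not state which translation table applies.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate complete codons from the start of dna; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First ten codons of the coding region, copied from the gene sequence.
first_codons = "ATGTACTGTG GTGACGAAAC GGGATCCTTC"
print(translate(first_codons))  # MYCGDETGSF
```

The output matches the first ten residues of the protein sequence. Because the record marks the gene as spliced, translating the full gene sequence in one pass should not be expected to reproduce the whole protein.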