Gene PHATRDRAFT_42721 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42721 
Symbol 
ID7196118 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp892321 
End bp894669 
Gene Length2349 bp 
Protein Length773 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002176677 
Protein GI219109848 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAACAGTCAG TGGAATATCG AGCACCAATG GCCGCCGCCA ACGTCAAAGA GGTATGGAAA 
CCTGTGCGAA AATCCGTCCT GTACCTCATT GCGGCAGTCG CGCTTGCAAT CTCGGCATCG
ACGGTTGGGG GTAACTCATT GCCGGCAGGT AGTAGAACGC AGGTCGAAGC CGTGGACGAT
GTCGGTACAA GTGCGCAAGA TAATAGTCAC TCCCGCCGCT TGCTAGGTTA TCAACGAGTC
ACGGTTTACA CAACAGCTCC GGAAGCCGAT CAGGACAATC CGATTGCACG CAAACAGTTC
GCCGACTTGA TCGCCTCCCT GCCCACGGAT GTGACAACCC TCGATGTTGT TGCCTTAAAC
TTAATGGAAC AACGCCATTT TCTGGAGCAG AATTGTTACG GGAAAGAGGA GGACAACATG
CCCGTAAAAA TGAGCGCTCT GAAGCGTTTC GACGAACTGC GTACGCGTGG ACAAGATCAT
CTCGCAACCG AAGTCTACAA GTGGTGCGCG TTGAAAACGA AACCCTACCT CTCTGACTCG
GTGGCGTACA TTGATTCATC CAGTCCATTG CTGATGCGAC TGCAAGACTT TTTGTCGGAC
GTACAAAATG TGGTAGTCTT AGGAGACGAT TATTTCCCCC ACACTGCTCA TGGCAGTCTG
ATTGTGTTGC GCGATAATCA ACTCTACGTA GCGGAACAGA TGATGAAAGT TCTTCTGGAA
ACTCCGGCGG AGCACTTGGA AGTAAACCCT CTTCTTTTGC CGCAGCGCTT GTACGAAGCG
GTGGCGTTCG CTTCCGCCAA AAAGCATCCT AAACCGGGCA AGGTGGGCGA TTTCCTACTG
TTGAAAGAGA CATGCCGAAT GGATCCCTTG AGGCGCCACA ATGATGATGC AAAAGGCTGG
ACGGATACAC AAACTCGTCG CCACGTACAC CACTGCCCCG AACGTGCAGG CTTTTGCTGC
TCAATTCACG ACATATCTCG TAACCGTGTC GTCCTCATGA GTCGTCATCC TCTGTTGCCA
TTTCAAATTC TGTCCCAATC ATTTTCTCGG CCTTACAACG CCGAAGCGGA TCACTATGAA
GAAGACGAGC TGCCCTACAT CACCACAATC CAGGCTACTT TTCACGAGCG GCCTGCGGAT
ATGCCCGAGA CACCTAACTT TTACGCCTTG CTTGCCGCGA AAGATTGCTT GCCCGACAAA
GAAGAATGCT CTAAGTGCTG CCGAAACCAG GCCGGTGCTA CAAGGGAACT ATGTGCCAAA
GACTGCCCTT GCTATGCGAA GGCTCTCTGT TCGGATAAAC CTCCACCAAA GCATGTTGCT
CATACGTGGA CTGTAACCCC ACCGGCCTAC ACGCGCGATC CCAACCGTGT TGTTCCCCGT
ATCGTACACC AAACTTGGTT CGAAGACCTA GCCCAGGACA GGTACCCAAA TATGAGCCGC
ATGGTCCAAT CGTTCCGTAA CTCTGGCTGG GAGTACAAAT TCTACAACGA CGATGACGCA
GTTAATTTTT TGAGTACGCA CTTTCCGCCG GAAGTGCGAG AAGCATACGA AGCTCTTCGC
CCTGGTGCCT TTAAGGCCGA TCTCTTTCGG TATTGTGTGC TGTTGATACA CGGTGGTCTC
TATGCTGACG TTGACATCAT GTTGGAATCG GCTTTGGATG CGGCCATTGG ACCGGATGTA
GGATTTATGG TTCCGACAGA CGAGCCCGGC ATGGCCACAA ATCACCGGAT GTGTCTATGG
AATGGAATGA TCGCCGCCGC TCCAGGTCAT CCGTATCTGG CGAAAGCCAT CGAAACCGTC
GTGAATCAAG TCCGAAACCG ATTTACATCT GTTGATATTG ATGCCACCCT CTGCCCCAAT
CCGGAGCTCT CCATCTCGCA CGCTTATGAT ACTCTCTTCA CTGCAGGACC GTGCTTGCTG
GGTGCCTCAA TCAATCGAGT TCTGGGACGG CATCCACAAA AATCCTTTAC AGCTGGGGAA
ATTAACATTT TGGCTGATCG CCGACAATTA GAAGCCGGTA CATCGTTTAT TGTCGGAGAC
GGTGTTGCAT TGGAGGCACG CGTGCCAGGA AGGAGTGTGA TTTTGAAGCA GGACAAGTGG
GATATGGGCG CGCATCGATT TACCTACGTG GAGCGCAACT TGGTTGTCTC GGCCACAGAT
CTGCAAGATT CAAATGATAG GGACACTCAC AAAAAGAACA AAAAAACAGA GCATTACAGC
ACGACACACG CCAAGACGGG TATTTATGGA TTGGAAGGTC TGTACACAGA CACACGTATT
GCAAATGAAG ATATTCGGAT TATACTGGAT GTTTCCAAGC AGGCTAGTGT CCCATCGTCT
ACGTCTTAG
 
Protein sequence
MAAANVKEVW KPVRKSVLYL IAAVALAISA STVGGNSLPA GSRTQVEAVD DVGTSAQDNS 
HSRRLLGYQR VTVYTTAPEA DQDNPIARKQ FADLIASLPT DVTTLDVVAL NLMEQRHFLE
QNCYGKEEDN MPVKMSALKR FDELRTRGQD HLATEVYKWC ALKTKPYLSD SVAYIDSSSP
LLMRLQDFLS DVQNVVVLGD DYFPHTAHGS LIVLRDNQLY VAEQMMKVLL ETPAEHLEVN
PLLLPQRLYE AVAFASAKKH PKPGKVGDFL LLKETCRMDP LRRHNDDAKG WTDTQTRRHV
HHCPERAGFC CSIHDISRNR VVLMSRHPLL PFQILSQSFS RPYNAEADHY EEDELPYITT
IQATFHERPA DMPETPNFYA LLAAKDCLPD KEECSKCCRN QAGATRELCA KDCPCYAKAL
CSDKPPPKHV AHTWTVTPPA YTRDPNRVVP RIVHQTWFED LAQDRYPNMS RMVQSFRNSG
WEYKFYNDDD AVNFLSTHFP PEVREAYEAL RPGAFKADLF RYCVLLIHGG LYADVDIMLE
SALDAAIGPD VGFMVPTDEP GMATNHRMCL WNGMIAAAPG HPYLAKAIET VVNQVRNRFT
SVDIDATLCP NPELSISHAY DTLFTAGPCL LGASINRVLG RHPQKSFTAG EINILADRRQ
LEAGTSFIVG DGVALEARVP GRSVILKQDK WDMGAHRFTY VERNLVVSAT DLQDSNDRDT
HKKNKKTEHY STTHAKTGIY GLEGLYTDTR IANEDIRIIL DVSKQASVPS STS