Gene PHATRDRAFT_21296 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_21296 
Symbol 
ID7202116 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp227206 
End bp228744 
Gene Length1539 bp 
Protein Length437 aa 
Translation table 
GC content52% 
IMG OID 
Productbiotin synthase 
Protein accessionXP_002181331 
Protein GI219121975 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTTGGCGCTT CGCGTCACGA CAAACACGCA CAGTTTGGGC CATTCGATCC CTTGACTTTC 
TCTTTGTAGC ATCGCTATCA TGATACCGTC TCGACTTGCT TCATCTGCAG CTCGCCGTCT
TTCTCAATGG CGACCTTCGG TGTCGTCGAC GGGAGGGATG CGTTGTTTGT CGCTCTCCCC
GCAAGCTTTG GATACCTCCG TGTCGTCCGA CGTGATTATA GATCGATACG TGGCGTCGCG
TGTTGTTGCC GAAGAAAGTC CCCGCTTACC CAACGACTCG TACGAACGCA ACGCTCTTTA
CGAAGCAGAG GAAGAGCGTT CTATGGGTGG TGTTACGGCG AGTGTCCGCC AGAATTGGAC
ACGTGACGAA ATTGCCGAAA TATTCCATCA ACCTTTCTAC GAGCTCATGT ATAAGGCTTC
CACCGTCCAC CGCATGTACT GGGATCCTTC GGAGGTCCAG CAGTGTACGT TACTTTCCAT
CAAGACGGGC GGTTGTTCGG AAGATTGCTC GTACTGCTCG CAGTCAACTC GCCACAAAAC
ATTCGTCAAG CCCACTCCTA CTATGAAGGT CCAGGAAGTC TTGGAGGCGG CCCAGCGTGC
TAAGGAAGCG GGGTCGACCC GGTTCTGTAT GGGGGCGGCG TGGCGAGAAC TTGGTAACAA
GAAGAATGCC TTCGGCCATA TTCTTGAAAT GGTTCGTGGT GTCAACGGTA TGGGACTCGA
GGTTTGCTGC ACTCTCGGTA TGTTGAACGC GGAACAGGCG AAACAGCTCA AGGAGGCGGG
TCTATCAGCC TACAATCACA ACCTCGATAC AAGTCCCGAG CACTACCCCA AGGTCATTTC
TACCCGCAGC TACGAGGATC GTCTCAATAC AATCGCCAAT GTGCGTGATG CTGGAATCAG
CGTCTGCTGT GGAGGTATCT TGGGGTTGGG TGAAGAAGAA AAGGATCGGG TCGGACTCCT
TCACGTACTG GCGACTCTCC CGGAGCATCC CGAATCTGTT CCCATTAATG CTTTAGTAGC
CGTTGAAGGC ACACCTCTCG GTGACTCTGA GGATATATCC CGTGTAGATG CATTCGCCAT
GGCCCGTATG ATCGCTACCG CACGTATCGT AATGCCACGC ACCATGGTCC GTCTTTCCGC
TGGACGTCTG TCATTCAGTG ACGCCGAACA GTACCTCATG TTCCAAGCCG GTGCTAATTC
AATTTTCAAC GGCGACAAGC TCTTGACGAC AGCCAATCCC GAGTTTGACC AAGACCAGGC
ACTTTTCCGA AAGTTTGGTT TCCAAGGAAA GCCGGCACAC AAAGGTCCTC GGGTTGCTCC
CGCTGAAGAA CAGGGCAAAG TTGCTATTAC CAAGGTGCAG GGGACAAACA ACGTCGAGCA
GCAATACGCG TAAACGAGGC CGTATCCACA AAATATGTAA ACACACAGCC GCATGCATTG
GACATCATTT CTAGGTGTAC AGTGCTGTTT GCCACTGTGG TACACATCAC GATGAAAATG
TGAATAAATG AAACAAAATT AATCCTCTTA ATCAGTTGG
 
Protein sequence
MIPSRLASSA ARRLSQWRPS VSSTGGMRCL SLSPQALDTS VSSDVIIDRY VASRVVAEES 
PRLPNDSYER NALYEAEEER SMGGVTASVR QNWTRDEIAE IFHQPFYELM YKASTVHRMY
WDPSEVQQCT LLSIKTGGCS EDCSYCSQST RHKTFVKPTP TMKVQEVLEA AQRAKEAGST
RFCMGAAWRE LGNKKNAFGH ILEMVRGVNG MGLEVCCTLG MLNAEQAKQL KEAGLSAYNH
NLDTSPEHYP KVISTRSYED RLNTIANVRD AGISVCCGGI LGLGEEEKDR VGLLHVLATL
PEHPESVPIN ALVAVEGTPL GDSEDISRVD AFAMARMIAT ARIVMPRTMV RLSAGRLSFS
DAEQYLMFQA GANSIFNGDK LLTTANPEFD QDQALFRKFG FQGKPAHKGP RVAPAEEQGK
VAITKVQGTN NVEQQYA