Gene PHATRDRAFT_35989 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_35989 
Symbol 
ID7201340 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011677 
Strand
Start bp275110 
End bp276411 
Gene Length1302 bp 
Protein Length434 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180403 
Protein GI219119279 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGAAGG CCCAAAAAAA CTCCTCCTCC CGAAGCCGAC TGACGGAGAA TCCAGTGTTC 
TCGGACCTCC TTGGCAGTGA GATACGCATG TCGAAGGCGA GGGCTTGGTG CGAGCCAATA
GAATCAAAGA CGCTCTATCT TTCGGGAGAC GAGCGTTACG AGGTGTGGAC CATCAAGGCA
GCGGAACGAC CGCATCTTCT GGATGTCTGC TCCACAGATT CCAATCATCA GCAAGAGAAC
ACGAACGAGG ATGCTGCCGA CATGAACGAG ATTCTGGCTC TGACAGACGG CTGCGATGTT
TCGGGTTGGC GAAAGGAAAA GTTTGTCCGG ACAAACGCAG AGGTGTTGTC GGCTTTCAAC
GTGGGTTCCG GGTGTGTTTT GCTCAGACGT GAAACTCAGT TGCCGATTGT CCTATCGTCC
ATCGCCTCCT CGGCTTGGGA CTCGCTTCGC GACCGTCTTT TTCAGGAACC TTGCACGACA
GCATCCCGTC AATTGTACTC CTCTAATAGA GCGCTCCATC CTTGGTTTGA ACGAGAGAAC
GTCCCAGTAA TTCTCGATGG CTGTCCCGCT ATCGATAAAT GGGCCGCCAT GAAATCGTGT
CGCTTCGACA ACCTTGTGCA GCGATACGGA GATCTGGAAT GGCGATTTTC CGACACGCAT
GGGGAGACCA TAACTCTGAA GACCTATCAA AAGTACCTGC GCTCAATAGA AGGTTCCACA
GACGATGCAC CGCTGGCCGT ATACGACTCG CAATTCGGTG GCGATGACCG GTCGTCGCTG
CTGGACGATT ATACCGTCCC TTCCTGTTTT GACTCAGATT TGTTTGCCTC CGCCATCCCG
AACGAGGACG ATCGACCACC ATTCCGATGG TTGTTGATTG GACCTGCGAG ATCGGGAACG
GGACTACATA TTGATCCAGT AGGAACTCAT GCTTGGGTAA CACTGATCGA GGGGTGCAAA
CGGTGGATCC TCTTTCCGGC TGGGACCGAT CCAGAAGCGA TACACATGAG GGACCCCCAA
ATTCCTTCGG CTATTTGGTT TCGCGATTTC TATGATCAAG CTATGCGGGA TCATGCCGAT
GCGGTTGAAG TCTTGCAACG ACCGGGGGAA ACTGTTTTCG TTCCAGCGGG TTGGCCGCAT
CTTGTCTTGA ACCTGGAACT ATCTGTGGCA ATTACACACA ACTTTGCCAC GGAATATCCG
TCGCTTTTCC TACTAAACAA AGCAATTGCG CAAGCAGAGC CAGAGTTGGC CGGGCGTTTT
GAGATCGCAC TGAAGAGTTC GAGACCGGAT TTGTTTTCCA TT
 
Protein sequence
MAKAQKNSSS RSRLTENPVF SDLLGSEIRM SKARAWCEPI ESKTLYLSGD ERYEVWTIKA 
AERPHLLDVC STDSNHQQEN TNEDAADMNE ILALTDGCDV SGWRKEKFVR TNAEVLSAFN
VGSGCVLLRR ETQLPIVLSS IASSAWDSLR DRLFQEPCTT ASRQLYSSNR ALHPWFEREN
VPVILDGCPA IDKWAAMKSC RFDNLVQRYG DLEWRFSDTH GETITLKTYQ KYLRSIEGST
DDAPLAVYDS QFGGDDRSSL LDDYTVPSCF DSDLFASAIP NEDDRPPFRW LLIGPARSGT
GLHIDPVGTH AWVTLIEGCK RWILFPAGTD PEAIHMRDPQ IPSAIWFRDF YDQAMRDHAD
AVEVLQRPGE TVFVPAGWPH LVLNLELSVA ITHNFATEYP SLFLLNKAIA QAEPELAGRF
EIALKSSRPD LFSI