Gene PHATRDRAFT_37761 details

Gene Information

Locus tag: PHATRDRAFT_37761
Symbol:
ID: 7202767
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011682
Strand:
Start bp: 10414
End bp: 12051
Gene Length: 1638 bp
Protein Length: 507 aa
Translation table:
GC content: 46%
IMG OID:
Product: predicted protein
Protein accession: XP_002181824
Protein GI: 219123006
COG category:
COG ID:
TIGRFAM ID:
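
The length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the coding sequence ends with a single stop codon; the function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 10414
END_BP = 12051
PROTEIN_LENGTH_AA = 507


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_length_aa: int) -> int:
    """Bases needed to encode the protein plus one stop codon (assumed)."""
    return (protein_length_aa + 1) * 3


if __name__ == "__main__":
    total = gene_length(START_BP, END_BP)
    coding = coding_length(PROTEIN_LENGTH_AA)
    # Prints: 1638 1524 114
    print(total, coding, total - coding)
```

Under these assumptions the span matches the reported Gene Length of 1638 bp, and the 114 bp not accounted for by the 507 aa protein is consistent with the gene being marked as spliced.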


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAGAAG GAGATGTACG TTGTAAAATT GGGGGAAGGG TGACTGCAAA AGCTTGTCAC 
GTTGTTTTTC TTGGTGAATG TGCTCGTAGA TATGGTGCAT TGAAGACTAC CAAGGTCATT
GTTGGGACTG TTGTAGAGGT CACAACCACC AGGAAGCCTC AAACCAACCG TACTTCCACC
TTTGTTACTG CTGACTTTGA TTTGGGGGGT GGAGCAGTGA AGTGAAGCAC GCTGAACATC
CGTAGCGTCA AGGCCTTTGT ACCAGACTTG ACCACGTCAC CAAATGACGC TGATACAGCA
GCTGTGGCAG CGCCCAACAA TATGCTCGAA AACATTGATG CGCTTCCAAC CGTAACAGCG
GATACTGCAA CTGAGACTGA TTTGTTTCAT ATGGAAGCAG GGGTTGATCA ACATCCAATT
CTCGAACAAG ACACGGAGTC ACTGGTGCAG CTACCAGTAC TTCCAATAGT GCCCAATGCT
GACTTTGGTA ACAATAACTT TTCCGAGGCA GAAGCTGTCC CTGCTGCAGT AGCACATGGC
ACAAAGTGGT ATGAAGATGA TGAAGCTACT CTAAATGATA CAAATGGCAG TGTGCCAATA
AAAGACTTCG GTATTTCCAC GCCTGTTGGT GAAGTCTTGG GTCCAAACTC TGATATTGGC
GGAAAGTACT CGAGACTGGA ATACTTCCTT CTGATGTTTC CACCCAAACA GCTCACAACT
ATGTGTCAGC TAACAAATAA CGCTCTGGTG CAACAGAACA AGCACATCAT CACTACTGGC
GAGCTGCTTC GCTTTTTTGG AATAGTCATT CTGACAACAA AGTTTGAGTA CACAAGCCGA
TCCCAGCTGT GGTCAACAAC TGCACTTTCA AAATATATTC CTGCTCGATG CTTTGGACGG
ACAGGAATGT CAAGACAGCG ATTTAACGAT ATATGGCAAT GTCTTTGCTG GAGTGAGCAG
CCTCCTGAGC GGCCAGAAGG TATGAGTTCG CAGAGCTACA GATGGAAACG TGTTGATGGC
TTTGTAGCCA GGTACAATGA TCACCAAAGT ACAGCTTTCA AGCCCTCTCA CATGATTTGT
GTTGACGAGT CCATCTCTCG CTGGTATGGC CAAGGGGGGA ATTGGATTAA TCATGGGCTG
CCTATGTATG TTGCCATAGA TCGAAAGCCA GAGAACGGTT GCGAGATCCA AAATGCGGCA
TGTGGATGTT CCGGAATTAT GCTTCGGTTG AAACTGGTCA AGTCAAAGAC TGCTCGGGAA
GAAGGGGATG AGGGTGGTCT AAGCGACAAT CATCTTTTAC TTGGCACAAG GATTCTCAAA
GAGCTAGTTA CTCCTTGGGC ATGGACAAAC CAAGTTGTAT GTGCTGATTC CTATTTCGCT
TCTGTTGGTG CTGCATTGGA GTTGAGACAA ATAGGTTTGG GATTTATTGG GGTTGTGAAG
AGTGCAACAA AGCACTTTCC AATGGCTTAT CTTTCGAGAC TGGAGTTCAA TCATCGAGGA
GACCGAAAAG GATTGTTGAT GAAAGACGGA CTCAATGGAA GTAGCTTGAT GGCGTTTGTA
TGGATTGATC GTGATTGCCG ATACTTTATA TCAAGTGTGT CCAGTCTTGA TGCCGGCAGT
CCATTTGTTC GATATTGA
 
Protein sequence
MSEGDTTKVI VGTVVEVTTT RKPQTNRTST FVTADFDLGG GAVNVKAFVP DLTTSPNDAD 
TAAVAAPNNM LENIDALPTV TADTATETDL FHMEAGVDQH PILEQDTESL VQLPVLPIVP
NADFGNNNFS EAEAVPAAVA HGTKWYEDDE ATLNDTNGSV PIKDFGISTP VGEVLGPNSD
IGGKYSRLEY FLLMFPPKQL TTMCQLTNNA LVQQNKHIIT TGELLRFFGI VILTTKFEYT
SRSQLWSTTA LSKYIPARCF GRTGMSRQRF NDIWQCLCWS EQPPERPEGM SSQSYRWKRV
DGFVARYNDH QSTAFKPSHM ICVDESISRW YGQGGNWINH GLPMYVAIDR KPENGCEIQN
AACGCSGIML RLKLVKSKTA REEGDEGGLS DNHLLLGTRI LKELVTPWAW TNQVVCADSY
FASVGAALEL RQIGLGFIGV VKSATKHFPM AYLSRLEFNH RGDRKGLLMK DGLNGSSLMA
FVWIDRDCRY FISSVSSLDA GSPFVRY
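
The sequences above are listed in blocks of ten characters, sixty per line. The following is a minimal sketch of how one might strip that layout and compare a listing with the Gene Length and GC content fields; the helper names are hypothetical, and the sample input is only the first line of the gene sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First line of the gene sequence, copied verbatim from the listing.
    first_line = (
        "ATGTCAGAAG GAGATGTACG TTGTAAAATT "
        "GGGGGAAGGG TGACTGCAAA AGCTTGTCAC "
    )
    dna = clean_sequence(first_line)
    print(len(dna))  # 60 bases on a full line
    print(f"{gc_fraction(dna):.0%}")
```

Passing the full gene sequence block to `clean_sequence` should give a string whose length can be compared with the Gene Length field, and `gc_fraction` gives a value to compare with the reported GC content.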