Gene PHATRDRAFT_19089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_19089 
SymbolGEL3 
ID7197808 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011672 
Strand
Start bp932064 
End bp933474 
Gene Length1411 bp 
Protein Length373 aa 
Translation table 
GC content50% 
IMG OID 
Productgelosin/severin like protein 
Protein accessionXP_002178606 
Protein GI219115621 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TGCGAGACAA AAGCCTGGCA CCGGATCCAA CAGGCTCACA ATCACAACCC GTTCGCACAA 
GCAGTCCCGC GAACGAAAGA ACAAGTTCCG TCATGAGCCA ACGTGTCCCC TGGAAGGAAT
CTAACCTGGC CTTGATTGGT AGTGACCTGG ATCACAAGAT CAAAGCGGCT GCTGCTGAGA
ATGAAGAGCA GTGGCAGGGT CTGAACGAAG CGCCGGGCAG AAAAGTCTGG CGCATCGAAC
AGTTCAAGGT AGTCCCTTGG CCCGAGGATC AGTACGGAAA ATTTCACAAG GGAGATTCTT
ACGTTGTTTT GAACTCTTAT ACTGAGGACG GGAGTGATGC ACTCTTGCAC GATATACATA
TTTGGATTGG CTCTGAATCT TCGCAGGACG AGTACGGAAC GGCCGCTTAC AAGATGGTCG
AAGCCGATGA TTCGTTGGGT GGCGCCGCTA TCCAGCATCG AGAGGTTCAA GGCAAGGAGA
GCCCGGTAAG CATCCCATCC AACGATTGCA GTGCCAATTC GCACTCTGTA AGCATCTATA
TTCCCTAGCT CAACAACCTC TTGTTCTTTT TCTCGCGATC GCAGCTTTTT CAGTCCTACT
TTGAGGAATT GACTTATCTA GAAGGTGGTG CCGACACCGG ATTTAATGTC GTCGAGCCCA
CGAAGGACAA GCCGCATTTG TACCGGGTGA AGGGCACGGA AAAGGGAATG TCGCTCACCC
AGCTGTCTCT CTCCAAGTCG TCTCTGAATA CCGGAGATTC CTTTATTCTA TTCGCCAACG
GAAGCAACGT TTGGCTTTGG AACGGCGAGT CTGCTAACCC CGACGAAAAG GCCCGCGCGA
ACTCATTGGC TGAGAGCATG TGTACGCAGG GAACAGTCAA AGTTTTGGAT CAAGGTCAGG
GCGACGAAGA AGAGACCGAC TTTTGGGATT ACCTTGGTGA TGGCGAAATT CAAGAAGCCG
ATGATGGAGA TGAAGAGGTT GATGAGTTTA TTCCTCTCTT GTTCAAGCTC TCGGATAACC
CGGACGAAGA ACCTGAGCAG GTTGCGGAGG GTGAACCTGT GAAAGTTCGT TGGGGTAGTC
CTTCACCCAA GATAGATCGC TCCTTTCTGA ATGAGAACGA TGTATTTTTG CTCGACGCCG
GTTGGGAAAT TTTTGTTTGG ATCGGTACCG ATGCAGACCG CAGTGAGAAG CTTATGGCCA
TGGGCAAGGC GGATAGTTTT TGCAAACAGG ATCCTCGTAA GGCCGACCTC CCCGTCTCCA
TTGTGAAGAG CGGTTGGGAA AGCTCTGGAT TCAAGGCTTT CTTCAGCGAA TAGACGGTTG
GACTGGCAAC GATTGTTGCG AGAGATCTAG AATGTAGCTG ACTGACAGAG AATATGAGAT
AATATTTCAG AAATTTCCGT ATTACAATTA C
 
Protein sequence
MSQRVPWKES NLALIGSDLD HKIKAAAAEN EEQWQGLNEA PGRKVWRIEQ FKVVPWPEDQ 
YGKFHKGDSY VVLNSYTEDG SDALLHDIHI WIGSESSQDE YGTAAYKMVE ADDSLGGAAI
QHREVQGKES PLFQSYFEEL TYLEGGADTG FNVVEPTKDK PHLYRVKGTE KGMSLTQLSL
SKSSLNTGDS FILFANGSNV WLWNGESANP DEKARANSLA ESMCTQGTVK VLDQGQGDEE
ETDFWDYLGD GEIQEADDGD EEVDEFIPLL FKLSDNPDEE PEQVAEGEPV KVRWGSPSPK
IDRSFLNEND VFLLDAGWEI FVWIGTDADR SEKLMAMGKA DSFCKQDPRK ADLPVSIVKS
GWESSGFKAF FSE