Gene PHATRDRAFT_37843 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_37843 
Symbol 
ID7202648 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011682 
Strand
Start bp197775 
End bp199514 
Gene Length1740 bp 
Protein Length579 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182024 
Protein GI219123422 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGGGT TACCGCAAGG CCAGTCGGCC CCGACAACTC CAAGCTCCAA GTCGATGAGC 
AAGCAGACCC TGGCTCAGTC CAAAACGGGC TACGGTCGCC AGGGCCTCTC CCGAGGACCG
ACTATGATGG TGCCCCGTCC ACAGCGTTCG CGCTTTGCCG ACCCACCTCG GAGGCATTGT
CAACCAGGTA GAGCCTCTCG AAACGGTGTG GCGTCTATTA AACAGGTCTC GTTTTGCAAC
GAAAGCATTG GAGTATCCCA GCTAACGCAA GATTCTACGA GCACGACGCC TCCTTCCTTT
GGAGGATATC CTCCTTTTCA GCAAAGCTGG GCGTCTTCCT GCAACACGGC ACAGCAAGAG
CGGTCCGTGT CTTCCTATAC GTACTCCAGC AGTGCTTCGG TAACTGCAGG GCAAGGGAGG
CCCAGCTTAT GTCCCGAATC ATCGAAGAGC CTTCATCAGT CGTCATTGGT CTCTGTAGCG
TGTTCAGAGA AGTCAAAGGG ACTGTCGAGC GCTTCCCGTG CCGCCTGTAA TCGTAATATG
CTGCGGTCTA TGCTCAAGCC CACATTTGCC ATGAGCCGTG CGCTTGTTCA ACGCCCCGCC
CTGCTTCCAA TTCCTCACCA GCGTTCTCTC TCAACACAAG CAAGCATTGC CAGTGCTCGA
CAAATTTTGA CACCATCTGA CAAATCAAAT CATGCAAAGG TCGGACCGCA TATCGTTGCT
CACGGCCAGT CTGTTATCAA GAAGACTAGC TTAGATGACG ACGAAAAGCG CACTCCGAAC
ATCGGAGTCT CTCTCGATAC ACATCGACTT ATCAAGTCGA TTGTTTTCGA AGAATTCAAA
TCACACTTTT CGGAACGTGC CTTGCAAGTC GATGAGAAAG AGAATAGCAT TGCTAAAAAA
TTGCTTGAAA TGCGCACAAT AGCGGAATCG TACGCCGTTG CCACCATACA GCACGGTGAG
CGTGTCAAAT ACCTCGACCA GAAGACCCAG GAAGTAGACG AGAAACTTTC AAAGGTCTCA
AATATGCTTG GAAAGGCCAA CGAAGCTCTT TCAACCGTCA CTGACATCGC CGAATCAGCA
ATTATCAAGG TTGAGCTAGC TAGGGATACG ATTGTGGCTT CGGCCTTACC CTTTGTGAAG
AACGCCGTCT TCCAAATGGC CGAAAGCCTT TTTCAGCGCA ACCGAACTTC CTCCCTGACT
AATACGAGTC TATCGTCCCT TTCTCAATCC CAAGAATCAG CTTTTGTTGA AGAAGAGAAT
CCCGAGAATC GCGTACCTCC CCCAAACAAA ATCACAAGGG GCAATGGCGT GCTCATTTCG
AAGCACAAGC GCAAGCAGCC CGATCCCAAG GTACTCGCGA AGGGAGCTAA AAAGCACCGC
AAATGGCCAA GTCTACCAGC GCGTAAGACC TCGACTGCGA CAAAGATATG TGCCAATGAA
GTTGCTTGCT CTCCGTTCAA GCCACTCGAC TTCGTCACTG TAGAAAATTG CAAAGACGGC
CATCCTGTTA CACCCTGCGG CAAAGGAAAG ATTGGCCACA TGAGCTGGTG GGATGTAAAT
TCGGACGAAG AAGAGCTTCA TTGGGGCAGC ACTTCCACCA GTCCACTGTG CGTGTCAAAG
ACTGCCACCA AAAGCGGATG CAAGCGTCGT TGTGCTGACC GATCTAATAG TAAAAAGCAT
CGCAGCGCCT TCGGCAGTCG CAACACCGAA ATTTTGAACG ACATAGACGG CTTTCTCTAA
 
Protein sequence
MDGLPQGQSA PTTPSSKSMS KQTLAQSKTG YGRQGLSRGP TMMVPRPQRS RFADPPRRHC 
QPGRASRNGV ASIKQVSFCN ESIGVSQLTQ DSTSTTPPSF GGYPPFQQSW ASSCNTAQQE
RSVSSYTYSS SASVTAGQGR PSLCPESSKS LHQSSLVSVA CSEKSKGLSS ASRAACNRNM
LRSMLKPTFA MSRALVQRPA LLPIPHQRSL STQASIASAR QILTPSDKSN HAKVGPHIVA
HGQSVIKKTS LDDDEKRTPN IGVSLDTHRL IKSIVFEEFK SHFSERALQV DEKENSIAKK
LLEMRTIAES YAVATIQHGE RVKYLDQKTQ EVDEKLSKVS NMLGKANEAL STVTDIAESA
IIKVELARDT IVASALPFVK NAVFQMAESL FQRNRTSSLT NTSLSSLSQS QESAFVEEEN
PENRVPPPNK ITRGNGVLIS KHKRKQPDPK VLAKGAKKHR KWPSLPARKT STATKICANE
VACSPFKPLD FVTVENCKDG HPVTPCGKGK IGHMSWWDVN SDEEELHWGS TSTSPLCVSK
TATKSGCKRR CADRSNSKKH RSAFGSRNTE ILNDIDGFL