Gene PHATRDRAFT_42451 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42451 
Symbol 
ID7196655 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp104724 
End bp106887 
Gene Length2164 bp 
Protein Length681 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002176523 
Protein GI219109537 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AACGAGTTGA CAGTTACAAG TGCGATGATC TACGAACGTT CGCTCACCAT GTCGCAACCA 
CAGAGCTGGG AGAGTATAAC CGGTTTTGCT GCTTCTTTAG TGAACACAGC CAATGAGCTT
TTTAGCGGTG GAGATGAAGC AGAGGCCTTG CTTATGTTCG AGTCAGCGCA CCACGTTTTG
CGTACGCCAT CCGCCGAGGT TCTGGCAGTT GCGGCTAAGC AGTATCAGCT TAGCACCGAT
CGTCTGCACA ATAATGAATC TGATCGGTCA AAGGATGGTC CGATCTGCCC CGCTGATCTT
TATCAAGAAG ATGAATGTGA CGTCGGCCCG CGACTTCTCC GCTCACCAAT TCGCTTGGAC
GGGATTTATC CTTTGCATAG TGAAGCTGCT ATTGTCGAAA CTGCAATCGT CTTCAACAAA
GCTTTGGCAC AGCAGATTGG AAACGATGAA AACAGCGCGA AGCAAAACTA TCAGCTTGTG
CTAATGTCCC TACGATCATT CCTTTCCAGA CTGGCTCCAA GCTTGCCTCC GACACTCCTC
CTAGAACTAG GTATGCGAGC ACACAACAAT ATGGGATTAC TCAGTTACGC CAAGGGAGAT
GAAGATGCTG CTACTTCTCT TTTTCAAGCA GCATTGGTTT TTGCCAAGCA ACTCTCGGAA
ATTTCGAAGG CCTACAGTCT CGAATATGTA ACCGTCCTTT CTAACTGGTG CCGCGTAAGC
TGGATGAGGG GTGACATAAA TGACAATTTG TACAAGGGGT TGCGCGAAGT TCTCCGACTT
CGTTCCGCAA ATCTTTGTTG GGATCATCCA GATGTGGCGG CAGCCCATTA CAACTTGGCT
GTGGCTGAAT ACGCTAGGCA GCGGGAGGAA GAAGCAACTT TACATTTGCG CCAGTATCTA
CACGTTGCCT CGCACCGATC CGAAACCGGA CAACACGATG TCGACCCTAT TCCTGCTCTG
ATTCACTTGC TTTTGCTGCA GAACGAAGAG AAGGACGACA GCATATCACA AGACTTGGTT
CGGGGATTGC GCACACTCCA GGACAAACGT CAAGATCAGG GACCGCGTAG CTCCGATGTA
GCCTCGGTAC TAAACTTTAT TGGCACGCTG CTTTTTCACA AGCAAGATTT TGAGAACGCA
CTACTATTCT TTGAAGAAGA GCTGCGATTG GAAGAAACAA TGCAAGAATG CACGGACGAC
ATTTCGGTTG CGGTAACCTG CAATAACATT GGCCGCATCT TACAAGAGCT CGGTCGCCAC
CCGGAAGCGG TACACTTTTA CAAGCGCGCT CTCGAAAACC GAATACGGAG ATGTGAGTAC
GACATGGGAA GGCAAGGAAG CAAGAGCATG CCGTTTCAAA ATGCCCTTGG AAGCAAAACC
CTCTTCCGTG AATCTGTACT CGACAGTGTG GTACAACCTT GGGCTGATCC ATGATCGACT
CGGGGCCTAC ATTGACGCCA TTTCCTCCTT TCAAATGTCT TTGGAATTGC GTAAGACGAT
GCTCGGACGT GACCACCCCG ATATTGCCTG CCTACTGTAC AATATTGGAG TCCTGCAAAT
GGAACAGCAA CAGCTATCTG AAGCGTCGTT TTCCTTCAGA GAGGCGCTGC GTATCCGATG
CGTAGGCACC ACGGGTCAGC TCAACGATCG ACACGTAATA AAGACGCTGG AAAAGCTCTC
GTCTTTACAA AGAGCCAAAG GAAATATTAA CGGTGCTCTG GAAGCGTCTC GTGAAGTCTT
ACGTATTCAG GAAGTTACGG CCGAGTACGA TCATGTAACC CGGCTGAAGG ACACAGGTGT
GACGCTGCGA TCAATTGCGG AACTCCAGCA CGCAATTGGG AATCTTGACT GTGCCCTTGA
GGCTTCGATG GAAAGCGCTC GCAAACTCGA AGCAGCCAAC AGCGCTCGGT ATAGCCAAGA
TGAGACCATG AACAGCAAAA GTGTTTCCGA TTCGTTTGTC TACACGGAAC ACCAGGTCTC
ATCTTTACTT TTGGTCGGGA GTCTACACCA TGAACTGGGC GAGCCGATGC ACGCACTTCG
TGTCTTTGAG GAGGCCTTAC TGCTGATGGA AGAAGCCGAC GCTAGGCGAC TCGTGCCTTC
GAGCCACGCG TTGCGTGAAG TTACTTGCAT GTTAGCTCGC TCCCACTGTG CCCCGGTTGC
GTAA
 
Protein sequence
MIYERSLTMS QPQSWESITG FAASLVNTAN ELFSGGDEAE ALLMFESAHH VLRTPSAEVL 
AVAAKQYQLS TDRLHNNESD RSKDGPICPA DLYQEDECDV GPRLLRSPIR LDGIYPLHSE
AAIVETAIVF NKALAQQIGN DENSAKQNYQ LVLMSLRSFL SRLAPSLPPT LLLELGMRAH
NNMGLLSYAK GDEDAATSLF QAALVFAKQL SEISKAYSLE YVTVLSNWCR VSWMRGDIND
NLYKGLREVL RLRSANLCWD HPDVAAAHYN LAVAEYARQR EEEATLHLRQ YLHVASHRSE
TGQHDVDPIP ALIHLLLLQN EEKDDSISQD LVRGLRTLQD KRQDQGPRSS DVASVLNFIG
TLLFHKQDFE NALLFFEEEL RLEETMQECT DDISVAVTCN NIGRILQELG RHPEAVHFYK
RALENRIRRL WYNLGLIHDR LGAYIDAISS FQMSLELRKT MLGRDHPDIA CLLYNIGVLQ
MEQQQLSEAS FSFREALRIR CVGTTGQLND RHVIKTLEKL SSLQRAKGNI NGALEASREV
LRIQEVTAEY DHVTRLKDTG VTLRSIAELQ HAIGNLDCAL EASMESARKL EAANSARYSQ
DETMNSKSVS DSFVYTEHQV SSLLLVGSLH HELGEPMHAL RVFEEALLLM EEADARRLVP
SSHALREVTC MLARSHCAPV A