Gene PHATRDRAFT_49741 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49741 
Symbol 
ID7198432 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011692 
Strand
Start bp129071 
End bp130390 
Gene Length1320 bp 
Protein Length343 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184573 
Protein GI219128759 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAAACA AACTAATCCG AGATTCGACG TTCCCATTTT CTGAATTCTC CACGATGGTG 
ACGCCGTCGT TGGCCATGGC AGTAATAGCA TTCATTTGCA CCGGAAGCCG TGTCTCTAAA
CCTACGCATC TCACCAGACT TTTCAGTACG GTACCGTATA GAGAAAGCAC TCTGATATCG
TCCACTAAAT CCAATACAGT CAAGAAGATC CAAGCGCTTT TGAATAAACG GAAAAAACGA
CTAGAGCAAC GCGAGACCGT CGTGGAAGGA CCTCGGATGG TTGTCGACCT ACTGAAACAA
CCACAAACCC ATCATTTGGT TCGAAGCATC TTTATTGCAT CAGACAAGTG GGAGCAGTAC
TATCCTGAAA TCCTCCGGGC TGTCGGTGAA GACGAGAGCC GCCTACCACT CACGCTTCCT
GTGACGCCAC CGGTCTTTGC CGCGTGTAGC GACACCATCA CGCCACAGGG AATTCTGGCC
ATAACGGAAA TCCCCATTTT TTCATTGGAG AGAAGGGATG CTCCTCAAAA TCCGCTGTAT
TTGGTCCTCG ATGGCGTGTC CGATCCGGGG AATCTGGGAA CTCTACTACG CTCAAGTGTG
GCGACAGGTG TCGCTGGAGT GTTGCTGCTG CCCGGATCCT GCGATCCATG GGCACCGAAA
GCTATTCGAT CCGCCATGGG AACGACTTTT CAAATACCAG TCGAAACTTT TGAAAACTGG
GATGAATGCC GGGAAAAACT AGTACATTTG GGTTGCAACA ATTTTTGGGC GGCTACCATG
TTGGAAGACG GAGTCGGCAG GTCCCATTAC GAGGTTAACT GGTTGAGTGG GCCAAATGCT
TTGATCATTG GAAGTGAAGG AAACGGCCTG ACAAAACCGA TTCGGGATGA CCTAGCTGTC
ACATCACCAA GGTTGAAATC TGTCTACGTT CCCATGAAAG CCGGTATTGA GAGCTTGAAT
GCAGCCGTGT GTGGGAGTGT TATTCTGTTT GAATACATGC GTCAAGGGGA AACCAACGTA
TCCACAAAGT GAATTGTGTG GACCCCAAGA TCTTGTTCTG GAACACCAGG TCAAGCGCAT
AACTTCTCTT ACCCACTGAT TCTCGACCCA CCATATATTG GTAAGGCTTG GACAGGAGCG
AACTGTCTCT ATGCGTGCAC ACACCCGCAT AAACTTTAAA GCCCTACTTT TATCTAATTT
TCAAAGCTCA GGTTGGCCAC CAAAAGGTAT TTGGGAATTG GCACGAGACG ACGAGCATTG
GAAGGGGACA TAAAGACACT GATTTGGTCT ACTAGTAGAG ATGGTTTCTA TTCATCCGTG
 
Protein sequence
MRNKLIRDST FPFSEFSTMV TPSLAMAVIA FICTGSRVSK PTHLTRLFST VPYRESTLIS 
STKSNTVKKI QALLNKRKKR LEQRETVVEG PRMVVDLLKQ PQTHHLVRSI FIASDKWEQY
YPEILRAVGE DESRLPLTLP VTPPVFAACS DTITPQGILA ITEIPIFSLE RRDAPQNPLY
LVLDGVSDPG NLGTLLRSSV ATGVAGVLLL PGSCDPWAPK AIRSAMGTTF QIPVETFENW
DECREKLVHL GCNNFWAATM LEDGVGRSHY EVNWLSGPNA LIIGSEGNGL TKPIRDDLAV
TSPRLKSVYV PMKAGIESLN AAVCGSVILF EYMRQGETNV STK