Gene PHATRDRAFT_16222 details


Gene Information

Locus tag: PHATRDRAFT_16222
Symbol:
ID: 7198349
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011692
Strand:
Start bp: 233294
End bp: 234856
Gene Length: 1563 bp
Protein Length: 521 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002184587
Protein GI: 219128788
COG category:
COG ID:
TIGRFAM ID:
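
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that matches the reported gene length). The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 233294
end_bp = 234856

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Each codon is three bases; a remainder of 0 means a whole number of codons.
codons, remainder = divmod(gene_length_bp, 3)

print(gene_length_bp)      # 1563
print(codons, remainder)   # 521 0
```

Under that assumption the span gives 1563 bp and 521 codons, which agrees with the listed gene length and protein length.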


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTATGTTG TTGGGACCCT TTTACCGCCC TTGCTGCCAA CAGTTTTTAC CGTTTCAGTC 
GGAATTTCTG ATCAGCGTTT GAGCAAAAAA AGGATTGCGT GTTCCAACTC CGAAGATATT
TTGGTTGCGG GAAAGGTCCA AACGGCATGC TTTGACAAGA CTGGGACGCT TACAAAACAA
GGTCTCGATT TCGTGTCCGC GCAGTGCATA CCAACATGGA ATGATCCTCA TTCTCCTTCG
TCCCCTCTAT CGGATACGGT AGCTCTCGGC ATGGCGTGTT GCCACAGTCT CACGACATCG
GGTGATGCCA TGATTGGCAA TGCCGTTGAC CGAGTGATGT TCGCTGCGTC GGGGGCACAA
CAGAATCAGT CCTGGATTGT TCTGAATGGA AACCGAATGA AAGTGCTCAA ACAATTTGAT
TTTTGCCACA ATCGCATGAC TCAGAGCGTG ATTGTAAAAC GAGTAGACGG CTCCATGCTG
GCAATTGTCA AGGGTAGTGG AGAGAATGTG CAACGCGCCT GCCTGCCTGC GAGCTTGCCG
CAGGATTACG AGAGGGTTTT GAGAGAAAGC GCAAAGGCTG GTATATACCA GATTTCAATG
GCTGCCAAAG TCCTGTCCCC GGCCACGAAC TTGGAAGATA TCCAACGCGA CAAAGTAGAG
CTTAATATGG AATTCGCCGG TGTGATAAAC TTTCAGAACG TGCTCCGTGA GGAAACACCG
TATGTGATCA CTCAGCTTCA GGCAGCAGCT GTTGAATGCC TCATTGTAAC CGGGGACGCG
GTTCTAACAG GCATCACTAT TGCGAGGGAG AGCGGTATCA TCCCTACGGG AGCGGCAGTA
TTATGGTGCG CTATGCCCCA CAAGGACGAC CGTGTCGAAT GGGTCGATTT CGATCATGAA
GGGCGCATGA CGGATTTGCC ATGGTCAGCT TTACGCTCGG GGACGACGGT GTTGGCAGTC
ACGGGTGATG TCTGGGACTC CCTCGATATC TCATTTGTGT CGGAGCTGAG TCCGTTTGTC
CGCGTTTTTG GAAGGTGCAC ACCCGCACAC AAGGTCGCGA TCATCTCGCA TTATTGCGAT
CAAGGCAAGA TCACGCTCAT GTGTGGCGAT GGAGGCAACG ACTGTGGAGC GCTCAAGGCT
GCACACGTGG GTGTGGCTCT TAGCGATGCG GAGGCCAGTA TGGTGTCCCC TTTCACCAGT
TTGGACAAGT CAATTGTGTC GGTGACGGAG ATCCTCAAGG AAGGACGGTG TGCTTTGGCG
TCGGCACTGG CCTCGTACAA GTATGTGATA ATGTATGGTC AAGTAGAAGC AATCGCAAAC
GTCATGAATG CATACTTCAT GATAAACCTA TCAGAGTATT GTTGGATGTT CATGGACGGT
TTCTGGGTCA TTTCAATGTC TTTCACTTTG CCGCTCGGCA AAGCCGCTTC CGCTTTGGCC
GAAACTAGGC CTACCGCGTC CCTCCTTGGT CCCATTACTG CCTCGAGCGT CGTCGGTATT
CTTCTTATCA ACACTACTTT TGCAATTATT GCTCTCTGGA TTCTATTTCA TCAAGATTGG
TTC
 
Protein sequence
MYVVGTLLPP LLPTVFTVSV GISDQRLSKK RIACSNSEDI LVAGKVQTAC FDKTGTLTKQ 
GLDFVSAQCI PTWNDPHSPS SPLSDTVALG MACCHSLTTS GDAMIGNAVD RVMFAASGAQ
QNQSWIVLNG NRMKVLKQFD FCHNRMTQSV IVKRVDGSML AIVKGSGENV QRACLPASLP
QDYERVLRES AKAGIYQISM AAKVLSPATN LEDIQRDKVE LNMEFAGVIN FQNVLREETP
YVITQLQAAA VECLIVTGDA VLTGITIARE SGIIPTGAAV LWCAMPHKDD RVEWVDFDHE
GRMTDLPWSA LRSGTTVLAV TGDVWDSLDI SFVSELSPFV RVFGRCTPAH KVAIISHYCD
QGKITLMCGD GGNDCGALKA AHVGVALSDA EASMVSPFTS LDKSIVSVTE ILKEGRCALA
SALASYKYVI MYGQVEAIAN VMNAYFMINL SEYCWMFMDG FWVISMSFTL PLGKAASALA
ETRPTASLLG PITASSVVGI LLINTTFAII ALWILFHQDW F
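
Both sequences are displayed in space-separated blocks of 10 letters, 60 per line, so they need to be joined before any downstream use. The sketch below shows one possible way to do that and to compute a GC percentage from the joined DNA string. The helper names `clean_sequence` and `gc_percent` are hypothetical, and how the database itself computes its GC content field is not stated in the record.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    # str.split() with no arguments splits on any run of spaces or newlines.
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string (0.0 if empty)."""
    if not dna:
        return 0.0
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First displayed line of the gene sequence, copied as shown above.
first_line = "ATGTATGTTG TTGGGACCCT TTTACCGCCC TTGCTGCCAA CAGTTTTTAC CGTTTCAGTC"
dna = clean_sequence(first_line)
print(len(dna))  # 60
```

Passing the full gene sequence block through `clean_sequence` and then `gc_percent` would give a value to compare against the GC content field.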