Gene PHATRDRAFT_12174 details


Gene Information

Locus tag: PHATRDRAFT_12174
Symbol:
ID: 7200679
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011675
Strand:
Start bp: 148944
End bp: 150800
Gene Length: 1857 bp
Protein Length: 585 aa
Translation table:
GC content: 54%
IMG OID:
Product: predicted protein
Protein accession: XP_002179590
Protein GI: 219117595
COG category:
COG ID:
TIGRFAM ID:
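
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes inclusive coordinates and a coding sequence that ends in a single stop codon not counted in the protein length. The variable names are illustrative and are not part of the database record.

```python
# Values copied from the Gene Information fields above.
start_bp = 148944
end_bp = 150800
protein_length_aa = 585

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one stop codon follows the last amino acid and is not
# counted in the protein length.
coding_length_bp = (protein_length_aa + 1) * 3

# Whatever remains of the genomic span is non-coding (intronic) sequence.
intron_length_bp = gene_length_bp - coding_length_bp

print(gene_length_bp, coding_length_bp, intron_length_bp)
```

Under these assumptions the span is 1857 bp, matching the Gene Length field, with 1758 bp of coding sequence and 99 bp left over, which is consistent with the gene being marked as spliced.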


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTGCGT TTGATCTCTT CCAGTCACAA CTCGAATCCA ACTCGACCGA AGCCGAAATT 
GACGCCATGA AGCGTTTAGC CGTTGTTGCC ATTACTATGG GAAAAGATGA CGCCCAAGCT
ACGCTCATAC CCTATCTAAC ACAGATAGGT ACCGCGCAGC CGCTTCCTTC GGACGAACTC
TTGCTTATTC TCGGACAGGA ACTGCCCGCC GTCGCCAAGT TCATCGGCCC CGCTTGTGTT
GTGGACTTTT TGCCCCTTCT CGAACGTCTC GCCGCGGTAG AGGAAACAGT CGTTCGCGAT
CAAGCCGTCG TGGCGCTGTG CGAACTCCTT GGACAGGCAG GGACCGGGCT GGACGCCATT
CCCTGGACGG CACTCGCCAA ACGTTTGGGC TCGGCTGACT GGTTCACCGC CAAAGTCTCC
GCTTGCGGCG TCGTAGCTTC TATTCTCCAA CTCAATAACA GTAATTCGGA AGAGTTACTC
GCGCTTTACA AAGATCTCTG TCAGGACGAG ACACCCATGG TGCGGCGGGC AGCAGCCAAG
CATATTGGCA AAGTTCTCGG TGTCGCTGGG TACGAGCAAC GTGATTTTTG CACCGCCACC
TTGCCCGTAC TCTGTCGGGA CGAACAAGAC TCGGTCCGAC TCTTGGCGAT CGGGTCCTTG
GCCGATGCGG GATCCAGCTT TTCTGTGCAT CCGTCGTGGA CTGCCACAAA TTGGTTGCCC
TTGGTCAAGG ACGGATCCAC CGATATGAGT TGGTACGTGT AAAAAAGCAG AGAGAGCTCG
CCTTGCAGCG TATGCAGCCC TCCCAATGGT CGCTTTCCCT GTCTCACCGT TTGTGATTTC
TTGACTCGTA GGCGTGTGCG AAACAATTTG GCCAAGAATT TCGCCAACGT TGCCAACAAC
CTTGGTTTTC AAAACGATCC TGACCAGCAG ACCGAGCAAA GTGTCGTTAT GGCTTGCTTC
GTGGCTCTCT TAATGGACTC GGAAGCGGAA GTCCGAGCGG CCTCCGTCGG TCACCTTTCC
AAAATGGTGT CTTGGGGCGG AGCGACTCAC TTTTCGAGCC ATCTCCAATC CTTGTTGCCG
GCGTTAGCCG ACGATGTGGT CATGGAAGTC CGCAGCAAGT GTGCTCTCGC ACTCATGAGC
GCCGCGCACA GCGGCGTCCT CGATGATGCG GTCATTCTCC AGAGCTTCGG TCCCTTGCTC
GAAAGCTTTT TACAAGACGA ATTCCAAGAA GTCCAGTTAC AAGTATTGAC CAATCTCGAC
AAGATTGCAC ATTTGCTGCC CGCACTGTCG GGCGTTGTGA CCAGTTTGCT GCAAATGTCC
AAGGCCAGCA ATTGGCGCGT ACGGGAAGCC GTCGCCCGGC TTTTGCCGCA TTTGGCCCAA
ACTCGTGGGC TCGACTTTTT TGCCAATGTT CTTTTGGAGC CCGCTTGGTT GACTCTCCTA
CTGGACCCGG TCGCCACTGT CCGCAATGCC ATTGTCCGCG GTATGCCATT GTTGGTAAGC
GCAACCGGGG AAGAATGGTT GACGTCCAAA TTGATACCGG AGCACGTACA AATTTTCAAC
CAAAATTCAT CGTCCTACCT CATTCGTATG ACAATTATAC AAGGTCACGT GGAAGCAGCC
GTGGCGCTGA AGGATGGCCC CCTGTGGAAT GAATTAATGG TGCTGCTACT GCGCGGCCTC
AATGATCGCG TTCCCAATGT ACGCATGGTG GCAGCGCAAG GCCTGGCTCA AGTTATGCGT
GAAGGCGATT CAAGTGTGAT CGAAGCTAAG CTCCGCCCTG CGTTGGAGAA GCGGTTGCAA
GAAGATAATG ATGAGGATTG CCGGCGTTGT ATTTCTCTAG CTCTGGAAGT GGAATAA
 
Protein sequence
MSAFDLFQSQ LESNSTEAEI DAMKRLAVVA ITMGKDDAQA TLIPYLTQIG TAQPLPSDEL 
LLILGQELPA VAKFIGPACV VDFLPLLERL AAVEETVVRD QAVVALCELL GQAGTGLDAI
PWTALAKRLG SADWFTAKVS ACGVVASILQ LNNSNSEELL ALYKDLCQDE TPMVRRAAAK
HIGKVLGVAG YEQRDFCTAT LPVLCRDEQD SVRLLAIGSL ADAGSSFSVH PSWTATNWLP
LVKDGSTDMS WRVRNNLAKN FANVANNLGF QNDPDQQTEQ SVVMACFVAL LMDSEAEVRA
ASVGHLSKMV SWGGATHFSS HLQSLLPALA DDVVMEVRSK CALALMSAAH SGVLDDAVIL
QSFGPLLESF LQDEFQEVQL QVLTNLDKIA HLLPALSGVV TSLLQMSKAS NWRVREAVAR
LLPHLAQTRG LDFFANVLLE PAWLTLLLDP VATVRNAIVR GMPLLVSATG EEWLTSKLIP
EHVQIFNQNS SSYLIRMTII QGHVEAAVAL KDGPLWNELM VLLLRGLNDR VPNVRMVAAQ
GLAQVMREGD SSVIEAKLRP ALEKRLQEDN DEDCRRCISL ALEVE