Gene PHATRDRAFT_50492 details


Gene Information

Locus tag: PHATRDRAFT_50492
Symbol:
ID: 7199326
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011698
Strand:
Start bp: 203493
End bp: 204863
Gene Length: 1371 bp
Protein Length: 314 aa
Translation table:
GC content: 52%
IMG OID:
Product: predicted protein
Protein accession: XP_002185399
Protein GI: 219130494
COG category:
COG ID:
TIGRFAM ID:
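The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (the record does not state its convention), and that the protein is encoded by 314 codons plus one stop codon. The function name span_length is illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a closed, 1-based interval [start_bp, end_bp] (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
gene_len = span_length(203493, 204863)

# 314 amino acids plus one stop codon, three bases each.
cds_len = 3 * (314 + 1)

print(gene_len, cds_len, gene_len - cds_len)  # 1371 945 426
```

Under these assumptions the span matches the reported Gene Length of 1371 bp, and the 426 bp not accounted for by the coding region are presumably untranslated flanking sequence, since the record marks the gene as unspliced.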


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value: 0.498831
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GGGAAGAATC CAGCGATCAA CGAGGAAAAC GATCATTTTC GGTTTGCTCG GATTTTGATT 
CCCCCCCGGT CTCCCGTAAC ACACAAGACT CTGACTGGAT CCTTGTCCTT TGTTCCTTTC
TAGAGACTGG CAAGTGAATA CTACTTCACC ATGCTTGCTC ACAAGTTTCA TTCACGTCCT
TCATTGATCT CTTGCAACAG CACGCACGGA ATACGTTCCG ATTCGAATCG CGGGTTTGAC
AACTCCACCA CGTCGCTTTC CGTGGAGTCG CGGTCTCATC CGGGGCCATC GTGGAAACGT
GCGCTGCCCG TGTCTGGAAT GCTGGATGCA TCCGCCGCTC TCTTGCTTCT CCAAGTTGGC
AAGATTGCTA CCGACGAAGT TCAGAAAGAT TCTCTGGTGT ACCAGCCATC CTTGTTCGGA
AACGTTCCTC CCCACACGGA GAGCTCCGTT TTCTCCTCCG ATGAAGATTC CGATCGAGCA
GAATTTGTTC CGGGGGAAGC CCGCATAAGT GCGCGCCACC GATGCCGTAC CGTCTCGGTT
GATGTGCTGG AATATCAAGG ACGACGAACG CAAACACAAC ATTCGCAGCG ACCCCGTCTC
CTTGCTGGTC CCGTGATTCA TACCGTCCCA CCTTCTCCCA CGGGACCAAA GCCGATCAAG
TCTCTAAGTA CAACTACAAC TGCTTCGACC CTGTCCCTGG ACGGAATCAC TACTAAAAAC
CCGAGTACTT CCGCAAGCGA ACCTTTGGTC GTTCCCCACA AGCCTTCCGC GAATCTCGTG
GGGATCACCA CCGCACACGG AAGGAGCGTC AAGGGTGTGC TACGGCGTAA ATTCTCTTGG
AAGAATTTCC CCGAACTGGA AACCTACCTG ATCGATAATC GTCAACAGTA TTTGCAGTAC
AGTTCCCAGC TAAACTACAC TTCGGAACAA AAGCGCTACA ACAACCGTCT CACGCAAGGC
CTGCTGGATT TGGCTGCCGA GGAAGGTTAC GTCTTTGAAG ACTTTACCTT CGCGGCCATT
CGCGACCGGA TTCGTTGCTA CTACAAATCC TGCGTGCAGG CCGCCAAGAA GAAAAAGCGC
AAGCGTCGCA AGTAAACGCA AAAGTCACGC GCAAGCGTTG TTGACCCAAA CCCGTTTTTC
GGATTGTGTC TGCGGAGCCA GACATTTTCC GCCCTCCACC GTGCATACGC CATTCATCTC
CATATCGTCC ACGCAAACCT GCACACGAAC CTACACGTTC ACAACCCTCT CACTACTATA
TTTCTTGCAT ATATACAAGC GTCAAGTGCC AGTCTCGGCC AAGTCGCGAG TTTTGATCCA
TTTTTCGTCT AGCGAGTAAA TCAACTTTAC TAGTCTCTGA TTGCATACCA A
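
The GC content field in the record can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is pasted in as a string with the same space-separated blocks of ten bases; the helper name gc_percent is hypothetical, and the database may round or compute the value differently.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T bases of seq.

    Whitespace and line breaks (as in the blocked layout above) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Toy input in the same blocked layout: 4 of 8 bases are G or C.
print(gc_percent("GGCC ATAT"))  # 50.0
```

Passing the full 1371 bp gene sequence to gc_percent should give a value close to the reported 52%, up to whatever rounding the database applies.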
 
Protein sequence
MLAHKFHSRP SLISCNSTHG IRSDSNRGFD NSTTSLSVES RSHPGPSWKR ALPVSGMLDA 
SAALLLLQVG KIATDEVQKD SLVYQPSLFG NVPPHTESSV FSSDEDSDRA EFVPGEARIS
ARHRCRTVSV DVLEYQGRRT QTQHSQRPRL LAGPVIHTVP PSPTGPKPIK SLSTTTTAST
LSLDGITTKN PSTSASEPLV VPHKPSANLV GITTAHGRSV KGVLRRKFSW KNFPELETYL
IDNRQQYLQY SSQLNYTSEQ KRYNNRLTQG LLDLAAEEGY VFEDFTFAAI RDRIRCYYKS
CVQAAKKKKR KRRK
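
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the standard genetic code (NCBI translation table 1), because the Translation table field of the record is empty, and it assumes translation starts at the first ATG and runs to the first in-frame stop codon. The function name translate_from_first_atg is illustrative only.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate_from_first_atg(dna: str) -> str:
    """Translate from the first ATG to the first in-frame stop codon.

    Whitespace is stripped first; codons outside the table become 'X'.
    """
    dna = "".join(dna.split()).upper()
    start = dna.find("ATG")
    if start < 0:
        return ""
    protein = []
    for i in range(start, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*":  # stop codon ends the reading frame
            break
        protein.append(aa)
    return "".join(protein)


# The first seven codons of the coding region above, followed by a stop codon.
print(translate_from_first_atg("CACC ATGCTTGCTC ACAAGTTTCA TTAA"))  # MLAHKFH
```

In the gene sequence above, the first ATG is at base 151 and the in-frame TAA stop ends at base 1095, so the same function applied to the full sequence should return a 314-residue protein beginning MLAHKFH, provided the standard code applies.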