Gene PHATRDRAFT_49988 details

Gene Information

Locus tag: PHATRDRAFT_49988
Symbol:
ID: 7198772
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011694
Strand:
Start bp: 43966
End bp: 45504
Gene Length: 1539 bp
Protein Length: 452 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002184881
Protein GI: 219129407
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGAAAGA ACCAATTCGT TCGCGGTTAT CTGTCCTATG TGCCAATCGC TGTTTGTGCG 
CTGTACCTCG TTGCATCGCC TGCGAAACTG GTAGCGGCTC ACAGTCAATG GAAGGTGCAG
AATAGGCATC GTCAAACGAA ACTGTATGAA TCAAGCAAAA GTCCTGCAGC CCGCAGCGTC
ATAGCAGATC TACGCGGAGG TTCACAGCCA AGGCACTCCT CTGCGGATGT CCGAGAAATT
AATTCTACTC TAGATTTCAC CAACCCTTCC CTGATAGAAG AGGGACCCCA TTTAAACGCG
TTGGATTCAC AGTCAGAAGC AGATCGGTTA CGGAATCTGG ATTTGAATAC ATCGGCAGCT
CTGCCAATTC CGGCATGGAG GGAGCATTTG CCTCCACCGC TGCGTCTAAA GAAAAATACA
TTGCAACGAG TCCGGATAAA GAACGTTGAG ATCTTCTTGT TGGGCACGGC ACACGTTTCC
AGCGATTCTA GCGAGGAAGT TAAACTTCTG CTCCGTCATG TGCATCCCGA CGCTATTTTC
GTTGAGCTTT GTGAAGCTCG CATACCTCTT CTTGAAGGAA CTGCGAAGGA CGAACACGAA
GAAGAAGCAT TGGCACACCA GAATCGCACG ATGCGTGAAA AGATACGGCA GGTACAGTCC
ACACAGGGAG GCTCCCGTCT TCAAGCTCTT TCCACAGTTT TGTTGACTTC TGTCCAAGAA
GACTATGCAT CCGAGTTGGG AGTAGAGCTG GGAGGCGAAT TTCGGGCCGC ATACCAATAC
TGGCAAGCGC AACAATCCAT ACCGACTGGA ACAAGTTCTC AATCTTGTGC TTTGATTTTG
GGCGATCGTC CTCTACAATT GACACTTGTA CGTGCCTGGG AGTCTCTCGG GTTTTGGCCC
AAGGTAAAGG TTTTGCTAGG TCTGCTTTGG AGCTCATGGC AAAAGCCGAA AAAGGAGGAA
ATCCAGGAGT GGCTACAGTC TGTGCTTCGG GACGAAACAG ATGTTCTCAC GGAAAGTCTG
AAAGAACTGC GCCGTCATTT CCCTACCCTT TTCACAGTAA TTATTGCAGA ACGTGATGCA
TGGCTAGCTG CCAAGCTTGT ACAAAGCTGT CGAGTATTAT CGGCCTCAGC AACAGCAGCT
TCTCCTGTAT GCACGGTCGT GGCCATCGTT GGTGCTGGAC ATATCCCGGG AATTGTAGCC
TGGCTGACCA CGCCTCCAGC CGATACGTCT ATCACGCCTG AAACAGTACT ACGCGACTTG
GTCACCACAA AGCGTTGGGC TCACGATGAC GCTATCCAAT TGCAAGCTAT CCCGGCGTGG
ATTTACGAAG TTTCTCACTT GCAGCCCAGT GCCTCGTAAA AAAACACGCC GGCGCGATGC
GAAATGCTTG GATGATACCT TCAACTTCGG AAGGGCCTCC TTTGCCCTTT TGTCTAATCA
ACTGATAAAC CTTTCTGACT ACCGCTTTTG ATCATTGCGA GGGAGATTGG ATTGGCTTTG
CAAAGCTTAA TCTAGTGTCA AACTTTAAAA TATCATCTC
 
Protein sequence
MGKNQFVRGY LSYVPIAVCA LYLVASPAKL VAAHSQWKVQ NRHRQTKLYE SSKSPAARSV 
IADLRGGSQP RHSSADVREI NSTLDFTNPS LIEEGPHLNA LDSQSEADRL RNLDLNTSAA
LPIPAWREHL PPPLRLKKNT LQRVRIKNVE IFLLGTAHVS SDSSEEVKLL LRHVHPDAIF
VELCEARIPL LEGTAKDEHE EEALAHQNRT MREKIRQVQS TQGGSRLQAL STVLLTSVQE
DYASELGVEL GGEFRAAYQY WQAQQSIPTG TSSQSCALIL GDRPLQLTLV RAWESLGFWP
KVKVLLGLLW SSWQKPKKEE IQEWLQSVLR DETDVLTESL KELRRHFPTL FTVIIAERDA
WLAAKLVQSC RVLSASATAA SPVCTVVAIV GAGHIPGIVA WLTTPPADTS ITPETVLRDL
VTTKRWAHDD AIQLQAIPAW IYEVSHLQPS AS