Gene OSTLU_43955 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_43955 
SymbolTPR5a 
ID5004336 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009365 
Strand
Start bp187844 
End bp189241 
Gene Length1398 bp 
Protein Length453 aa 
Translation table 
GC content59% 
IMG OID640419757 
ProductTRP-containing protein 
Protein accessionXP_001420268 
Protein GI145351836 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.0248757 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0716262 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GCGCAACAGA TGATGTCGAA CATGACGCCC GAACAGATGG CGCAGATGCA ACGCATGGCG 
GGGTCGATGG GCCTCGGCGC GCCGCCGGGC GCGGCGGAGG CGATGAAAAA TATGACGGCG
GAGGACATGC GACGAGCGGC GCAAGAGATG GGGAACATGA CGCCGGAACA GTTGAAGACG
CAGTACGAAC AGGCGCAGGG ACACGCGAAA GCGAGCGCGG ATTATCGATA CGCGGGGAGC
GAGACGCTGA AGAAGGAGGG GAATAAACTC GTGGGAGAGG GGAAACACGC GGATGCGGTG
GAGAAGTACG CGCGAGTGAA GGAAAACTTG AAGGATGATG TGAACGCGGC GGCGAAGACG
CTGCGGTTGT CGTGCATGTT GAACATGGCG CTTTGTTTTA ACAAGATTGG GAAGCACGAC
GGGGCGATTA GCGAGTGCAC CGAGGCGCTC GAGCTCGAAC CGCGGAGCTT GAAGGCGTAC
TATCGACGCG GACAAGCGTA CGTGGCGAAG GGTGAGCTCG AACAAGGTGT GAATGATTTG
ATGCGGGCGA ATAAGCTCAG TCCTGGAGAT GAGACAGTGG CGGGAGAGCT CGAAGCAGCC
GTCAAGCGCA TGGAATCGCA AGGATTAGCC GTGCCGGCCG CCGCGCCGGA ATTCGACCAT
CCCGAGGCGG CGCCCACGAC GAGCGCCGGC TCGAGCGGTA GTGCGATGCC GGGGATGCCT
GGGATGCCGA CGCTCACGCC TGATGTTCAA GCACAGGTGA GCGCGATGAT GAGCGACCCG
AACGCCATGG AGCAAATGTC GTCGATGATG GGCAACTTGT CCGACGATCA AATCGAGCAG
ATGGCGGCGA CGAATCCCAT GATGGCGGGC ATGGACCCGG ATCACGTGAA AAAGGCAGCG
GGCATGATGA AGAACATGAA GCCGGAGACT ATGCAAAGCA TGATGAAGAT GGCGCAGAGC
ATGGGCACGG AGGGCGGGAA AGGCTTCGAC CCGAACGACC CCGAAATGAT GTCCAAGATG
CAAAAGGAGT TGAACAACCC GGAGATGCGC GAGGCCATGG TTGAAATGAT CCAGGGTATG
GATTCGGAGT CTTTGAAGGA GATGTCCAAG AGCATGGGCA TGACGATGGA CGACGCGCAA
GCTGAGCAAG CGGTCAACGC TCTCAAGAAC ATTTCGCCAA AGACCATGGA ACGGATGCTC
TCAATGGCTT CCGTCGCGGG CGGGATCTAC AGTCGATTCA AGCGACCGAT CGATTGGGCT
ATGCGCAACA AGCGCACGGC GCTCAGCATC TTTGTCGTGT TCACGGCCAT GGGGACGACG
TACGTGCTGC GATGGTGGCG TCGCCGCGGA GGAGCCGTGG AAGCGGCTGA CAACCAAACC
TCGACCTCAA CGTTTTAA
 
Protein sequence
AQQMMSNMTP EQMAQMQRMA GSMGLGAPPG AAEAMKNMTA EDMRRAAQEM GNMTPEQLKT 
QYEQAQGHAK ASADYRYAGS ETLKKEGNKL VGEGKHADAV EKYARVKENL KDDVNAAAKT
LRLSCMLNMA LCFNKIGKHD GAISECTEAL ELEPRSLKAY YRRGQAYVAK GELEQGVNDL
MRANKLSPGD ETVAGELEAA VKRMESQGLA VPAAAPEFDH PEAAPTTSAG SSGSAMPGMP
GMPTLTPDVQ AQQMSSMMGN LSDDQIEQMA ATNPMMAGMD PDHVKKAAGM MKNMKPETMQ
SMMKMAQSMG TEGGKGFDPN DPEMMSKMQK ELNNPEMREA MVEMIQGMDS ESLKEMSKSM
GMTMDDAQAE QAVNALKNIS PKTMERMLSM ASVAGGIYSR FKRPIDWAMR NKRTALSIFV
VFTAMGTTYV LRWWRRRGGA VEAADNQTST STF