Gene OSTLU_31731 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_31731 
Symbol 
ID5001861 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp438867 
End bp440252 
Gene Length1386 bp 
Protein Length461 aa 
Translation table 
GC content56% 
IMG OID640417282 
Productpredicted protein 
Protein accessionXP_001418015 
Protein GI145347099 
COG category[S] Function unknown 
COG ID[COG4399] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCGGC GATTCCCGCG CGCGGTGACG CTCGCGGCGA CGACGTACGT CGCCGCGACG 
CTCGCGCTCG CGCGCTGGAC CGACGACGAC GACGGCGCGA CGCGTCTCCG GCGACGACTG
CTCGCCGCTG ACTCGACCAC GCCGCTCTGG CGATACGCGC TCATCCCGTT CATCGCCGCC
GCCGTGGGCT GGGGGACGAA CGTCGTCGCG CTGAAGATGA CGTTCTATCC GCTCGAGTTC
TTCCCGGGGT TCTTGAGGTT TGCGCAAGTG AAAGGGCAGC CGTTCGGCGC GCTCGGCGGA
TGGCAAGGGA TCATCCCGAG CAAGGCGGGA GAGATGGCGG AGATATTGGT CGATCTCATG
ACGAAGAAAT TGATCGATAT CAAGGAAATT TTCACGAGGT TGGAGCCGAA AACGTTCGCG
AGCATCATGG ATCCCGAGAT GCGGTGCGTG ACGGAGGATA TATTTGAGAC GGTGCTCGCG
CGGGAGGCGC CGACGTTTTG GCAAGGATTG CCGAGAGTGG TGCGGGAGGA GATGGTCGCG
GAGGCCATGG CGCAATCGAG TGGGTTGTTG GAAGACATAA TCGCGGATTT GATGGAAAAT
GTGTACGACG TGCTGGATTT GAAGACGATG GTGGTGACGC TGGCGGTGAA TAATAAGGAC
AAGGTGGTCA ACATGTTTCG AGAAGTCGGC GCGAATGAGT TCGTATTCAT CGAGCGGAGC
GGGATTTACT TTGGTTTTGC GTTTGGTTTG GTGCAGATGG TGGTGTTTTA CTTTGTCGAC
AAGCATGCTC CGGAGCAGGG AGTGTGGTTG CTTCCATTTT TCGGATTCGC CGTGGGCTAC
CTCACGAATT TCGTCGCGTT GAAGGTGATT TTCCAGCCAA TCGAGCCAAA GCGCGTGTGC
GGCGTCACGT TGCACGGCGT GTTTTTGAGG CGCCAAAACG AAGTGAGCGA AGAGTTTGCG
CGCTTGAATC AACTTCACTT TTGCAACGCC GAGAACTTGT GGGAAGAGAT GATGAACGGA
ACGTACAAGG AAAAGTTTGA AGCCCTCGTG CGACGAAACG CCGAAAACTT TTTTGATAAA
GCCATCGGCT CGGTGACGAC GGCAAAGCTC ATCATCGGCG CGGAAAAGTA TGACGAAATC
AAGTGCACCA TCGTAGACAT GATTTTTGCT TCGATTCCCG ATTGCGTGCC CGTGACATAC
GATTATCAAA ACGAAGCGCT CGGCATCGAG GATACGGTGC GCGAGCGAAT GCAAAAGCTT
CCTGCAAAGG ATTTCGAGCG CGTTTTGCAT CCGGTTTTCG AGCAAGACGA AATCAAACTC
ATCGTCGTGG GTGGGGTATT AGGCGCTTTG ACGGGCGTAG CGCAGTATTT CTTAGCGTTC
GCATAG
 
Protein sequence
MRRRFPRAVT LAATTYVAAT LALARWTDDD DGATRLRRRL LAADSTTPLW RYALIPFIAA 
AVGWGTNVVA LKMTFYPLEF FPGFLRFAQV KGQPFGALGG WQGIIPSKAG EMAEILVDLM
TKKLIDIKEI FTRLEPKTFA SIMDPEMRCV TEDIFETVLA REAPTFWQGL PRVVREEMVA
EAMAQSSGLL EDIIADLMEN VYDVLDLKTM VVTLAVNNKD KVVNMFREVG ANEFVFIERS
GIYFGFAFGL VQMVVFYFVD KHAPEQGVWL LPFFGFAVGY LTNFVALKVI FQPIEPKRVC
GVTLHGVFLR RQNEVSEEFA RLNQLHFCNA ENLWEEMMNG TYKEKFEALV RRNAENFFDK
AIGSVTTAKL IIGAEKYDEI KCTIVDMIFA SIPDCVPVTY DYQNEALGIE DTVRERMQKL
PAKDFERVLH PVFEQDEIKL IVVGGVLGAL TGVAQYFLAF A