Gene OSTLU_34718 details


Gene Information

Locus tag: OSTLU_34718
Symbol:
ID: 5003762
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009363
Strand:
Start bp: 451829
End bp: 453370
Gene Length: 1542 bp
Protein Length: 475 aa
Translation table:
GC content: 65%
IMG OID: 640419183
Product: predicted protein
Protein accession: XP_001419811
Protein GI: 145350855
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG5207] Isopeptidase T
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.151191
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0293051
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCACG CCGCGCGCGT CGAGCGCGAC GACGACGACG ACGACGACGA CGACGACGCG 
CGACGCGCCG ACGGCGCGCC GACGGTCGGC GCGACGCGCG GACGCGTGCG ATTGTTCAAG
ACGACGCGCG ACGACGGATG CGCGACGACG ACGCGACGGA ACGCGAACGC GGCGTGCGTC
GTCGGCGTGC CGAGCGCGAT CGCGATCGCG GATTTCTGTC GATTCGTCGC GGGCGCGATG
GCGACGACGC GGTCGGTGCG CGCGGTGGCG GGACGCGGCG ACGGCGCGCG GTCGACGTAC
GACGCGGTGC TGGAGTTCGA CGACGGCGAC GCGGCGGACG CGTTCGTGGA GAATTATCAC
GGGCGGAGGT ACGCGATGGG ACGGGAGGAG ACGTGCGTGG CGGTGCGCGT GGTGCGCGTG
GAGGAGGGGG CGGAGGCGAG CGGGACGCTG GGAACGGAGG TGCCGACGTG TCCGGTGTGC
CTGGATAGGC TGGACGCGGA GGCGTCGGGA ATAGTGACGA CGATTTGCGA ACACGCGTTT
CACGCGGAGT GCCTGAGCGG GTGGGCGGAC GCGTCGTGTC CGGTGTGTCG GTACGCGCAC
GAGCCGGAGA GCAAGGCGCG GTGCGCGACG TGCGGTAAGG ATCACGATTT GTGGGTGTGC
TTGATTTGCG GCGAAGTGCG GTGCGGACGC TACGCGGGGG CGTGCGCGGT GAATCACTGG
ACGGAGACGA ATCACACGTA CGCGCTCGAG CTCGGAACGC AACGCGTGTG GGATTACGTC
TCAGATGGCT TCGTGCACCG GTTGATTCAG AGCAAGAGCG GACTCGTCGA ACTGACGCCG
CCGCCGTCGA CGCGAAGGGC GTCGTCTTCG GACGCGAGCG GCGCGGCGTG CTCGCCGATT
CGCGCGCCAG ACGTCGGCGA TTTGGACGCA CAACTCGAGG AAGCGCTGGT GGCGTCCAAA
CTCGACGCCA TCGCGAGCGA GTACGACTTA CTCTTGACGT CACAGCTCGA ATCGCAACGC
AAGTATTTCG AAGGCTTGCT CCAGACGGCC AACGCGAGGT GCGCGGGGAC GATCTCGCGA
GAAGACGAGG ACAGTAGGAA CGCCGCCGTG GTGGCGCGAG CGATGAGCGA AGCGAAGGAC
GCCAAACGTG AATTGAAAAT GCTTCAAAAG GCGAACGCGT CGCACGTGGC GTCGATTGAA
CAGCTCCGCG ACGAGCTCGA GCACGCGCAC GCGCTCAGCG ATACGTTGGC TGAAAACGTC
GAGACGCTCA GGGCGGAAGC GACGCGCGCT GAAAAGCGTA AAACGATCGA ATTGGCGATA
AAAGACGCGA GAATCAAAGA ACTCGAGGAA GAAAATCGCG ACTTGATGTT GTTCCTGGAC
ACGTCGAACA AACTGAGCGT CGACGCGTCG CTGGCGGAGG AGATAGCCGG CGGCACGGTC
GTGGGCATCG ACACCGACAC GACGCCGGAG CCGACGCCGT CTCGAAACCG CACGCACGAA
AGACTGAAAG GAAAAGTCGA AGCCGAGCGA AGGAAACTTT AA
 
Protein sequence
MAHAARVERD DDDDDDDDDA RRADGAPTVG ATRGRVRLFK TTRDDGCATT TRRNANAACV 
VGVPSAIAIA DFCRFVAGAM ATTRSVRAVA GRGDGARSTY DAVLEFDDGD AADAFVENYH
GRRYAMGREE TCVAVRVVRV EEGAEASGTL GTEVPTCPVC LDRLDAEASG IVTTICEHAF
HAECLSGWAD ASCPVCRYAH EPESKARCAT CGKDHDLWVC LICGEVRCGR YAGACAVNHW
TETNHTYALE LGTQRVWDYV SDGFVHRLIQ SKSGLEALVA SKLDAIASEY DLLLTSQLES
QRKYFEGLLQ TANARCAGTI SREDEDSRNA AVVARAMSEA KDAKRELKML QKANASHVAS
IEQLRDELEH AHALSDTLAE NVETLRAEAT RAEKRKTIEL AIKDARIKEL EEENRDLMLF
LDTSNKLSVD ASLAEEIAGG TVVGIDTDTT PEPTPSRNRT HERLKGKVEA ERRKL
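
The fields above can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the database record: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based, inclusive coordinates, which agrees with the reported Gene Length. The sequences are printed in space-separated groups of ten, so whitespace is stripped before counting.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def clean_sequence(grouped: str) -> str:
    """Remove the spaces and line breaks that group residues in blocks of ten."""
    return "".join(grouped.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in an already cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # Coordinates taken from the Gene Information fields.
    print(span_length(451829, 453370))  # 1542, matching the Gene Length field

    # First row of the gene sequence: six groups of ten bases.
    first_row = "ATGGCGCACG CCGCGCGCGT CGAGCGCGAC GACGACGACG ACGACGACGA CGACGACGCG"
    print(len(clean_sequence(first_row)))  # 60
```

Passing the full cleaned gene sequence to gc_fraction gives a value that can be compared with the GC content field. Because the gene is marked as spliced, the protein length is not expected to equal the gene length divided by three.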