Gene OSTLU_27771 details

Gene Information

Locus tag: OSTLU_27771
Symbol:
ID: 5005863
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009369
Strand:
Start bp: 153158
End bp: 154249
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table:
GC content: 60%
IMG OID: 640421284
Product: predicted protein
Protein accession: XP_001421726
Protein GI: 145354928
COG category: [K] Transcription
COG ID: [COG5665] CCR4-NOT transcriptional regulation complex, NOT5 subunit
TIGRFAM ID:
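The coordinate and length fields above are consistent with one another, and a minimal Python sketch can show the arithmetic. It assumes the start and end positions are 1-based and inclusive, and that the CDS length includes a terminal stop codon (the gene sequence in this record ends in TAG). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon occupies one codon but encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(153158, 154249)
print(length)                     # 1092
print(protein_length_aa(length))  # 363
```

Both values match the Gene Length and Protein Length fields reported for this record.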


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000000700977
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.296138
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAATACA GAGAAAACAA TGACGTCGAC GACGCGGTGA CGGCGAAGAT CGAGGACGCG 
ATGCGGGAGT GCGCGGCGGA TATCGGGTGG TTTGAGAGCG TTTTGGCGGC GGACGCGACG
AAGGACGGCG ACGGCGACGA TGAGGCGGGA GCCGAGGCGA GCGAGGCGGA CGTCGAGCGT
TTGGAAACGC CGGGGAAAGA CGGCGCGATG CGAGCGGCGG TAGATGAAGC GTCGACGTCG
GGGACGTCGA CGCCTTCGAC CCCGGCGGAG GCGCAGGCGA AGAAGCCGAA ACGCTCGGAG
GACGGCGGCG ACGAAGGCGC GGCGGTAAAT CGTTTGCAGG CGAAATTCGA GGCGACGTCG
CCGAAAAAGT CGTCTGCGTC GACTTCGCGC GCGAAGGTTA CGACCGGGAT GCATCCGGAC
GCGTGTATTT TCGATTTACC CGGGAATGGA TTGTTGAGCC TGGATGAACT GTCGCGCGGC
GGACACGCGT ACGGTGAGAT TTCGCCCGAG AGCGGCGGTG CCATGGCTCT CGAACACGCC
GATATTCCAT CGGGCGTGTC GATTCGCGGC GTCGTTATAA ATGACCGACA TGTCAGCCAT
CGCTTGCTCG AAATCGCGTG CGCCAAGCTT CCGCGCGAAG GACTCAGCGC GGATGCGAAT
TGGCGGTTAT CGAGCGAGAA GAACGCAAAG AAATCCGCGT CGGCGCCGCA AAAGAGTAAA
ATCGCGACGC CATCGTCGTA TCCGCGATCA CCTCGAGACA TACCGCCCGG ATGCCAGTTG
GACAACCCGG CGCTCTTCAA ACGTCTCGAT AGCGACGCCT TGTTTTTCAC GTTCTATTAC
GGCAGAGACA GACTCAAATT GCTCGCAGCG AACGAGCTCC ACGCCTCCTC GTGGCGTTTT
CACAAGATTC TCGGCACGTG GTTCGCGAGA CTCGACCGTC CGAAAATCAT CAACGAAAAA
GAAGAGTTCG AAACAGGATC CGTCATCTAC TTCGACAACA ACATCGTGGT CAATCCGAGC
GACAGCAGTT CGAGCGGGTG GTGCCAACGC AGCAAGAGCG ACTTCACCTC GCGATACGCC
GATTTCCTCT AG
 
Protein sequence
MEYRENNDVD DAVTAKIEDA MRECAADIGW FESVLAADAT KDGDGDDEAG AEASEADVER 
LETPGKDGAM RAAVDEASTS GTSTPSTPAE AQAKKPKRSE DGGDEGAAVN RLQAKFEATS
PKKSSASTSR AKVTTGMHPD ACIFDLPGNG LLSLDELSRG GHAYGEISPE SGGAMALEHA
DIPSGVSIRG VVINDRHVSH RLLEIACAKL PREGLSADAN WRLSSEKNAK KSASAPQKSK
IATPSSYPRS PRDIPPGCQL DNPALFKRLD SDALFFTFYY GRDRLKLLAA NELHASSWRF
HKILGTWFAR LDRPKIINEK EEFETGSVIY FDNNIVVNPS DSSSSGWCQR SKSDFTSRYA
DFL
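
The gene and protein sequences above can be cross-checked with a short Python sketch that strips the 10-base block spacing, computes the GC fraction, and translates the CDS. The Translation table field is empty in this record, so the standard genetic code (NCBI translation table 1) is used here as an assumption. The names `clean`, `gc_fraction`, and `translate` are hypothetical helpers, not part of any existing library.

```python
# Standard genetic code (NCBI table 1), assumed because the record's
# "Translation table" field is empty. Codons are ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGGAATACA GAGAAAACAA TGACGTCGAC"
print(translate(first_blocks))  # MEYRENNDVD
```

Passing the full gene sequence to `translate` and `gc_fraction` allows a comparison with the listed protein sequence and with the reported GC content of 60%.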