Gene OSTLU_44053 details

Gene Information

Locus tag: OSTLU_44053
Symbol:
ID: 5004449
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009365
Strand:
Start bp: 112783
End bp: 114261
Gene Length: 1479 bp
Protein Length: 463 aa
Translation table:
GC content: 60%
IMG OID: 640419870
Product: predicted protein
Protein accession: XP_001420250
Protein GI: 145351799
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0440] Acetolactate synthase, small (regulatory) subunit
TIGRFAM ID: [TIGR00119] acetolactate synthase, small subunit
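
The coordinate and length fields above can be cross-checked with a little arithmetic. The Python sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the stop codon. The function names are hypothetical and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a 1-based, inclusive coordinate span."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return (protein_aa + 1) * 3


# Values taken from the record above.
gene_len = span_length(112783, 114261)
cds_len = coding_length(463)

# Bases in the span that the coding sequence does not account for.
non_coding = gene_len - cds_len

print(gene_len, cds_len, non_coding)  # prints: 1479 1392 87
```

Under those assumptions the span matches the stated gene length of 1479 bp, and the 87 bp not accounted for by the coding sequence is consistent with the gene being flagged as spliced.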


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.0143382
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCGCT CGAGCGCGAC GCGCGGGTCG GGCGACGATT ACGACCTCGA CGCGGGCATC 
GCGGGCGCGA CGCCGGGCGA TGGATGGACG CCGACGTCGT ACGACGGACG CGGGAGCACG
GGGGACGTGT ACCAAGGGCC GGCGCGGCTG GCGGAGGGAT TGCGAAGACA CACCGTGCTG
GTGTACGTCG CGGACGAGAC CGGGATGATC AATCGCGTCG CGGGGGTGTT CGCGCGACGA
GGGTACAACA TCGAATCGCT CGCGGTGGGG CTGAACATCG ACAAGGCGAT TTTTACGATC
TCGGTGATTT GCAGCGACGG CGACGTGGGG AAGCTGATCA AGCAGGTGAA TAAATTGGCC
AAGGTGCGAA AGGTGGAGAA CGTCACGGAT AAGGAGTGCG TGGAACGAGG GTTGATGCTG
CTGAAGGTGA AGTGCGAGCC CGAACAGCGG TCGCAGGTGC TGGAGATTAA TCGGATTTTC
CGAGCGAGCG TGGTGGACGT GGCGGAGCGG TCGCTGACGA TGTCGGTGGT GGGGGATCCC
GGGAAGAATC GAGCGTTTCA GAGCGCGCTG ATGAAGTTCG GGGTGATTCA AGTCGCGCGA
ACGGGGAAGT TGGCGCTGAA GAGAGAGCCC GTGTACAGCG AGGCTCGGTC TCGTCGAGTG
AAACTCATGG AGGCGATGCG AAAGGCGAAG GACGCGGTGA GCGGATTAAA CATCAAGGAG
AAGAGCGCCA AGTACGAGAG CATCTTGGCG TCACGAATCG TTCGCGCGGT CGCGGGGATG
GAACACGACG ATGACGGCGA CTTGCACGTC GGCGACGTGT ACACGTCGCT CGAGAACGAC
GAAATCGGAG TGTGGGACGT CCCCGTGCTC AGCTCTTCCT TCTCGGGACT GGGGCACGGT
AGCGACAAAG TCGACATCGA CAAGATGGAC GAAAACGCCA AGTACACGCC GCACACTATC
TCTATTTTGG TCGATAACCG CCCGGGCGTG TTGGATTCCA TCACCGGCGT GTTCGCTCGT
CGCGGGTACA ACATTCAGTC CCTCGGCGTC GGCCCGGAAA GAACTTTCGA CATCTCTCGC
ATTTCCACCG TCGTTCCGGG CAGTACCGAA GACATCGCGA TGCTGCTCAA GCAAATCTTA
AAGGTGCCTT ACGTCATCTC CGCCGAAGAT ATCACGATGA CGCCGTTCAC GGAGCGCGAA
CTCATGCTCA TCAAGGTCGT CAGCTCTCGC GCGCAACGAG CAGAAATTAT CGATTTGTGC
GGCATGTTCC GAGCCAAGGT GTGCGACATC TCCGAAGACA CCGTCACCAT CGAAGTCAGC
GGGCGCCAGC GTAAGATTAA CGCCATTCAA GCGCTTCTCG AGCCGTACGG GATTCTGGAA
GTTGCGCGCA GCGGTCGCGT CGCGCTCCCG CGCGATTCGG GCGTCGATTC CAAGCTCATG
ATGGCGATCG AGTCCGAAAG CGATCTCGAC AAGTGGTAA
 
Protein sequence
MTRSSATRGS GDDYDLDAGI AGATPGDGWT PTSYDGRGST GDVYQGPARL AEGLRRHTVL 
VYVADETGMI NRVAGVFARR GYNIESLAVG LNIDKAIFTI SVICSDGDVG KLIKQVNKLA
KVRKVENVTD KECVERGLML LKVKCEPEQR SQVLEINRIF RASVVDVAER SLTMSVVGDP
GKNRAFQSAL MKFGVIQVAR TGKLALKREP VYSEARSRRV KLMEAMRKAK DASAKYESIL
ASRIVRAVAG MEHDDDGDLH VGDVYTSLEN DEIGVWDVPM DENAKYTPHT ISILVDNRPG
VLDSITGVFA RRGYNIQSLG VGPERTFDIS RISTVVPGST EDIAMLLKQI LKVPYVISAE
DITMTPFTER ELMLIKVVSS RAQRAEIIDL CGMFRAKVCD ISEDTVTIEV SGRQRKINAI
QALLEPYGIL EVARSGRVAL PRDSGVDSKL MMAIESESDL DKW
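
The sequences above are printed in space-separated blocks of ten residues. A minimal Python sketch for normalizing such text before further analysis follows. It assumes the sequence has been copied into a string, and the helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for empty input)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, exactly as printed in the record.
first_line = "ATGACGCGCT CGAGCGCGAC GCGCGGGTCG GGCGACGATT ACGACCTCGA CGCGGGCATC "
dna = clean_sequence(first_line)

print(len(dna))          # six blocks of ten bases
print(gc_fraction(dna))  # GC fraction of this line only, not of the whole gene
```

Applied to the full gene sequence, `clean_sequence` should yield a string whose length equals the Gene Length field, which makes it a quick check that nothing was lost when copying.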