Gene OSTLU_25061 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_25061 
Symbol 
ID5003758 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp438001 
End bp441281 
Gene Length3281 bp 
Protein Length729 aa 
Translation table 
GC content63% 
IMG OID640419179 
Productpredicted protein 
Protein accessionXP_001419809 
Protein GI145350850 
COG category 
COG ID 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0638404 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0884002 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGCGG CGTGCGCGCG CGCGATCGAC CGCGACGGCG CGCGCGCGGG CGCGCCGTCG 
ACGAAGCGCG CGCCGCCGGG CCCGCACGCG CTGCCGGCGC GATCGATCCT GAAATCGTCG
TCGATCGCGG GCGAATTGGC GAGCGCGCGC GATTCGCACT CGACCGGGAC GGCGGCGTCG
AGCGCGGCGA CGACGCCGTG CGGGAGCGAG CGCGATCTGG AGGGACGGGA GCGAGGAATG
ACGCGAAAGG TGTCGCTGGT GAAGTTCGCG GGACTGAGCG ACGACGAGGG GGAGGAACAC
GGGGAGGGAT GGCGGGGACG GCTGTTCGGC GGACGCGCGG TGGCGAGCGG CGGCGGCGGC
GGCGGCGGCG AACGGGACGA CGACGGCGCG ACGACGACGC ACGGGAATGG ATTCGAGAGC
GATTCTTCCG GGCGCGAGGT CGGCGGAGGG ACGATTTACG GGATGCCGAC GCACGAACGA
CGGGGAAGCT TCACGAGCTC GGTGCCGGTG GTGGAGCACG AGGAGGAGGC GCACTGGTTT
CTGTCGATTG GCGTCGATAC GCGGTTGGCC GCGGCGCGCG TGATGGAGAG CAGACTCGCG
GCGGCGCGCG TGGAGGAAGA GCGGCTGCGT GAGCTCGTGA TCGAGGCGGA AGCGGCGACG
CGAGAGGCGG AAGCGGAGGC GCGCGCGAAG AGCGAAGGCG AACACGACGA GCCGCGCGCG
GGGTCGAGCG GCGGAGACGG CGGCGAGGAC GACGGTGCGG TTGTATCGAC GCCGCCACGC
GTGAAGACGG CGACATTGAT GCCATGGGAT GTAATGGTGA CCGTCGCGCG AGCTAGAGAA
GGCGATGCGC GCGCTGCGGC GCACAAAGCG ACGCTCGTGG TGAAGAAGCT CGCGCACACG
GTGGCGCTCT GGGAGCTCAA GTCGACGAAG CGGGCGCATT CGGATATCGG TGAACACTTG
ACGAAGATTA AAGAAAAGAA GGGTAGACGA CTGGCGTCTG CGACGCGCGC TCGGGCTACG
TTTATAGCCG CCGAAGAGAC GCGCGATCGC GAGTTTGAGA ACACGCGAGT GTTGGAAGAC
AAGCGTCTCG CCGCCGACGT GAACATGGCG GAGGCTTCGG CGCGAATCCA CGCCGCGCGG
ACGGCGTTGC AGAACGCGCG CGACCGCAAA AAGGGTATTT TAACGACGTT TCGTCAAGCC
GTGCGAACGT CCATGGACTC GAGCGTTGAT TTGTTGGAGT TTAGGCAGAA GGAGTTTGCC
CTGCGGGCGT CCAACGTCAC GAAGACTATG TTCACCGCGT ACGACGAGAT GACGTCAAGG
CTCGAAGAGG CGCTCATGAA CGCTATCGAC GCGGTGGGGC GGAAGTCACC GAAGAAGATC
GATAAAGTGT TCGACATCGA GGCCGCCGCC GCGCTCACCG CACGCGATCG CGCCGAAGCC
TCGTTAAATG ACGCCACGAG TCAGTGCATG GCTGAGATGC GCGCGCTCGA GGCGTCGACG
CGTACTTTGC TGGATGACGT CGATGAGGAA ATCAAAAACG GCATAGAGGA GTATTATGCT
GCCGCGGACG ACGAGCGCGT GTGCTATATG GAGGCGCAAC GAGCGACGAT TGCGTGCAAG
ATGGCGCACG CTGCGGTCAA GGTGTACGAA GAGCAATACA ACGCCACGAA GAACATGTCC
GACGAGGCTC ACACTTGGGC GAGCTCACTC GACGACGAAA TCGCACAGCT GAACAAAGAC
TACGCGATCG CCGAGGAGCG GCATCTCGAA GCCGCAAAAG TCGTCGACGC AGCGCTGGTC
GAAGTTGAAG CCGCAGAGAA GGACGTCTTG TCGACTTCGG CGCCTGACGA GTCGAATCAA
AAGAGCGCGC GACGGCAAAG CGTGCGAAAC GTGTTCGAAG TCATAAATCG ATTCAGTACA
ATGGTGCGTG ATAAATCGGC GCGAGGCGAC AGTGATACTT TCGCGAGTCT CAAGCAACTC
TCGCTTCAAG CTGGCTCGGA TTCGCCGAGA AATTCCGACG AGCAACATCG CGTCTCCATG
CTCGGATTTT CCACATCGTT CCCCGACGCC GACGCCACGA TGGAGAGCAT CATGGAAAAG
GCGCACGAGC TGGAGCACTC GAGAGACGTG CAGACGCTCA CGCGGAATGT ATCCCGAAGC
GCCATGGCGT ATAATCCGCT GCAAAAGTGA AGAGAGACCG CAAAAAGTTT GAATACGAAT
ACGCATGTTA TTGACAAGAT TGTATTATTC GCCACCTACT TTGGCGTGGG TATACGAGTA
GATCTGTAGA AAAGAGACCG GTAGGACATA TTTATTCATG TCGGTCAACG TACACGTTGG
TTTTGTAAAA CAAAATTTGT TCGATGAGTC TGGTGCGACG ACGAACGGCG GGGGATTATT
GCCCCATCGC GACGGAGAAG CCAGTCTTGT AGTGATGCCC CTTGGACGCC ATACGCAAGC
ACCCCGTGGC GATGGCGCCT TCGCAGAGCG TCGTGGTGTA TGTCACGTCC GTGTCACCGT
GATTCGAGTG CACGACACGC ATGGTGTCGC CGGATTGGAG CGTCTTGCTC ACGCCGAGCG
CGTAGCTCAA CGAATTGCGT TCTCCGCCCA TCGCGAGCTT GGCTTCCAAA CCCGCGGTCG
TCTTCGCGTC GAGCTCGCTC GTGACGCCCA AAGCGACGGT ATTCCCACCA TCACTCAACG
CCGCGCTCAA AGTCGTCGGT CCATCTCGTC GCTGGGCCGC GAGCGTCCAG TCGGCCAGAC
CCCCGCTCGA GGGCGACACC GTCGCCTTGG CCCCGAGCGC GGTGTTATCA TCGAACGCGT
AGCACGCGCT CGCGACGATA TCACCGCCCG CCAACGCCGA ATCAGCCGTG ATCCCGACGG
AATCGATCGC GAGCTTCGCC CCGATCGTCG CGTTCGCGGT ATCGCGCGCC AATCCCTCCA
CCGCCACGTC GAGTCCGCTC AGCGCCAATC CGCTCGCCAC CACGTCCGCG CTCAGATCCC
CGTGATGATT CGCCTTCGCC ACCAACTTCA CCCCATCGAT CGGCAGTTCA AACGTTTGCT
TCGCGTCCAC CGCGATCGCT CTCGCCGCGC CGTTCACGTC CACGGTCACG TCCGCGTCCG
CGAGCGCCGA ATTCTTCGGT CCCTTCGTCG TCACCTTCGT CGCCCCCACC GCCGCGGGTT
CGAGCAAATC GCGCAGCGCT TCGCCGAGCG CTTCAAAGTA CGTCATTTCG CGTCGGTGCG
CGTCGTTTAC GAGCGCGCGC GTCGCCGTGT GTCGCGTCGC G
 
Protein sequence
MGAACARAID RDGARAGAPS TKRAPPGPHA LPARSILKSS SIAGELASAR DSHSTGTAAS 
SAATTPCGSE RDLEGRERGM TRKVSLVKFA GLSDDEGEEH GEGWRGRLFG GRAVASGGGG
GGGERDDDGA TTTHGNGFES DSSGREVGGG TIYGMPTHER RGSFTSSVPV VEHEEEAHWF
LSIGVDTRLA AARVMESRLA AARVEEERLR ELVIEAEAAT REAEAEARAK SEGEHDEPRA
GSSGGDGGED DGAVVSTPPR VKTATLMPWD VMVTVARARE GDARAAAHKA TLVVKKLAHT
VALWELKSTK RAHSDIGEHL TKIKEKKGRR LASATRARAT FIAAEETRDR EFENTRVLED
KRLAADVNMA EASARIHAAR TALQNARDRK KGILTTFRQA VRTSMDSSVD LLEFRQKEFA
LRASNVTKTM FTAYDEMTSR LEEALMNAID AVGRKSPKKI DKVFDIEAAA ALTARDRAEA
SLNDATSQCM AEMRALEAST RTLLDDVDEE IKNGIEEYYA AADDERVCYM EAQRATIACK
MAHAAVKVYE EQYNATKNMS DEAHTWASSL DDEIAQLNKD YAIAEERHLE AAKVVDAALV
EVEAAEKDVL STSAPDESNQ KSARRQSVRN VFEVINRFST MVRDKSARGD SDTFASLKQL
SLQAGSDSPR NSDEQHRVSM LGFSTSFPDA DATMESIMEK AHELEHSRDV QTLTRNVSRS
AMAYNPLQK