Gene OSTLU_42572 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_42572 
SymbolPYR1L 
ID5003256 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp404620 
End bp408197 
Gene Length3578 bp 
Protein Length1105 aa 
Translation table 
GC content56% 
IMG OID640418677 
ProductCarbamoyl-phosphate synthase L chain 
Protein accessionXP_001419372 
Protein GI145349915 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0674343 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGGCG AGAGCGAGGT TTTTCGAGCG TGGGATCAGG CGACGAAGAG CCAAAAGAGA 
ACCGACTTGA AGAAGATCAT GATCCTGGGC GCGGGACCGA TCGTGATCGG ACAGGTGCGC
GCGCGCGATG ACGCGCTCGG GGAGGGAGGA GAGGCGAACG GCGCGGGATG GGAGGGGGTG
ATCGCGAACG CGAGCGCGGG CGCGTTTTTA AATATTCAAA CCGCGGCGGG GGCGATCGCG
AGGGAGCGAC GCGCGGGGTA GCGACGCTTC GGGGGATTGG CGCGAACGGT TTAGTGAGAA
TTAATTGTGT TTCGTTCGTC GTCTCGACAC GGGCGCGGAA AGACTGACGG TGGTGCGAGT
TTGAACTTTG GTAGGCGTGC GAGTTTGATT ACTCTGGGAC CCAGGCGTGC AAGGCGTTGC
GCGCCGAAGG ATACGAAGTC GTGCTCGTGA ACTCGAACCC GGCGACGATT ATGACGGATC
CAGAGACGGC GAGTCGCACG TATGTGACGC CGATGTCGCC GGAGAGCGTT GAGCAAATCA
TCGCGCGCGA AAAGCCGGAC GCCTTGTTGC CGACGATGGG CGGGCAAACC GCGCTCAACT
TGACCAAGGC GCTCGCGGAG TCCGGGATTT TGGCCAAGTA CAACGTCGAG TTGATCGGTG
CTAAGCTTGA TGCTATCAAC AAGGCTGAAG ATCGTCAGCT TTTTAAGGAT GCGATGGACA
AGATTGGCTT GAACACGCCC AAGTCTGGCA CAGCCGAGAC GTGGGCGCAG GCGCAAACCA
TCGCGGCGGA CATCGGCACC ATGCCGCTCA TCATCCGTCC TGCTTTCACT CTCGGCGGTT
CTGGGGGTGG TATCGCGTAT AACATGGCTG AGTTTGAAAA CATCGTCAAG GGTGGTCTCG
ATGCGTCTGC GACGAGCCAA GTTTTGATTG AGCAATCTTT GCTCGGCTGG AAGGAGTACG
AGCTCGAGGT GATGCGTGAT CTTGCCGACA ACGTCGTCAT CGTGTGCTCG ATCGAAAACT
TCGATCCCAT GGGCGTGCAC ACCGGTGATT CCATCACTGT TGCCCCGGCT GAGACGCTCA
CGGATAAGGA ATACCAGCGT CTTCGCGATG CCTCGGTTGC TATTATTCGT GAGATCGGCG
TAGAGTGCGG TGGTTCGAAC GTGCAGATGG CTGTCAACCC AGTCGATGGT CAAGTCATGA
TTATCGAGAT GAACCCGCGC GTGTCTAGAT CTTCTGCTTT GGCTTCCAAG GCGACTGGTT
TTCCGATCGC TAAGATGGCT GCAAAGCTTG CGGTCGGTTG CACCCTCGAC GGCATCCCGA
ACGATATTAC GCTCAAGACG CCGGCGTCGT TTGAGCCGTC GATCGATTAC GTCATCACTA
AGATTCCTCG ATTCGCTTTC GAGAAGTTTC CGGGTGCTAA GGCGGTGCTC ACGACTCAAA
TGAAGTCCGT CGGTGAAGCG ATGGCCATGG GTCGCACCTG GCAAGAGTCC TTCCAAAAGG
CGTTCCGCTC TCTCGAGACG GGCTTCTCTG GTTGGGGTCT CAGCAAGAAG GATGGCTTCA
TGCAAGGTGA TGTCTCCGCC ATTCGCGATG GTCTTACTAT TCCGAACCCT GAAAGAATTG
TGACCATCCA CGAAGCGTTC CTGGCTGGAT TTACGGAGAA GGAGATCATC AACCTCACGA
CGATGGACCC ATGGTTTGTT CGCCAACTCG GTGAACTTTA CGAAACCGAG TGTTGGTTAA
AGTCGCTCAA GTCGATCGAT GAACTTAGCG AAAACGATTG GCTTGAAGTC AAGAGACGTG
GTTTCAGCGA CGCTCAAATC TCTGTCGCTT TCCCTGGCAC GGATGAGATG ACTATCCGCA
AGGCCCGCAC CGGCAAGGGC TGCGTCCCGT CGATGAAGCG CGTCGATACT TGCGCGGCGG
AGTTCCAAGC GGACACGCCT TATATGTACT CTTCCTACGA TGGAAATGAT GAAGCCGAAC
CGACGAACAA CCGTAAGATT CTCATTTTGG GTGGTGGCCC GAACCGTATT GGCCAAGGTA
TCGAGTTCGA TTACTGCTGC TGCCACGCCG CGTTCGCGCT TGCCGATGCA GGCTTTGAGA
CTATTATGCT CAACAGTAAC CCAGAAACTG TGTCGACGGA CTACGACACT TCTGATCGCT
TGTACTTTGA ACCTTTGACG GTGGAAGATG TCTTGAACGT GTGCGAGACT GAGCGTCCGG
AAGGCATTAT CGTGCAGTTC GGCGGCCAAA CTCCGTTGAG CCTTGCCACG AAGCTTGAGG
CCGCGCTAAA CGCGAACCCG ATTCCGGCGG CGTCCGGTAA CGGCTTCACG AAGATTTTGG
GTACGCCTCC GGACTCCATC GATGCGGCTG AAGATCGTGA GCGTTGGATC GATATTTTGG
ATGAACTTGA AATCCTTCAA CCCCCGGGCG GCGTCGCTCG CTCCGAGGAA GAGGCGCTGA
AGGTGGCTGA AAAACTCGGT TACCCGGTCA TGGTTCGCCC GTCTTATGTT CTTGGTGGTC
GTGCGATGGA AATCGTTGGT TCTACTGCCG ATTTGAAGCG ATACATCAAC ACTGCGGTGG
AAGTTGATCC GGAACGACCC GTCCTCGTTG ACAAGTACCT CCAAAACGCG ACGGAAATTG
ATTGCGATGC TCTGTGCGAC ATGGAAGGCA ACGTTGTCAT TGGTGGGATC ATGGAACATA
TCGAACAGGC AGGTGTGCAC TCCGGTGACT CTGCGTGCTC TTTGCCGACC CAAACCATCC
CTGAATCTGC CTTGGCGACT ATCCGCGAAT GGACTCCTAA GCTTGCTCGT CGACTTGGAG
TTGTTGGTCT CATAAACATT CAGTACGCCG TGACTCCGGA CGGCACTCCT TATATTATTG
AGGCAAACCC TCGTGCGTCT CGTACGGTGC CGTTTGTTGC TAAGGCGATC GGTCACCCAT
TGGCGAAGTA TGCCTCTTTG GTGATGGCGG GAAAAACGTT GAAGGAAATC GGCTTCACGG
AAGAAGTGAA GCTCAACCAC GTCGCCGTCA AGGAAGCGGT TCTTCCGTTC GATAAGTTCC
CGGGCGCCGA CACGTTGCTC GGTCCGGAGA TGAGAAGTAC CGGTGAAGTG ATGGGCATCG
ACAAGGACTT CAGCCGTGCC TACTGCAAGG CGCAGCTCGC TGCTGGACAA CGTTTGCCAA
CTTCTGGCAA TGTGTTCATC TCCGTCCGTG ATGGTGATAA GGATGCCATA GTTGACATAG
CGCGTGACCT CGTCGCCATG AAATATACGG TCCTCTCGAC CGGCGGTACG GCGAGCCACT
TAGAAAACGC GGGCGTTCCC GTTACCAAGG TGAAGAAGGT CCACGAGGGC CGCCCTCATA
TTGGTGACAT GATCCGCAAC GGTGAGATTG GTTTGATGGT TGTCACCTCG TCTGGTGACG
CTCAAGATTT GGTGGATGGT CGCGAGATCC GTCGCACCGC GGTTGGCCTC AAGGTTCCTA
TGGTGACGAC GATCGCTGGC GCGAAGGCGA CTGTTGGCGC CGTCAGAGTC TTGCAACAGA
ATGACTTGGT GATGGATGCC TTGCAAGATT TCTTCTAG
 
Protein sequence
MGGESEVFRA WDQATKSQKR TDLKKIMILG AGPIVIGQAC EFDYSGTQAC KALRAEGYEV 
VLVNSNPATI MTDPETASRT YVTPMSPESV EQIIAREKPD ALLPTMGGQT ALNLTKALAE
SGILAKYNVE LIGAKLDAIN KAEDRQLFKD AMDKIGLNTP KSGTAETWAQ AQTIAADIGT
MPLIIRPAFT LGGSGGGIAY NMAEFENIVK GGLDASATSQ VLIEQSLLGW KEYELEVMRD
LADNVVIVCS IENFDPMGVH TGDSITVAPA ETLTDKEYQR LRDASVAIIR EIGVECGGSN
VQMAVNPVDG QVMIIEMNPR VSRSSALASK ATGFPIAKMA AKLAVGCTLD GIPNDITLKT
PASFEPSIDY VITKIPRFAF EKFPGAKAVL TTQMKSVGEA MAMGRTWQES FQKAFRSLET
GFSGWGLSKK DGFMQGDVSA IRDGLTIPNP ERIVTIHEAF LAGFTEKEII NLTTMDPWFV
RQLGELYETE CWLKSLKSID ELSENDWLEV KRRGFSDAQI SVAFPGTDEM TIRKARTGKG
CVPSMKRVDT CAAEFQADTP YMYSSYDGND EAEPTNNRKI LILGGGPNRI GQGIEFDYCC
CHAAFALADA GFETIMLNSN PETVSTDYDT SDRLYFEPLT VEDVLNVCET ERPEGIIVQF
GGQTPLSLAT KLEAALNANP IPAASGNGFT KILGTPPDSI DAAEDRERWI DILDELEILQ
PPGGVARSEE EALKVAEKLG YPVMVRPSYV LGGRAMEIVG STADLKRYIN TAVEVDPERP
VLVDKYLQNA TEIDCDALCD MEGNVVIGGI MEHIEQAGVH SGDSACSLPT QTIPESALAT
IREWTPKLAR RLGVVGLINI QYAVTPDGTP YIIEANPRAS RTVPFVAKAI GHPLAKYASL
VMAGKTLKEI GFTEEVKLNH VAVKEAVLPF DKFPGADTLL GPEMRSTGEV MGIDKDFSRA
YCKAQLAAGQ RLPTSGNVFI SVRDGDKDAI VDIARDLVAM KYTVLSTGGT ASHLENAGVP
VTKVKKVHEG RPHIGDMIRN GEIGLMVVTS SGDAQDLVDG REIRRTAVGL KVPMVTTIAG
AKATVGAVRV LQQNDLVMDA LQDFF