Gene Mlg_1981 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1981 
Symbol 
ID4268524 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2248469 
End bp2251705 
Gene Length3237 bp 
Protein Length1078 aa 
Translation table11 
GC content66% 
IMG OID638126737 
Productcarbamoyl-phosphate synthase large subunit 
Protein accessionYP_742813 
Protein GI114321130 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.00635498 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
TTGAGACTGC GCGGCATGCC CAAGCGTACC GATATCAAGA GCATCCTCAT CATCGGCGCC 
GGTCCCATCG TGATCGGCCA GGCCTGTGAA TTCGACTACT CCGGCGCCCA GGCCTGTAAG
GCCCTGCGCG AGGAGGGGTA CCGGGTCATC CTGGTCAACT CCAACCCGGC CACCATCATG
ACCGACCCCG AGACGGCGGA CGCGGTCTAC ATCGAGCCCA TCGAGTGGCA GACCGTGGCG
CGCATCCTGG AAAAGGAGCG GCCCGATGCG GTGCTGCCCA CCATGGGGGG GCAGACGGCG
TTGAATTGCG CCCTGGACCT CTCCCGCGAG GGCGTGCTGG AGCGCCTGGG CATCGAGATG
ATCGGGGCCA ACAAAGAGGC CATCGACATG GCGGAGGACC GCGAGTCCTT CCGCGAGGCC
ATGGCCCGCA TCGGCCTGGA GACCGCCCAT GCGGAGATCG CCCACTCCAT GGAAGAGGCG
CTGGATGCGC AAAAGCGCAT CGGTTTCCCC ACCATCGTCC GGCCCTCCTT CACCCTGGGC
GGCTCCGGCG GGGGCATCGC CTACAACCGT GAGGAGTTCA TCGAGATCTG TGAGCGTGGT
CTCGACCTTT CGCCCACCAA CGAGCTGCTC ATCGAGGAGT CGGTGCTGGG CTGGAAGGAG
TACGAGACAG AGGTGGTGCG CGACAAGGCG GACAACTGCA TCATCATCTG TTTCATCGAA
AACCTGGACC CAATGGGCGT GCACACCGGG GATTCCATCA CCGTGGCCCC GGCGCAGACC
CTGACCGACA AGGAATACCA GATCATGCGC GACGCCTCTC TCGCCGTGTT GCGGGAGATC
GGCGTGGAGA CGGGTGGCTC CAACGTGCAG TTCGCCATCA ACCCGGCGGA CGGGCGCATG
GTGGTGATTG AGATGAATCC GCGGGTGTCG CGCTCCTCGG CGCTGGCCTC CAAGGCCACC
GGCTTCCCCA TCGCCAAAGT GGCCGCCAAG CTGGCGGTGG GTTACACCTT GGATGAGCTG
CGCAACGAGA TCACCGGCGG TGCCACCCCG GCTTCCTTCG AGCCGACCAT CGACTACGTG
GTCACTAAGA TCCCGCGTTT CACCTTCGAG AAGTTCCCCA AGGCCCCGCC GCGGCTGACC
ACCCAGATGA AGTCGGTGGG CGAGGTGATG GCCATCGGCC GCACCTTCCA GGAGTCGCTG
CAGAAGGCCC TGCGCGGCCT GGAGAACGAT TTGACCGGCC TGGACGAGCG GGTGGACCTG
TCCGTCGAGG GGGGCAACGA CCTGATCCGC CAGGAGCTGC GTCAGCCCTC GCCGGAGCGG
CTGCTCTACC TGGCCGATGG CTTCCGCGCC GGCTTCACCC TCGAGGAGCT GTTCGAGCTG
ACCTGGATCG ACCCCTGGTT CCTGGCCCAG ATCCAGGAAC TGGTGGCGGT GGAACAGGGC
CTGCGCACCG GCGGCCTGAA GTCGCTGGAC CGTGACCGCT TGTTCAATCT CAAGCAGAAA
GGGTTTTCCG ACGCCCGGCT GGCCCGCCTG CTGGGGGTGC GCGAGGCGGA CGTGCGTGCG
CGCCGCCTGC AACTGGAGGT GCGCCCGGTG TTCAAGCGGG TGGACTCCTG CGCCGCCGAG
TTCGCCTCCG CCACCGCCTA CATGTACTCC ACCTACGAGG AGGAGTGCGA GGCCGAGCCC
ACCGACCGGC GCAAGATCAT GGTCCTTGGC GGCGGCCCTA ACCGGATCGG CCAGGGCATT
GAGTTTGACT ACTGCTGCGT GCACGCCGCC CTGGCCATGC GTGAGGACGG CTATGAGACC
ATCATGGTCA ACTGCAACCC GGAGACGGTC TCCACCGACT ACGACACCTC CGACCGGCTC
TATTTCGAGC CGCTCACCCT GGAGGACGTG CTGGCGATCG TCGAGATCGA GCGGCCCGAG
GGCATCATTG TTCAGTACGG TGGCCAGACG CCGCTCAAGC TGGCGCGCGA CCTGGAGGCC
GCCGGGGCGC CGATCATTGG CACCACCCCG GACTCCATCG ACCTGGCGGA GGACCGCGAG
CGCTTCCAGG GGCTGATCAA CAAGCTGGGG CTCAAGCAGC CGCCCAACCG CACCGCCCGC
AGCGCCGATC AGGCCCTGCG CCTGGCCGCG GAGATTGGTT ACCCGCTGGT GGTGCGCCCG
TCGTACGTAC TGGGTGGCCG CGCCATGGAC ATCGTCTATG GCGAGGATGA ATTGTTGCAG
TACATGCACG AGGCCGTGCG GGTCTCCAAC GACTCGCCGG TGCTGCTGGA CCGCTTCCTG
GACGATGCCG TGGAGGTAGA CGTGGACGCC ATCTGCGACG GCGAGGACGT GCTGATCGGC
GGTATCATGG AGCACATCGA GCAGGCCGGC GTGCACTCCG GGGATTCCGC CTGTTCGCTG
CCGCCCTATA CCCTGGCGCC CGATGTCCAG GACCGGTTGC GTGAGCAGAC CCGGGCCCTG
GCGCTGGAGC TGGGTGTGGT CGGGCTGATG AACATCCAGT TCGCCATCAA GGGCAGTGAC
GTCTATCTGC TGGAGGTTAA CCCGCGCGCC TCGCGTACGG TGCCCTATGT CTCCAAGGCC
ATCGGCACCC CGTTGGCCAA GGTGGCCGCC CGCTGCATGG CCGGGCAGAC GCTGGCCGCG
CAGGGGATCA CCCGTGAGGT TATTCCGGCC TATTATTCGG TGAAGGAGGC GGTCTTCCCC
TTCATCAAGT TCCCCGGCGT GGACCCGATC CTCGGCCCGG AGATGAAGTC CACCGGCGAG
GTCATGGGTA TCGGCCGCAG CTTTGGCGAG GCCTACGCCA AATCGCAGGT GGCGGCCAGC
GTCAAGCTGC CCCGGAGCGG ACGCTGCTTC ATCAGTGTCC GTGACGTGGA CAAGCCGGGT
GCCATCGAGG TGGCGCGCGA GCTCATCCGG CGGGGCTTCT CTTTGGTGGC CACCCGCGGC
ACCGCCGCTG CGCTCTCCGA GGCCGGCGTC GAGTGCGACG TGATCAACAA GGTGCTGGAA
GGCCGGCCGC ACATCGTTGA TGCCATCAAG AACGATGAGA TCGACCTGAT CGTGAACACG
ACGGAGGGGC GTCAGGCCAT CGCCGACTCC TACTCCATCC GCCGCGAGGC CCTGCAGCAC
AAAGTGTGTT ACACCACCAC CATCAATGGC GCCCGGGCCA CGCTGCTGGC GCTGGACTAC
CTGGATGCCG CCGACGTCAA CCGCCTGCAG GATCTGCACA GGGAGGCAAC CGCATGA
 
Protein sequence
MRLRGMPKRT DIKSILIIGA GPIVIGQACE FDYSGAQACK ALREEGYRVI LVNSNPATIM 
TDPETADAVY IEPIEWQTVA RILEKERPDA VLPTMGGQTA LNCALDLSRE GVLERLGIEM
IGANKEAIDM AEDRESFREA MARIGLETAH AEIAHSMEEA LDAQKRIGFP TIVRPSFTLG
GSGGGIAYNR EEFIEICERG LDLSPTNELL IEESVLGWKE YETEVVRDKA DNCIIICFIE
NLDPMGVHTG DSITVAPAQT LTDKEYQIMR DASLAVLREI GVETGGSNVQ FAINPADGRM
VVIEMNPRVS RSSALASKAT GFPIAKVAAK LAVGYTLDEL RNEITGGATP ASFEPTIDYV
VTKIPRFTFE KFPKAPPRLT TQMKSVGEVM AIGRTFQESL QKALRGLEND LTGLDERVDL
SVEGGNDLIR QELRQPSPER LLYLADGFRA GFTLEELFEL TWIDPWFLAQ IQELVAVEQG
LRTGGLKSLD RDRLFNLKQK GFSDARLARL LGVREADVRA RRLQLEVRPV FKRVDSCAAE
FASATAYMYS TYEEECEAEP TDRRKIMVLG GGPNRIGQGI EFDYCCVHAA LAMREDGYET
IMVNCNPETV STDYDTSDRL YFEPLTLEDV LAIVEIERPE GIIVQYGGQT PLKLARDLEA
AGAPIIGTTP DSIDLAEDRE RFQGLINKLG LKQPPNRTAR SADQALRLAA EIGYPLVVRP
SYVLGGRAMD IVYGEDELLQ YMHEAVRVSN DSPVLLDRFL DDAVEVDVDA ICDGEDVLIG
GIMEHIEQAG VHSGDSACSL PPYTLAPDVQ DRLREQTRAL ALELGVVGLM NIQFAIKGSD
VYLLEVNPRA SRTVPYVSKA IGTPLAKVAA RCMAGQTLAA QGITREVIPA YYSVKEAVFP
FIKFPGVDPI LGPEMKSTGE VMGIGRSFGE AYAKSQVAAS VKLPRSGRCF ISVRDVDKPG
AIEVARELIR RGFSLVATRG TAAALSEAGV ECDVINKVLE GRPHIVDAIK NDEIDLIVNT
TEGRQAIADS YSIRREALQH KVCYTTTING ARATLLALDY LDAADVNRLQ DLHREATA