Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_1981 |
Symbol | |
ID | 4268524 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 2248469 |
End bp | 2251705 |
Gene Length | 3237 bp |
Protein Length | 1078 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 638126737 |
Product | carbamoyl-phosphate synthase large subunit |
Protein accession | YP_742813 |
Protein GI | 114321130 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.00635498 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | TTGAGACTGC GCGGCATGCC CAAGCGTACC GATATCAAGA GCATCCTCAT CATCGGCGCC GGTCCCATCG TGATCGGCCA GGCCTGTGAA TTCGACTACT CCGGCGCCCA GGCCTGTAAG GCCCTGCGCG AGGAGGGGTA CCGGGTCATC CTGGTCAACT CCAACCCGGC CACCATCATG ACCGACCCCG AGACGGCGGA CGCGGTCTAC ATCGAGCCCA TCGAGTGGCA GACCGTGGCG CGCATCCTGG AAAAGGAGCG GCCCGATGCG GTGCTGCCCA CCATGGGGGG GCAGACGGCG TTGAATTGCG CCCTGGACCT CTCCCGCGAG GGCGTGCTGG AGCGCCTGGG CATCGAGATG ATCGGGGCCA ACAAAGAGGC CATCGACATG GCGGAGGACC GCGAGTCCTT CCGCGAGGCC ATGGCCCGCA TCGGCCTGGA GACCGCCCAT GCGGAGATCG CCCACTCCAT GGAAGAGGCG CTGGATGCGC AAAAGCGCAT CGGTTTCCCC ACCATCGTCC GGCCCTCCTT CACCCTGGGC GGCTCCGGCG GGGGCATCGC CTACAACCGT GAGGAGTTCA TCGAGATCTG TGAGCGTGGT CTCGACCTTT CGCCCACCAA CGAGCTGCTC ATCGAGGAGT CGGTGCTGGG CTGGAAGGAG TACGAGACAG AGGTGGTGCG CGACAAGGCG GACAACTGCA TCATCATCTG TTTCATCGAA AACCTGGACC CAATGGGCGT GCACACCGGG GATTCCATCA CCGTGGCCCC GGCGCAGACC CTGACCGACA AGGAATACCA GATCATGCGC GACGCCTCTC TCGCCGTGTT GCGGGAGATC GGCGTGGAGA CGGGTGGCTC CAACGTGCAG TTCGCCATCA ACCCGGCGGA CGGGCGCATG GTGGTGATTG AGATGAATCC GCGGGTGTCG CGCTCCTCGG CGCTGGCCTC CAAGGCCACC GGCTTCCCCA TCGCCAAAGT GGCCGCCAAG CTGGCGGTGG GTTACACCTT GGATGAGCTG CGCAACGAGA TCACCGGCGG TGCCACCCCG GCTTCCTTCG AGCCGACCAT CGACTACGTG GTCACTAAGA TCCCGCGTTT CACCTTCGAG AAGTTCCCCA AGGCCCCGCC GCGGCTGACC ACCCAGATGA AGTCGGTGGG CGAGGTGATG GCCATCGGCC GCACCTTCCA GGAGTCGCTG CAGAAGGCCC TGCGCGGCCT GGAGAACGAT TTGACCGGCC TGGACGAGCG GGTGGACCTG TCCGTCGAGG GGGGCAACGA CCTGATCCGC CAGGAGCTGC GTCAGCCCTC GCCGGAGCGG CTGCTCTACC TGGCCGATGG CTTCCGCGCC GGCTTCACCC TCGAGGAGCT GTTCGAGCTG ACCTGGATCG ACCCCTGGTT CCTGGCCCAG ATCCAGGAAC TGGTGGCGGT GGAACAGGGC CTGCGCACCG GCGGCCTGAA GTCGCTGGAC CGTGACCGCT TGTTCAATCT CAAGCAGAAA GGGTTTTCCG ACGCCCGGCT GGCCCGCCTG CTGGGGGTGC GCGAGGCGGA CGTGCGTGCG CGCCGCCTGC AACTGGAGGT GCGCCCGGTG TTCAAGCGGG TGGACTCCTG CGCCGCCGAG TTCGCCTCCG CCACCGCCTA CATGTACTCC ACCTACGAGG AGGAGTGCGA GGCCGAGCCC ACCGACCGGC GCAAGATCAT GGTCCTTGGC GGCGGCCCTA ACCGGATCGG CCAGGGCATT GAGTTTGACT ACTGCTGCGT GCACGCCGCC CTGGCCATGC GTGAGGACGG CTATGAGACC ATCATGGTCA ACTGCAACCC GGAGACGGTC TCCACCGACT ACGACACCTC CGACCGGCTC TATTTCGAGC CGCTCACCCT GGAGGACGTG CTGGCGATCG TCGAGATCGA GCGGCCCGAG GGCATCATTG TTCAGTACGG TGGCCAGACG CCGCTCAAGC TGGCGCGCGA CCTGGAGGCC GCCGGGGCGC CGATCATTGG CACCACCCCG GACTCCATCG ACCTGGCGGA GGACCGCGAG CGCTTCCAGG GGCTGATCAA CAAGCTGGGG CTCAAGCAGC CGCCCAACCG CACCGCCCGC AGCGCCGATC AGGCCCTGCG CCTGGCCGCG GAGATTGGTT ACCCGCTGGT GGTGCGCCCG TCGTACGTAC TGGGTGGCCG CGCCATGGAC ATCGTCTATG GCGAGGATGA ATTGTTGCAG TACATGCACG AGGCCGTGCG GGTCTCCAAC GACTCGCCGG TGCTGCTGGA CCGCTTCCTG GACGATGCCG TGGAGGTAGA CGTGGACGCC ATCTGCGACG GCGAGGACGT GCTGATCGGC GGTATCATGG AGCACATCGA GCAGGCCGGC GTGCACTCCG GGGATTCCGC CTGTTCGCTG CCGCCCTATA CCCTGGCGCC CGATGTCCAG GACCGGTTGC GTGAGCAGAC CCGGGCCCTG GCGCTGGAGC TGGGTGTGGT CGGGCTGATG AACATCCAGT TCGCCATCAA GGGCAGTGAC GTCTATCTGC TGGAGGTTAA CCCGCGCGCC TCGCGTACGG TGCCCTATGT CTCCAAGGCC ATCGGCACCC CGTTGGCCAA GGTGGCCGCC CGCTGCATGG CCGGGCAGAC GCTGGCCGCG CAGGGGATCA CCCGTGAGGT TATTCCGGCC TATTATTCGG TGAAGGAGGC GGTCTTCCCC TTCATCAAGT TCCCCGGCGT GGACCCGATC CTCGGCCCGG AGATGAAGTC CACCGGCGAG GTCATGGGTA TCGGCCGCAG CTTTGGCGAG GCCTACGCCA AATCGCAGGT GGCGGCCAGC GTCAAGCTGC CCCGGAGCGG ACGCTGCTTC ATCAGTGTCC GTGACGTGGA CAAGCCGGGT GCCATCGAGG TGGCGCGCGA GCTCATCCGG CGGGGCTTCT CTTTGGTGGC CACCCGCGGC ACCGCCGCTG CGCTCTCCGA GGCCGGCGTC GAGTGCGACG TGATCAACAA GGTGCTGGAA GGCCGGCCGC ACATCGTTGA TGCCATCAAG AACGATGAGA TCGACCTGAT CGTGAACACG ACGGAGGGGC GTCAGGCCAT CGCCGACTCC TACTCCATCC GCCGCGAGGC CCTGCAGCAC AAAGTGTGTT ACACCACCAC CATCAATGGC GCCCGGGCCA CGCTGCTGGC GCTGGACTAC CTGGATGCCG CCGACGTCAA CCGCCTGCAG GATCTGCACA GGGAGGCAAC CGCATGA
|
Protein sequence | MRLRGMPKRT DIKSILIIGA GPIVIGQACE FDYSGAQACK ALREEGYRVI LVNSNPATIM TDPETADAVY IEPIEWQTVA RILEKERPDA VLPTMGGQTA LNCALDLSRE GVLERLGIEM IGANKEAIDM AEDRESFREA MARIGLETAH AEIAHSMEEA LDAQKRIGFP TIVRPSFTLG GSGGGIAYNR EEFIEICERG LDLSPTNELL IEESVLGWKE YETEVVRDKA DNCIIICFIE NLDPMGVHTG DSITVAPAQT LTDKEYQIMR DASLAVLREI GVETGGSNVQ FAINPADGRM VVIEMNPRVS RSSALASKAT GFPIAKVAAK LAVGYTLDEL RNEITGGATP ASFEPTIDYV VTKIPRFTFE KFPKAPPRLT TQMKSVGEVM AIGRTFQESL QKALRGLEND LTGLDERVDL SVEGGNDLIR QELRQPSPER LLYLADGFRA GFTLEELFEL TWIDPWFLAQ IQELVAVEQG LRTGGLKSLD RDRLFNLKQK GFSDARLARL LGVREADVRA RRLQLEVRPV FKRVDSCAAE FASATAYMYS TYEEECEAEP TDRRKIMVLG GGPNRIGQGI EFDYCCVHAA LAMREDGYET IMVNCNPETV STDYDTSDRL YFEPLTLEDV LAIVEIERPE GIIVQYGGQT PLKLARDLEA AGAPIIGTTP DSIDLAEDRE RFQGLINKLG LKQPPNRTAR SADQALRLAA EIGYPLVVRP SYVLGGRAMD IVYGEDELLQ YMHEAVRVSN DSPVLLDRFL DDAVEVDVDA ICDGEDVLIG GIMEHIEQAG VHSGDSACSL PPYTLAPDVQ DRLREQTRAL ALELGVVGLM NIQFAIKGSD VYLLEVNPRA SRTVPYVSKA IGTPLAKVAA RCMAGQTLAA QGITREVIPA YYSVKEAVFP FIKFPGVDPI LGPEMKSTGE VMGIGRSFGE AYAKSQVAAS VKLPRSGRCF ISVRDVDKPG AIEVARELIR RGFSLVATRG TAAALSEAGV ECDVINKVLE GRPHIVDAIK NDEIDLIVNT TEGRQAIADS YSIRREALQH KVCYTTTING ARATLLALDY LDAADVNRLQ DLHREATA
|
| |