Gene Information |
Locus tag | Mpal_2319 |
Symbol | carB |
ID | 7270580 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | + |
Start bp | 2465458 |
End bp | 2468628 |
Gene Length | 3171 bp |
Protein Length | 1056 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643570924 |
Product | carbamoyl phosphate synthase large subunit |
Protein accession | YP_002467327 |
Protein GI | 219852895 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit |
|
|
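The Start bp, End bp, Gene Length and Protein Length fields above are consistent with one another. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon of the unspliced CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields of Mpal_2319.
length_bp = gene_length(2465458, 2468628)
print(length_bp)                  # 3171
print(protein_length(length_bp))  # 1056
```

Both results match the Gene Length (3171 bp) and Protein Length (1056 aa) reported in the record.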
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGAGGC GTACTGATAT CAAGAAGGTT CTCTTGATCG GTTCCGGACC AATCCAGATC GGACAGGCCG CTGAGTTCGA CTTCTCAGGT TCACAAGCCT GCAAGTCCCT CCGCGAAGAA GGGATTGAGG TGGTACTGGT CAACTCCAAC CCGGCGACGA TCCAAACCGA TCCCGAAACG GCTGACACGA TCTATATCGA ACCCCTGAGG GCCTCGATCA TCGCAAAGAT CATCGAAAAA GAGAAACCCG ATGGGATTCT TTCGGGGATG GGCGGACAGA CCGGTCTGAA CCTGACCGCA GAACTGGCAG AGCTCGGAGC CCTCCGAAAT GTCGAAATTC TTGGCACCCC GCTCGAGGCG ATCTACCAGG GAGAGGACCG AGAGAAGTTC AAGGCCCTGA TGCAGAAGAT AGGAGAACCG GTCCCGAGAA GCATGATCTT AAACCGGCTC GACCAGCTTG GCGAGGTGAT CGAAAAGGTC GGACTGCCAG TGATCATCAG GCCGGCCTAC ACCCTCGGGG GCGCCGGTGG CGGTATCGCC CATACCGTCG ACGAACTCAA ACGGATCGTC GAAATTGGCC TGCAGCGCTC ACGGATCCAC CAGGTACTGA TCGAAGAGAG CGTGATGGGC TGGAAGGAAC TCGAGTTCGA AGTGATGCGC GATGCGAAAG ACACCTGTGT GATCATCTGC TCGATGGAGA ATGTGGACCC TATGGGGGTT CACACAGGGG AGAGCGTGGT CGTCGCCCCG ATTCTGACGT TGCGGGACGA CGAATACCAG ATGATGCGGT CGGCTTCGAT CAAGATCATA AGGGCGCTCG ATGTGCAGGG AGGGTGTAAC ATTCAGTTCG CCTTTCAGGA CGGCGACTAC CGTGTGATCG AGGTAAACCC CCGGGTCTCT CGTTCATCTG CCCTCGCCTC CAAGGCGACC GGGTATCCGA TCGCGCGAGT TGCGGCCAAG ATCGCCATCG GCATGCACCT CGATGAGATC ACCAATGCCG TCACCGGATG CACACCGGCT TCGTTCGAGC CGTCGATCGA TTATGTGGTC GTCAAGGTCC CGCGCTGGCC GTTCGACAAG TTCACGAGGG CCGACCGGAC CCTAACGACG GCGATGAAGT CCACCGGCGA GGTGATGGCC ATCGGCCGGA CCCTCGAGGA AGGATTTAAG AAGGCGCTCC GTTCGATCGA CACCGATATC AACACCCATA CCAACCACAA CGAGATCAGG ATGATCCTGA CCAGCCCGAC CGATGAACGA TTCGGGTGCA TCTTTGACGC GTTCAGGGAG GGGTTCACGG TGGACGAGAT CGCATCCCTC ACCTCGATCA ATCCGTTCTT CCTTCACAAG ATGGAGAATA TCGTGAAGAT CGAGCGAACT CTCGCGACCG AGCCGACCGA TCTCAGGATC CAGGAGGCCG CTGCCGCCGG GTTCTCGATG AAGGAGATCG CAGAACTGAC CGGTCGACCG GTCGATGAGG TTCGAACCGC CGCCGGCGAT CCGGTCTACA AGATGGTCGA CACCTGTGCA GCCGAGTTCC CGGCCACGAC TCCGTACTAC TACTCGACCC ACGGGGTGAC CACCGATATC ATCCAGAACG ATAAGAAGAA GGTGCTGATC CTCGGGTCAG GGCCGATCCG GATTGGACAG GGGATCGAAT TCGATTACTG TACCGTCCAT GCCGTTAAGG CCCTACGGGA GGAGGGGGTC GAGGTCCATA TCGTCAACAA CAACCCCGAG ACCGTCTCAA CTGACTTCGA CACCTCGGAC CAGCTCTTCT TTGAACCGAT GCAACTCGAG 
GATGTCGTGA ACATCCTTAA AACGGACGAT TACTTTGGAG TGATGGTGCA GTTCGGAGGA CAGAACGCTG TCAATCTGGC CCTGCCGCTG CTGCAGGAGA TCAAAAAACT CGGCCTCCCA ACCGCGATCC TGGGCTCGTC CCCGGACGCA ATGGATATCG CCGAGGACCG GGACCGGTTC AGCGAACTGC TGGACGCCCT CAAGATTCCA TCGCCGCCGA ACAGTTCGGC CTACTCTGAG GAGGCCGCAC TGGCCATGGC CAATAAGATC GGCTTCCCGG TGCTGGTCCG CCCCAGTTAC GTACTCGGCG GGAGGGCGAT GGAGATCGTC CACGACAATT TCGAACTCGA GTCGTACATG AAGGAAGCAA TGCGGGTGAG CAAAAGCCAC CCAGTACTGA TCGATTCGTT CCTGCAGGAG GCCATCGAGC TGGACGTGGA CGCGGTCTGC GACGGCGACG AGGTGCTGAT CGGCGGGATC ATGGAGCATA TCGAGGAGGC TGGGGTTCAT TCGGGCGATT CGGCCTGTGT GATCCCGACC CAGTCCCTCT CCGATTCGGT GCTTGCCCGG GTCAGGGAGT ATACCAAGAA GATCGCCATG GGGCTCGGCG TTGTGGGGCT CGTAAACATT CAGCTGGCTG TCAAGGACGA CATCGTTTAT GTGCTTGAGG CCAACCCCCG AGCATCCCGT ACGGTCCCCT TCGTTTCCAA GGCAACCGGG ATCCCGCTGG CCAAGGTGGC CGCGAAGGTG ATGATCGGAA AGAAACTGAA GGACCTCGGG TATAAGGAGC GCACGTTCCG GCATGTGGCG GTGAAGGAGG TGCTTCTCCC ATTCAATAAA CTCCCCGGGG TTGACACCGT CCTCGGTCCA GAGATGAAAT CCACCGGCGA GGTGATGGGG ATCGACTACG ACTTCGGCAG GGCCTACTAC AAAGCCTGCA TATCAGCGGA CAACGAACTC CCGATCGAAG GGAACGTCTT CATCTCGGTC TCGACTGAGC AGAAGGAGGA GGTCCGAAGG ATCGCAGCTC AGCTCCGGGA CCTCGGGCTG ACCCTCTTTG GAACGAAAGG GACCGTCGAG ACGCTGATGC AGGCTGGGAT CGAGGCGAAC CTGGTCAGAA AGGTCCAGGA AGGATCCCCG AATGTGATCG ATATGGTGCG TAAGGGTGAG ATCAGGCTGA TCATCAACAC CCCGGTGGAC AAGCAGTCCC GGCTCGACCA TTACCAGATC ATGCGAGCGG CCGTCGACTA CGGGATCCCG TACATCACGA CCCTGCAGGC AGCCCGGGCA GCTGCGCTGG CCATCGATGC GATCAAGCGG GAGAAGATCA CGCTGGAACC AATCAGCCAT TATCTTTCAG AGGTTGAATG A
|
Protein sequence | MPRRTDIKKV LLIGSGPIQI GQAAEFDFSG SQACKSLREE GIEVVLVNSN PATIQTDPET ADTIYIEPLR ASIIAKIIEK EKPDGILSGM GGQTGLNLTA ELAELGALRN VEILGTPLEA IYQGEDREKF KALMQKIGEP VPRSMILNRL DQLGEVIEKV GLPVIIRPAY TLGGAGGGIA HTVDELKRIV EIGLQRSRIH QVLIEESVMG WKELEFEVMR DAKDTCVIIC SMENVDPMGV HTGESVVVAP ILTLRDDEYQ MMRSASIKII RALDVQGGCN IQFAFQDGDY RVIEVNPRVS RSSALASKAT GYPIARVAAK IAIGMHLDEI TNAVTGCTPA SFEPSIDYVV VKVPRWPFDK FTRADRTLTT AMKSTGEVMA IGRTLEEGFK KALRSIDTDI NTHTNHNEIR MILTSPTDER FGCIFDAFRE GFTVDEIASL TSINPFFLHK MENIVKIERT LATEPTDLRI QEAAAAGFSM KEIAELTGRP VDEVRTAAGD PVYKMVDTCA AEFPATTPYY YSTHGVTTDI IQNDKKKVLI LGSGPIRIGQ GIEFDYCTVH AVKALREEGV EVHIVNNNPE TVSTDFDTSD QLFFEPMQLE DVVNILKTDD YFGVMVQFGG QNAVNLALPL LQEIKKLGLP TAILGSSPDA MDIAEDRDRF SELLDALKIP SPPNSSAYSE EAALAMANKI GFPVLVRPSY VLGGRAMEIV HDNFELESYM KEAMRVSKSH PVLIDSFLQE AIELDVDAVC DGDEVLIGGI MEHIEEAGVH SGDSACVIPT QSLSDSVLAR VREYTKKIAM GLGVVGLVNI QLAVKDDIVY VLEANPRASR TVPFVSKATG IPLAKVAAKV MIGKKLKDLG YKERTFRHVA VKEVLLPFNK LPGVDTVLGP EMKSTGEVMG IDYDFGRAYY KACISADNEL PIEGNVFISV STEQKEEVRR IAAQLRDLGL TLFGTKGTVE TLMQAGIEAN LVRKVQEGSP NVIDMVRKGE IRLIINTPVD KQSRLDHYQI MRAAVDYGIP YITTLQAARA AALAIDAIKR EKITLEPISH YLSEVE
|
| |
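The relationship between the gene sequence and the protein sequence can be checked by translating codons. The following is a minimal sketch, not the database's own method. It builds the codon table from the compact NCBI layout (base order TCAG); the amino-acid assignments of translation table 11 are the same as those of the standard table, and the two differ only in which codons may act as start codons. The helper name `translate` is hypothetical, and only the first 30 and last 15 nucleotides of the gene are used.

```python
from itertools import product

# Codon table in NCBI order: first base varies slowest, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the blanks between 10-mers
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nt and last 15 nt of the gene sequence above.
print(translate("ATGCCGAGGC GTACTGATAT CAAGAAGGTT"))  # MPRRTDIKKV
print(translate("TCAGAGGTTGAATGA"))                   # SEVE
```

The two outputs agree with the first ten residues (MPRRTDIKKV) and the last four residues (SEVE) of the protein sequence in the record.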