Gene Mpal_2319 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_2319 
SymbolcarB 
ID7270580 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp2465458 
End bp2468628 
Gene Length3171 bp 
Protein Length1056 aa 
Translation table11 
GC content58% 
IMG OID643570924 
Productcarbamoyl phosphate synthase large subunit 
Protein accessionYP_002467327 
Protein GI219852895 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAGGC GTACTGATAT CAAGAAGGTT CTCTTGATCG GTTCCGGACC AATCCAGATC 
GGACAGGCCG CTGAGTTCGA CTTCTCAGGT TCACAAGCCT GCAAGTCCCT CCGCGAAGAA
GGGATTGAGG TGGTACTGGT CAACTCCAAC CCGGCGACGA TCCAAACCGA TCCCGAAACG
GCTGACACGA TCTATATCGA ACCCCTGAGG GCCTCGATCA TCGCAAAGAT CATCGAAAAA
GAGAAACCCG ATGGGATTCT TTCGGGGATG GGCGGACAGA CCGGTCTGAA CCTGACCGCA
GAACTGGCAG AGCTCGGAGC CCTCCGAAAT GTCGAAATTC TTGGCACCCC GCTCGAGGCG
ATCTACCAGG GAGAGGACCG AGAGAAGTTC AAGGCCCTGA TGCAGAAGAT AGGAGAACCG
GTCCCGAGAA GCATGATCTT AAACCGGCTC GACCAGCTTG GCGAGGTGAT CGAAAAGGTC
GGACTGCCAG TGATCATCAG GCCGGCCTAC ACCCTCGGGG GCGCCGGTGG CGGTATCGCC
CATACCGTCG ACGAACTCAA ACGGATCGTC GAAATTGGCC TGCAGCGCTC ACGGATCCAC
CAGGTACTGA TCGAAGAGAG CGTGATGGGC TGGAAGGAAC TCGAGTTCGA AGTGATGCGC
GATGCGAAAG ACACCTGTGT GATCATCTGC TCGATGGAGA ATGTGGACCC TATGGGGGTT
CACACAGGGG AGAGCGTGGT CGTCGCCCCG ATTCTGACGT TGCGGGACGA CGAATACCAG
ATGATGCGGT CGGCTTCGAT CAAGATCATA AGGGCGCTCG ATGTGCAGGG AGGGTGTAAC
ATTCAGTTCG CCTTTCAGGA CGGCGACTAC CGTGTGATCG AGGTAAACCC CCGGGTCTCT
CGTTCATCTG CCCTCGCCTC CAAGGCGACC GGGTATCCGA TCGCGCGAGT TGCGGCCAAG
ATCGCCATCG GCATGCACCT CGATGAGATC ACCAATGCCG TCACCGGATG CACACCGGCT
TCGTTCGAGC CGTCGATCGA TTATGTGGTC GTCAAGGTCC CGCGCTGGCC GTTCGACAAG
TTCACGAGGG CCGACCGGAC CCTAACGACG GCGATGAAGT CCACCGGCGA GGTGATGGCC
ATCGGCCGGA CCCTCGAGGA AGGATTTAAG AAGGCGCTCC GTTCGATCGA CACCGATATC
AACACCCATA CCAACCACAA CGAGATCAGG ATGATCCTGA CCAGCCCGAC CGATGAACGA
TTCGGGTGCA TCTTTGACGC GTTCAGGGAG GGGTTCACGG TGGACGAGAT CGCATCCCTC
ACCTCGATCA ATCCGTTCTT CCTTCACAAG ATGGAGAATA TCGTGAAGAT CGAGCGAACT
CTCGCGACCG AGCCGACCGA TCTCAGGATC CAGGAGGCCG CTGCCGCCGG GTTCTCGATG
AAGGAGATCG CAGAACTGAC CGGTCGACCG GTCGATGAGG TTCGAACCGC CGCCGGCGAT
CCGGTCTACA AGATGGTCGA CACCTGTGCA GCCGAGTTCC CGGCCACGAC TCCGTACTAC
TACTCGACCC ACGGGGTGAC CACCGATATC ATCCAGAACG ATAAGAAGAA GGTGCTGATC
CTCGGGTCAG GGCCGATCCG GATTGGACAG GGGATCGAAT TCGATTACTG TACCGTCCAT
GCCGTTAAGG CCCTACGGGA GGAGGGGGTC GAGGTCCATA TCGTCAACAA CAACCCCGAG
ACCGTCTCAA CTGACTTCGA CACCTCGGAC CAGCTCTTCT TTGAACCGAT GCAACTCGAG
GATGTCGTGA ACATCCTTAA AACGGACGAT TACTTTGGAG TGATGGTGCA GTTCGGAGGA
CAGAACGCTG TCAATCTGGC CCTGCCGCTG CTGCAGGAGA TCAAAAAACT CGGCCTCCCA
ACCGCGATCC TGGGCTCGTC CCCGGACGCA ATGGATATCG CCGAGGACCG GGACCGGTTC
AGCGAACTGC TGGACGCCCT CAAGATTCCA TCGCCGCCGA ACAGTTCGGC CTACTCTGAG
GAGGCCGCAC TGGCCATGGC CAATAAGATC GGCTTCCCGG TGCTGGTCCG CCCCAGTTAC
GTACTCGGCG GGAGGGCGAT GGAGATCGTC CACGACAATT TCGAACTCGA GTCGTACATG
AAGGAAGCAA TGCGGGTGAG CAAAAGCCAC CCAGTACTGA TCGATTCGTT CCTGCAGGAG
GCCATCGAGC TGGACGTGGA CGCGGTCTGC GACGGCGACG AGGTGCTGAT CGGCGGGATC
ATGGAGCATA TCGAGGAGGC TGGGGTTCAT TCGGGCGATT CGGCCTGTGT GATCCCGACC
CAGTCCCTCT CCGATTCGGT GCTTGCCCGG GTCAGGGAGT ATACCAAGAA GATCGCCATG
GGGCTCGGCG TTGTGGGGCT CGTAAACATT CAGCTGGCTG TCAAGGACGA CATCGTTTAT
GTGCTTGAGG CCAACCCCCG AGCATCCCGT ACGGTCCCCT TCGTTTCCAA GGCAACCGGG
ATCCCGCTGG CCAAGGTGGC CGCGAAGGTG ATGATCGGAA AGAAACTGAA GGACCTCGGG
TATAAGGAGC GCACGTTCCG GCATGTGGCG GTGAAGGAGG TGCTTCTCCC ATTCAATAAA
CTCCCCGGGG TTGACACCGT CCTCGGTCCA GAGATGAAAT CCACCGGCGA GGTGATGGGG
ATCGACTACG ACTTCGGCAG GGCCTACTAC AAAGCCTGCA TATCAGCGGA CAACGAACTC
CCGATCGAAG GGAACGTCTT CATCTCGGTC TCGACTGAGC AGAAGGAGGA GGTCCGAAGG
ATCGCAGCTC AGCTCCGGGA CCTCGGGCTG ACCCTCTTTG GAACGAAAGG GACCGTCGAG
ACGCTGATGC AGGCTGGGAT CGAGGCGAAC CTGGTCAGAA AGGTCCAGGA AGGATCCCCG
AATGTGATCG ATATGGTGCG TAAGGGTGAG ATCAGGCTGA TCATCAACAC CCCGGTGGAC
AAGCAGTCCC GGCTCGACCA TTACCAGATC ATGCGAGCGG CCGTCGACTA CGGGATCCCG
TACATCACGA CCCTGCAGGC AGCCCGGGCA GCTGCGCTGG CCATCGATGC GATCAAGCGG
GAGAAGATCA CGCTGGAACC AATCAGCCAT TATCTTTCAG AGGTTGAATG A
 
Protein sequence
MPRRTDIKKV LLIGSGPIQI GQAAEFDFSG SQACKSLREE GIEVVLVNSN PATIQTDPET 
ADTIYIEPLR ASIIAKIIEK EKPDGILSGM GGQTGLNLTA ELAELGALRN VEILGTPLEA
IYQGEDREKF KALMQKIGEP VPRSMILNRL DQLGEVIEKV GLPVIIRPAY TLGGAGGGIA
HTVDELKRIV EIGLQRSRIH QVLIEESVMG WKELEFEVMR DAKDTCVIIC SMENVDPMGV
HTGESVVVAP ILTLRDDEYQ MMRSASIKII RALDVQGGCN IQFAFQDGDY RVIEVNPRVS
RSSALASKAT GYPIARVAAK IAIGMHLDEI TNAVTGCTPA SFEPSIDYVV VKVPRWPFDK
FTRADRTLTT AMKSTGEVMA IGRTLEEGFK KALRSIDTDI NTHTNHNEIR MILTSPTDER
FGCIFDAFRE GFTVDEIASL TSINPFFLHK MENIVKIERT LATEPTDLRI QEAAAAGFSM
KEIAELTGRP VDEVRTAAGD PVYKMVDTCA AEFPATTPYY YSTHGVTTDI IQNDKKKVLI
LGSGPIRIGQ GIEFDYCTVH AVKALREEGV EVHIVNNNPE TVSTDFDTSD QLFFEPMQLE
DVVNILKTDD YFGVMVQFGG QNAVNLALPL LQEIKKLGLP TAILGSSPDA MDIAEDRDRF
SELLDALKIP SPPNSSAYSE EAALAMANKI GFPVLVRPSY VLGGRAMEIV HDNFELESYM
KEAMRVSKSH PVLIDSFLQE AIELDVDAVC DGDEVLIGGI MEHIEEAGVH SGDSACVIPT
QSLSDSVLAR VREYTKKIAM GLGVVGLVNI QLAVKDDIVY VLEANPRASR TVPFVSKATG
IPLAKVAAKV MIGKKLKDLG YKERTFRHVA VKEVLLPFNK LPGVDTVLGP EMKSTGEVMG
IDYDFGRAYY KACISADNEL PIEGNVFISV STEQKEEVRR IAAQLRDLGL TLFGTKGTVE
TLMQAGIEAN LVRKVQEGSP NVIDMVRKGE IRLIINTPVD KQSRLDHYQI MRAAVDYGIP
YITTLQAARA AALAIDAIKR EKITLEPISH YLSEVE