Gene Mbar_A2374 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A2374 
SymbolcarB 
ID3626715 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp2999934 
End bp3003149 
Gene Length3216 bp 
Protein Length1071 aa 
Translation table11 
GC content47% 
IMG OID637701243 
Productcarbamoyl phosphate synthase large subunit 
Protein accessionYP_305875 
Protein GI73669860 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.128175 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.572238 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAAAAC GCGAGGACAT AAAAAAGGTT TTGCTTATAG GCTCAGGACC AATCACTATC 
GGACAGGCTG CAGAATTCGA CTTCTCAGGC AGCCAGGCCT GCAGGTCCTT AAAAGAAGAA
GGAATAAAGG TTGTCCTTGT AAACTCAAAT CCTGCAACCA TAATGACCGA TCCTGAAATG
GCTGATTCGG TCTATATCGA GCCACTTGAT GCCAAGATAG TAGAAAAGAT TATTGAAAAA
GAACGCCCAG ACGGAATTAT TGCAGGTATT GGAGGGCAGA CCGGCCTTAA TATTACCAGT
GAACTTGCGG AAAAGGGTGT CTTTGAGAAA TATGGGGTCG AAATTCTGGG AACTCCTGTT
GAAGCCATTA AAAATACCGA AGACAGGGAA CTCTTCAAAG AGACCATGCT CAGGATTGGA
GAAAAGGTTC CCTTAAGCCG GGCAGTTAAT TCTTTAAAAG AAGCCGAAGA TGTTGTTGAT
GAACTCGGTC TTCCTCTTAT TGTCCGTCCG GCATACACCC TTGGAGGAGC AGGGGGCGGA
ATTGCCCGCA CAAAAGAAGA GCTGCTTGAA ATTACGGAAC GTGGGCTCAG GCGCAGCCGT
ATCAACCAGG TACTCATTGA AGAAAGTGTG CTTGGCTGGG CAGAGATCGA GTATGAGGTC
ATGAGAGATG AAAACGATAC CTGCATCGTG ATCTGTAACA TGGAAAATAT TGACCCCATG
GGCGTGCATA CAGGAGAATC GGCTGTTGTT GCTCCTTCCC AAACTTTAAG CGATGCCGAG
CACCAGATGC TCAGGAGTGC CTCAATCAAG ATTATCCGAG CTCTCAAGAT CGAGGGTGGG
TGCAATATCC AGTACGCCTT AAAAGAAGGC GATTACCGCG TTGTCGAGGT AAATCCAAGG
GTTTCAAGGT CATCAGCCCT TGCATCCAAG GCTACAGGTT ACCCGATTGC CCGCGTAACT
GCAAAAATTG CAATTGGAAT GAAGCTTGAT GAGATCATAA ACAATGTTAC TAAGAGCACA
CCTGCCTCTT TTGAACCTGC TCTGGACTAC GTAATTACCA AAATTCCCAG GTGGCCTTTT
GATAAGTTCA CAACTGCAGA CAAAACCCTG ACTACAGCCA TGAAAAGTAC GGGAGAAGTC
ATGGCAATTG GCAGGACCAT TGAAGAATCC CTGCTAAAGG CTTTTAAGTC CCTGGATATC
GACAATCAGT TAGGAAATAA GCACTGGGAC GAGCCTGAAA CTAAAACTCT CCTTAAGACT
CCTACAAGCG AACGCCTTTT TGTTATCTTC GATGCACTTG AGAAGGGTAT GTCGGTAAAA
GAAATTTTCG AGCTTTCGAG CATCAACCCC TTCTTTATCT CAAAGATAAA AAGGATCGTG
GATATGGAAA AACGCATCAG GGCAGAAGAA CTCACTCCTG AACTCCTGCG TGAAGCAAAA
AAGATGGGCT TCCCTGATAC TCGCCTTGCC GAACTGACTG GCAGTACCAG GCAGGAAATA
AGTGACCTCA GACATAAAGC CGGAATCCTG GCTACCTTCA AGATGGTAGA TACCTGTGCA
GCCGAGTTTG AAGCAGCTAC TCCTTATTAT TATTCTACTT ACGAGGACTC CTGCGAGACA
AATGCCACCA CAGATAAGAA GAAGATTCTT ATTCTTGGTG CAGGCCCGAT AAGGATAGGA
CAGGGAATTG AGTTCGATTA CTGTACTGTG CATGCAGTTA CTGCACTTCG GGAAGAAGGC
ATAGAAACCC ACATTATTAA TAACAACCCC GAAACCGTAT CTACAGACTT TGATACCTCA
GACAAGCTCT TTTTTGAACC TCTTACCCTT GAGTATGTGA TGAACGTAAT CGAGCGTGAG
AAGCCTGATG GAGTACTTGT GCAGTTCGGA GGACAGACCT CGGTAAACCT TGCAATTCCC
CTCAAGCAGG AGTTGAAGCG CAGGACCGAC CTTAACACCG TAATTCTGGG CACGGACCCT
GATGACATGG ACCTTGCCGA AGACAGGGAA AAGTTCTATA TCCTTATGAA GGAGCTTGGC
GTTCCACAGC CTGAAGGTGG ATATGCAACT TCCCATAAGG AAGCAATCGA GGTTGCGAAG
CGGATCGGCT TCCCTGTGCT TGTGCGTCCT TCCTATGTGC TCGGCGGGCG GGCAATGGAA
ATAGTATACG ATGAAATCGA CCTTGAACGC TACATGAAAG AGGCAGTCAG GGTCTCTCAC
GAACACCCAA TACTGATTGA TGATTTCCTT GAGGCAGCCT CTGAAATCGA TGTGGATGCG
GTCTGCGACC AGAAAGACGT AATTATCGGC GCAATAATGG AGCATATCGA GGAAGCAGGT
GTTCACTCCG GAGATTCGGC CTGTGTAATT CCACCGCAGA GCCTCTCACC TGAAGTTCTT
GATCAGGTAA GGGACTATAC CCGCAAAATA GCTCTTGCCC TCAAGGTTAA AGGACTGATT
AATATCCAGA TGGCAGAAAA ATGTGGGAAG GTCTATGTAC TTGAAGCAAA CCCGCGTTCA
AGCAGGACAA TTCCTTTTGT TTCAAAATCC GTTGGAATCC CACTTGCAAA GATTGCAGCC
AAGGTAATTG CAGGACACAG CTTAAAGAGC CTGGGCTACA CGGACGAGCC AAAACCCAAA
CATGTCTCAA TTAAGGAAGT CCTCCTGCCT TTCGATAAAT TACCAGGTGC AGACCCTGTC
CTTGGCCCGG AAATGAAAAG CACTGGCGAG GTAATGGGGA TTGACTACGA CTTCGGAAGG
GCATATTATA AAGCCGAACT TGCAGCCGAC AATGTCTTAC CTCTTACCGG AAAAGTCTTC
CTTTCCATAA GGAATGCAGA CAAAACCGAA CTTGTGGACG TTGCAAAGAA ACTGCAGGCA
GCAGGTCTTG AACTTATGGG CACAGAGGGC ACTGTAAACT ATCTTGCACG GCACGGGGTC
TTCATGGATG TAGTGAAAAA GGTCCATGAC GGAAGCCCGA ATGTCATAGA TATGATGCGC
AGGGACGAGG TTGACCTTAT CATCAATACC CCGACAAGCA AACAGTCTCG CAGGGACGGT
TCAAGGATCA GGCGGGCTGC TGTTGATTTC AAAGTCCCGT ACATCACCAC AATGCAGGCC
GCAATAGCCG CAGCCGCTGC CATAGAAACT ATGAAGAAAG GAGAGGAACT TACAATTAAA
TCCATCAATG AGTACCACAA AGAGATGGAA AATTAA
 
Protein sequence
MPKREDIKKV LLIGSGPITI GQAAEFDFSG SQACRSLKEE GIKVVLVNSN PATIMTDPEM 
ADSVYIEPLD AKIVEKIIEK ERPDGIIAGI GGQTGLNITS ELAEKGVFEK YGVEILGTPV
EAIKNTEDRE LFKETMLRIG EKVPLSRAVN SLKEAEDVVD ELGLPLIVRP AYTLGGAGGG
IARTKEELLE ITERGLRRSR INQVLIEESV LGWAEIEYEV MRDENDTCIV ICNMENIDPM
GVHTGESAVV APSQTLSDAE HQMLRSASIK IIRALKIEGG CNIQYALKEG DYRVVEVNPR
VSRSSALASK ATGYPIARVT AKIAIGMKLD EIINNVTKST PASFEPALDY VITKIPRWPF
DKFTTADKTL TTAMKSTGEV MAIGRTIEES LLKAFKSLDI DNQLGNKHWD EPETKTLLKT
PTSERLFVIF DALEKGMSVK EIFELSSINP FFISKIKRIV DMEKRIRAEE LTPELLREAK
KMGFPDTRLA ELTGSTRQEI SDLRHKAGIL ATFKMVDTCA AEFEAATPYY YSTYEDSCET
NATTDKKKIL ILGAGPIRIG QGIEFDYCTV HAVTALREEG IETHIINNNP ETVSTDFDTS
DKLFFEPLTL EYVMNVIERE KPDGVLVQFG GQTSVNLAIP LKQELKRRTD LNTVILGTDP
DDMDLAEDRE KFYILMKELG VPQPEGGYAT SHKEAIEVAK RIGFPVLVRP SYVLGGRAME
IVYDEIDLER YMKEAVRVSH EHPILIDDFL EAASEIDVDA VCDQKDVIIG AIMEHIEEAG
VHSGDSACVI PPQSLSPEVL DQVRDYTRKI ALALKVKGLI NIQMAEKCGK VYVLEANPRS
SRTIPFVSKS VGIPLAKIAA KVIAGHSLKS LGYTDEPKPK HVSIKEVLLP FDKLPGADPV
LGPEMKSTGE VMGIDYDFGR AYYKAELAAD NVLPLTGKVF LSIRNADKTE LVDVAKKLQA
AGLELMGTEG TVNYLARHGV FMDVVKKVHD GSPNVIDMMR RDEVDLIINT PTSKQSRRDG
SRIRRAAVDF KVPYITTMQA AIAAAAAIET MKKGEELTIK SINEYHKEME N