Gene Mboo_0834 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_0834 
SymbolcarB 
ID5410464 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp810828 
End bp813989 
Gene Length3162 bp 
Protein Length1053 aa 
Translation table11 
GC content57% 
IMG OID640868059 
Productcarbamoyl phosphate synthase large subunit 
Protein accessionYP_001403995 
Protein GI154150377 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit
[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.249678 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAAAC GCACAGATAT CAAAAAAGTC CTCCTCATCG GGTCCGGCCC CATCCAGATC 
GGCCAGGCTG CAGAGTTCGA TTTCTCCGGC TCACAGGCAT GCCGCTCCCT TAAGGAAGAG
GGAATTGAGG TCGTGCTGGT CAACTCGAAC CCTGCTACGA TCATGACCGA CCCGGACATG
GCCGACCAGA TCTACATCGA GCCGCTCCGG GCAAATATCA TCGCAAAGAT TATTGAGAAA
GAACGGCCGG ACGGGATCCT CTCCGGCATG GGCGGCCAGA CCGGCCTCAA CCTTACCGCG
GAGCTTGCGG AGATGGGGGC GCTCAAAGGG GTCGAGATCC TCGGTACTCC CCTTGAGGCA
ATCTACAAGG GAGAAGACCG GGAAAAATTC CGGGATCTCA TGAACGAGAT CGGCGAGCCG
GTCCCAAAGA GCATCGTGCT TAACACCTTG AGCCAGATCG ATGACGCGAT TGCAAAGATC
GGCCTGCCTG CGGTTGTCCG CCCCGCATAT ACCCTGGGTG GAGCCGGGGG CGGCATCGGC
AGGACCCGGG AAGAGCTGAC CCGCATCGTG GAACTGGGGC TTTCCCGCTC CCGCATCCAC
CAGGTGCTCA TCGAAGAGAG CGTGATGGGC TGGAAGGAAC TGGAATTCGA GGTCATGCGG
GACTCGACTG ACACCTGCAT TATCGTGTGC AGCATGGAAA ATGTTGATCC CATGGGCATC
CACACCGGGG AGAGCGTGGT TGTTGCCCCC ATCCTGACGC TGCGTGACGA TGAGTTCCAG
ATGATGCGCA GCGCTGCGAT CCATATCATC CGTGCGCTCG ATGTCCAGGG CGGCTGCAAT
ATCCAGTTCG CGTTTAAAGA TGGAGACTAC CGGGTTATCG AGGTCAACCC GCGTGTCTCC
CGGTCCTCGG CACTTGCCTC CAAGGCCACC GGTTATCCCA TTGCCCGGGT CGCAGCAAAG
ATCGCCATCG GCCTGCGCCT TGACGAGATC AAAAACTCGG TGACCGGATG CACTGCAGCG
TCGTTTGAAC CGACCATCGA TTATATCGTA GTCAAGGTCC CGCGCTGGCC CTTTGACAAG
TTCAAGGGTG CGGACCGGAC GCTTACTACC TCGATGAAGA GTACCGGTGA GGTCATGGCC
ATCGGAAGGA CTCTTGAAGA ATCCTTCATG AAGGCGAAAA GGTCGATCGA TACCGATGTC
CGGACCCACA CGAGCCCAAG CGAAATCCGG ATGATCCTCT CCCGCCCGAC CGATGAGCGG
TTCCACTGTC TCTTTGATGC ATTCCGTCAG GGTTTTACCC TTGATGAGAT CGCCGGCCTT
ACCTCCATTG TACCGTTTTT CCTGGAAAAG ATCAAAAACA TCGTGGACCT GGAAAAACGC
CTTGCTGCGG GGTGCACGGA TGAGGATATC TTCCTTGCAA AACGGTACGG GTTTGCCAAC
ACCGAGATTG CCGCACTCAC CGGCAGGGGT GCGGATACCA TCGAGGCTCT GGTGGGTGCC
CCGGCCTACA AGATGGTGGA TACCTGCGCA GCCGAATTTC CGGCAAGCAC CCCCTATTTC
TACTCGACCC GGGAAGGTAC GAGCGAGATT GTCAGGGACA AAAAACAAAA AATCCTCATC
CTTGGTTCCG GCCCGATACG GATCGGGCAG GGAATCGAGT TCGATTACTG CACGGTCCAT
GCGGTAAAAG CACTGCGGGA GGAGGGTGTT GAAGTCCATA TCGTGAACAA CAACCCGGAG
ACGGTCTCCA CCGATTTCGA CACCTCCGAC CGCCTGTTCT TTGAGCCCAT GCTGCTTGAG
GATGTCACAA ACATCCTTAT GACCGATGAG TATTACGGGG TGATGGTGCA GTTTGGCGGC
CAGAATGCGG TAAACCTTGC AGTTCCCCTG GAAAAAGAGC TGAAACGGCG GGGGATGTGC
ACACGGATCC TCGGTACCTC CCCGGATGCC ATGGATATTG CCGAGGACCG TGACCGGTTC
AGCGTTCTCC TCACTACCCT GCAAATCCCA AGCCCCGCGA ACAGCTCTGC GTATTCGGAG
GCAGAAGCCC GGGAGAAGGC CGAGCGGATT GGCTACCCGG TCCTTGTCCG GCCCTCGTAT
GTGCTGGGCG GCCGGGCAAT GGAGATCGTC CACAATACTG CCGAACTTGA AACGTACATG
AAGGAGGCGG TCCGGGTGAG CCAGCACCAC CCGGTGCTGA TTGACTCCTA CCTGAGGAAT
GCCATAGAAC TCGATGTCGA TGCAGTCTGC GATGGCAAAG AGGTCCTTAT CGGGGGGATC
ATGGAGCACA TCGAACAGGC CGGCATCCAT TCCGGGGATT CTGCCTGTGT CATCCCCACG
CAATCGCTTT CCCCTGAAGT TATCGCCACG GTCCGGGAAT ACACCAAAAA GATCGCACTT
GGCCTTGGGG TTGTGGGCCT GGTCAACATC CAGATGGCGG TTAAGGACAA TGTCGTGTAT
ATCCTCGAAG CAAACCCCCG GGCGAGCCGG ACCGTTCCCT TTGTCTCAAA AGCCACCGGT
CTCCCCATTG CAAAGATCGC GGCAAAAGTG ATGATCGGAA AGAAACTGTG TGACCTTGGT
TTCCACGAAG CAAAGATCAG GCATGTGGCA GTAAAAGAGG TGCTCCTTCC CTTTAACAAG
CTGGCCGGTG TTGATACCAT CCTTGGGCCC GAGATGAAGA GCACCGGGGA AGTCATGGGA
ATCGATTACG ATTTCGGGCT TGCCTTTTAC AAAGCCTGCA TCTCGGCCGA CAACGAGCTG
CCGCTCAAGG GAAATGTCTT TGTCTCGGTT AATATCGGCC AGAAAGACGA GGTTATCCCC
ATTGCCCGGC GTCTCCGTGA TCTCGGCCTC ACCCTGTACG GGACGGAAGG GACGGTCGAT
TACCTGCATG AAGCGGGCGT AGAGGCGCAC CTGGTACGGA AAGTCCAGGA AGGCTCCCCC
AATGTGCTGG ACATGATGCA CCACGGCGAG ATCCGGCTGA TCATCAACAC GCCCCAGGAC
CGGCAGTCGC GCCAGGATCA TTACCAGATC ATGCGTGCGG CAGTGGATTT CCAAATCCCC
TATATTACGA CCCTTCAGGC AGCGCGGGCC GCAGCGCTTG CGATCGATGC AATCAAGCGC
GAAAAAATAA CGCTCGAACC GCTGAGCCAT TATCTCCGCT GA
 
Protein sequence
MPKRTDIKKV LLIGSGPIQI GQAAEFDFSG SQACRSLKEE GIEVVLVNSN PATIMTDPDM 
ADQIYIEPLR ANIIAKIIEK ERPDGILSGM GGQTGLNLTA ELAEMGALKG VEILGTPLEA
IYKGEDREKF RDLMNEIGEP VPKSIVLNTL SQIDDAIAKI GLPAVVRPAY TLGGAGGGIG
RTREELTRIV ELGLSRSRIH QVLIEESVMG WKELEFEVMR DSTDTCIIVC SMENVDPMGI
HTGESVVVAP ILTLRDDEFQ MMRSAAIHII RALDVQGGCN IQFAFKDGDY RVIEVNPRVS
RSSALASKAT GYPIARVAAK IAIGLRLDEI KNSVTGCTAA SFEPTIDYIV VKVPRWPFDK
FKGADRTLTT SMKSTGEVMA IGRTLEESFM KAKRSIDTDV RTHTSPSEIR MILSRPTDER
FHCLFDAFRQ GFTLDEIAGL TSIVPFFLEK IKNIVDLEKR LAAGCTDEDI FLAKRYGFAN
TEIAALTGRG ADTIEALVGA PAYKMVDTCA AEFPASTPYF YSTREGTSEI VRDKKQKILI
LGSGPIRIGQ GIEFDYCTVH AVKALREEGV EVHIVNNNPE TVSTDFDTSD RLFFEPMLLE
DVTNILMTDE YYGVMVQFGG QNAVNLAVPL EKELKRRGMC TRILGTSPDA MDIAEDRDRF
SVLLTTLQIP SPANSSAYSE AEAREKAERI GYPVLVRPSY VLGGRAMEIV HNTAELETYM
KEAVRVSQHH PVLIDSYLRN AIELDVDAVC DGKEVLIGGI MEHIEQAGIH SGDSACVIPT
QSLSPEVIAT VREYTKKIAL GLGVVGLVNI QMAVKDNVVY ILEANPRASR TVPFVSKATG
LPIAKIAAKV MIGKKLCDLG FHEAKIRHVA VKEVLLPFNK LAGVDTILGP EMKSTGEVMG
IDYDFGLAFY KACISADNEL PLKGNVFVSV NIGQKDEVIP IARRLRDLGL TLYGTEGTVD
YLHEAGVEAH LVRKVQEGSP NVLDMMHHGE IRLIINTPQD RQSRQDHYQI MRAAVDFQIP
YITTLQAARA AALAIDAIKR EKITLEPLSH YLR