Gene Mkms_2414 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_2414 
SymbolcarB 
ID4613237 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp2530101 
End bp2533439 
Gene Length3339 bp 
Protein Length1112 aa 
Translation table11 
GC content68% 
IMG OID639792083 
Productcarbamoyl phosphate synthase large subunit 
Protein accessionYP_938402 
Protein GI119868450 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.271398 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.318582 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCACGTC GGACAGACCT GCGCCACGTC CTGGTGATCG GCTCCGGGCC GATCCTGATC 
GGACAGGCCG CCGAATTCGA CTACTCCGGG ACCCAGGCCT GCCGCGTGCT GCGCGCCGAA
GGCCTCACGG TCACACTGAT CAACTCCAAC CCGGCGACGA TCATGACCGA CCCGGAGTAC
GCCGACTACA CCTACGTCGA ACCCATCACC CCGGACTTCG TCGAACGGGT GATCGCCCAG
CAGGCCGAAC GCGGTAACAA GATCGATGCG CTGCTGGCCA CCCTCGGCGG GCAGACCGCG
CTCAACACCG CGGTCGCGCT GTCGGAGAAC GGAGTGCTCG AACGCTACGA CGTCGAGTTG
ATCGGCGCCG ACTTCGACGC GATCCAGCGT GGCGAGGACC GGCAGCGGTT CAAGGACATC
GTCACCAAAG TCGGTGGGGA GTCGGCGAAG TCGAGAGTCT GTTTCACGAT GGAGGAAGTC
CGCGAGACCG TCGGGGAACT GGGCTTGCCC GTCGTCGTGC GGCCGTCGTT CACGATGGGC
GGTCTGGGCT CAGGCATGGC GTACTCGGCC GAGGACGTCG AGCGGATGGC GGGCCACGGC
CTGGCCTCGT CGCCGAGCGC CAATGTGCTG ATCGAGGAAT CGATCTTCGG CTGGAAGGAA
TACGAGCTCG AGTTGATGCG CGACCGCCAC GACAACGTGG TGGTGGTGTG CTCGATCGAG
AACTTCGACC CGATGGGCGT GCACACCGGC GATTCGGTGA CCGTCGCCCC GGCGATGACG
CTGACCGACC GCGAATACCA GACCATGCGC GACCTCGGCA TCGCGATCCT GCGGGAGGTC
GGTGTCGCGA CCGGCGGATG CAACATCCAG TTCGCGGTGA ACCCGAAAGA CGGCCGGCTC
ATCGTCATCG AGATGAACCC GCGGGTGTCG CGGTCGAGTG CGCTGGCGTC GAAGGCCACC
GGCTTCCCGA TCGCCAAGAT CGCGGCGAAG CTGGCGATCG GCTACACCCT CGACGAGATC
CTCAACGACA TCACCAAGGA GACCCCGGCC TGCTTCGAGC CGACGCTGGA CTACGTCGTG
GTCAAGGCGC CGCGGTTCGC GTTCGAGAAG TTCCCCGGCG CCGACGCCAC CCTGACCACG
ACGATGAAAT CGGTCGGCGA GGCGATGTCG TTGGGCCGCA ACTTCATCGA GGCGCTCGGC
AAGGTGATGC GTTCTCTGGA GACCGGTCGT GCCGGCTTCT GGACCGCGCC GGACCCGATC
GCCACCGTCG ACGAGGTGCT GGAGAACCTG CGCACCCCGA CCGATGGGCG GCTCTACGAC
ATCGAGTTCG CGCTGCGGCT CGGGGCGTCG GTCGAGCAGG TCGCGGAGGC CTCCGGTGTC
GACCCGTGGT TCGTCGACCA GATCGCCGGG CTGGTGGCGC TGCGCACCGA ACTCCTCGAC
GCCCCGGTCC TCGACGGCAC GCTGCTGCGC CGGGCCAAGA ACAGCGGGCT GTCCGACCGC
CAGATCGCCG CGCTGCGCCC GGAACTCGCC GGCGAGGTCG GGGTGCGGGC ACTGCGTCAG
CGCCTGGGCA TCCACCCGGT GTTCAAGACC GTCGACACCT GCGCCGCGGA GTTCGAGGCC
AAGACGCCCT ACCACTACAG CAGCTACGAA CTCGACCCCG CGGCCGAGTC GGAGGTGGCG
CCGCAGGCCG AGCGGCCCAA GGTGCTGATT CTCGGCTCCG GGCCCAACCG GATCGGGCAG
GGCATCGAAT TCGACTACAG CTGTGTGCAT GCCGCGACCA CCCTGAGCGA GGCCGGGTTC
GAGACGGTGA TGATCAACTG CAACCCGGAG ACGGTCTCCA CCGACTACGA CACCGCCGAC
CGGTTGTACT TCGAGCCGTT GACCTTCGAG GACGTCCTCG AGATCTACTA CGCCGAATCA
GCCTCCGGCG CAGGCGGACC CGGCGTGGCC GGGGTGATCG TGCAACTCGG CGGCCAGACG
CCGCTGGGCC TGGCCGAACG GCTCGAGCAG GCCGGTGTCC CGATCGTCGG CACCAGCCCC
AAGGCCATCG ACCTGGCCGA GGACCGCGGC GCATTCGGTG AGGTGCTGCG CACCGCCGGG
CTGCCCGCAC CGCGTTTCGG CCTGGCCACC ACGTTCGACC AGGCCCGCCG CATCGCCGCC
GACATCGGCT ACCCCGTGCT GGTGCGGCCG TCCTACGTGC TGGGCGGCCG GGGTATGGAG
ATCGTCTACG ACGAGCAGAC CCTCGAGGGT TACATCACCC GGGCCACCCA GCTCTCGCCG
GAACACCCGG TGCTGGTCGA CCGGTTCCTC GAGGATGCGA TCGAGATCGA CGTCGACGCC
CTGTGCGACG GCACCGAGGT CTACATCGGC GGCATCATGG AGCACATCGA GGAGGCCGGC
ATCCACTCCG GCGACTCGGC GTGTGCGCTG CCCCCGGTGA CGTTGGGCCG CAGCGACATC
GAGTCGGTGC GGCGCGCGAC CGAGGCGATC GCCCACGGCG TCGGCGTGGT CGGGCTGCTC
AACGTGCAGT ACGCGCTCAA GGACGACGTG CTCTACGTCC TGGAGGCCAA CCCGCGGGCC
AGCCGCACGG TGCCGTTCGT ATCCAAAGCC ACGGCAGTGC CACTCGCTAA GGCGTGCGCG
CGGATCATGC TCGGCGCCAG CATCGCCCAG TTGCGCGAGG AGGGCGTCCT GGCCGCGACC
GGCGACGGTG CGACCACTGC GCGCAATGCG CCCGTCGCGG TGAAGGAAGC GGTGTTGCCG
TTCCACCGGT TCCGCAAGGC CGACGGCGCC CAGATCGATT CGCTGCTCGG CCCGGAGATG
AAGTCGACCG GCGAGGTGAT GGGCATCGAC CACGACTTCG GCAGCGCGTT CGCCAAGAGT
CAGACCGCGG CCTACGGTTC GCTGCCGTCG GAGGGCACCG TGTTCGTGTC GGTGGCCAAC
CGCGACAAGC GGTCGCTGGT CTTCCCGGTC AAACGGCTCG CCGACCTCGG GTTCAAGGTG
CTCGCCACCG AGGGCACCGC GGAGATGCTG CGCCGCAACG GGATTCCGTG CGACGAGGTG
CGCAAGCATT TCGAGGAACC GGGTGCTGGC AGGCCCGCGC GTTCGGCGGT CGAGGCCATC
CGCGCCGGTG ATGTCGCGAT GGTGATCAAC ACGCCCTACG GCAACTCCGG TCCGCGCATC
GACGGGTACG AGATCCGGTC GGCCGCGGTG TCGATGAACA TCCCGTGCAT CACCACCGTG
CAGGGCGCGT CGGCGGCCGT GCAGGGCATC GAGGCCAGCC TGCGCGGCGA CATCGGGGTG
ATGAGCCTGC AGGAGTTGCA CAGCGAGCTG GGAAACTGA
 
Protein sequence
MPRRTDLRHV LVIGSGPILI GQAAEFDYSG TQACRVLRAE GLTVTLINSN PATIMTDPEY 
ADYTYVEPIT PDFVERVIAQ QAERGNKIDA LLATLGGQTA LNTAVALSEN GVLERYDVEL
IGADFDAIQR GEDRQRFKDI VTKVGGESAK SRVCFTMEEV RETVGELGLP VVVRPSFTMG
GLGSGMAYSA EDVERMAGHG LASSPSANVL IEESIFGWKE YELELMRDRH DNVVVVCSIE
NFDPMGVHTG DSVTVAPAMT LTDREYQTMR DLGIAILREV GVATGGCNIQ FAVNPKDGRL
IVIEMNPRVS RSSALASKAT GFPIAKIAAK LAIGYTLDEI LNDITKETPA CFEPTLDYVV
VKAPRFAFEK FPGADATLTT TMKSVGEAMS LGRNFIEALG KVMRSLETGR AGFWTAPDPI
ATVDEVLENL RTPTDGRLYD IEFALRLGAS VEQVAEASGV DPWFVDQIAG LVALRTELLD
APVLDGTLLR RAKNSGLSDR QIAALRPELA GEVGVRALRQ RLGIHPVFKT VDTCAAEFEA
KTPYHYSSYE LDPAAESEVA PQAERPKVLI LGSGPNRIGQ GIEFDYSCVH AATTLSEAGF
ETVMINCNPE TVSTDYDTAD RLYFEPLTFE DVLEIYYAES ASGAGGPGVA GVIVQLGGQT
PLGLAERLEQ AGVPIVGTSP KAIDLAEDRG AFGEVLRTAG LPAPRFGLAT TFDQARRIAA
DIGYPVLVRP SYVLGGRGME IVYDEQTLEG YITRATQLSP EHPVLVDRFL EDAIEIDVDA
LCDGTEVYIG GIMEHIEEAG IHSGDSACAL PPVTLGRSDI ESVRRATEAI AHGVGVVGLL
NVQYALKDDV LYVLEANPRA SRTVPFVSKA TAVPLAKACA RIMLGASIAQ LREEGVLAAT
GDGATTARNA PVAVKEAVLP FHRFRKADGA QIDSLLGPEM KSTGEVMGID HDFGSAFAKS
QTAAYGSLPS EGTVFVSVAN RDKRSLVFPV KRLADLGFKV LATEGTAEML RRNGIPCDEV
RKHFEEPGAG RPARSAVEAI RAGDVAMVIN TPYGNSGPRI DGYEIRSAAV SMNIPCITTV
QGASAAVQGI EASLRGDIGV MSLQELHSEL GN