Gene Mflv_3743 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMflv_3743 
SymbolcarB 
ID4975059 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium gilvum PYR-GCK 
KingdomBacteria 
Replicon accessionNC_009338 
Strand
Start bp3992199 
End bp3995537 
Gene Length3339 bp 
Protein Length1112 aa 
Translation table11 
GC content68% 
IMG OID640457967 
Productcarbamoyl phosphate synthase large subunit 
Protein accessionYP_001135003 
Protein GI145224325 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0558638 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCGTC GCTCAGACCT CAACCATGTG CTGGTGATCG GATCCGGACC GATCCTGATC 
GGGCAGGCCG CCGAGTTCGA CTACTCCGGC ACCCAGGCCT GCCGGGTGCT GCGGGCCGAG
GGCCTGCAGG TCACCCTGAT CAACTCCAAT CCGGCCACGA TCATGACCGA CCCGGAATAC
GCCGACCACA CCTACGTCGA GCCGATCACC GCGGACTTCG TCGAGAAGGT CATCGCCCAG
CAGGCCGAGC GCGGCAACAA GATCGACGCG CTGCTGCCGA CCCTGGGCGG GCAGACCGCG
CTCAACACCG CGGTGAAGCT GTACGAGAAC GGTGCGCTGG AGCGCTACGA CGTCGAGCTG
ATCGGCGCCA ACTTCGACGC GATCCAGCGC GGCGAGGATC GGCAGAAGTT CAAGGACATC
GTCACCAAGG TGGGCGGCGA GTCCGCGAAG TCCAGGGTGT GTTTCACCAT GGACGAGGTG
CGCGACACGG TCGCCGAACT CGGGCTGCCC GTGGTGGTCC GGCCGAGCTT CACCATGGGC
GGACTGGGCT CCGGGATGGC GTACTCGGCC GACGATGTGG AGCGCATGGC GGGGGAGGGC
CTCGCGGCGT CCCCGTCGGC GAACGTGCTG ATCGAGGAAT CCATCTACGG ATGGAAGGAG
TACGAGCTCG AGCTGATGCG CGACGGCCGC GACAACGTGG TGGTGGTCTG CTCGATCGAG
AACTTCGATC CGATGGGCGT GCACACCGGC GACTCGGTCA CCGTCGCGCC GGCGATGACA
CTCACCGACC GCGAGTACCA GAAGATGCGC ACCCTGGGCA TCGAGATCCT GCGTGAGGTC
GGCGTCGACA CCGGCGGCTG CAACATCCAG TTCGCCGTCA ACCCGAAGGA CGGCCGGCTC
ATCGTCATCG AGATGAACCC CCGGGTGTCG CGGTCCTCGG CGCTGGCGTC GAAGGCCACC
GGGTTCCCGA TCGCCAAGAT CGCGGCCAAG CTCGCGATCG GTTACACGCT CGACGAGATC
GTCAACGACA TCACCAAGGA AACCCCGGCG TGCTTCGAGC CGACGCTGGA CTACGTCGTG
GTCAAGGCGC CGCGGTTCGC GTTCGAGAAG TTCCCCGGCG CCGACGCGAC GCTGACCACC
ACCATGAAGT CGGTCGGCGA GGCGATGTCG TTGGGCCGCA ACTTCATCGA GGCGCTCGGC
AAGGTGATGC GCTCGCTGGA GACCGGCCGG GCGGGCTTCT GGACGGGGGA GGACCCCGTC
GGTGAGCTCG GCGAGGTGCT CGCGCGGCTG CGCACACCCA CCGACGGCCG GCTCTACGAC
ATCGAATACG CGCTGCGTAT CGGCGCGACC GTGGAAGAGG TCGCCGAGGC CTCCGGCGTC
GACCCGTGGT TCGTCGACCA GATCGGCGGC CTGGTCGAAC TGCGTGCCGA GCTGACCGAC
GCCCCCGTGC TCGGCGAGGA ACTGCTCCGC CGCAGCAAGC ACCACGGGCT CTCCGACCGC
CAGATCGCCG CGCTGCGACC CGAACTCGCC GGCGAGATGG GCGTACGGGC GCTGCGTCAG
CGGCTGGGGA TCCACCCGGT GTTCAAGACC GTCGACACCT GCGCGGCCGA GTTCGAGGCC
AAGACTCCGT ACCACTACAG CAGCTACGAG ATGGATCCCG CCGCGGAGAC CGAGGTCGCC
CCGCAGACCG AGCGGGGCAA GGTGCTGATC CTCGGGTCGG GCCCCAACCG GATCGGGCAG
GGCATCGAAT TCGACTACAG CTGTGTGCAC GCCGCGACCA CGCTCAGCGA GGCCGGCTTC
GAGACCGTGA TGATCAACTG CAACCCCGAG ACGGTGTCGA CCGACTACGA CACCGCCGAC
CGGCTGTACT TCGAACCGCT GACGTTCGAG GACGTGCTGG AGATCTACTA CGCCGAGCAG
AGATCGGGCG AGGGCGGCCC GGGCGTGATC GGGGTGATCG TGCAACTCGG CGGTCAGACG
CCGCTCGGAC TGGCCGAACG GCTGGAGAAA GCCGGGGTGC CGATCGTCGG CACCAAACCC
GAGGCGATCG ACCTGGCCGA GGACCGCGGC GAGTTCGGCG AGGTGCTGCG CCGCGCCGGA
CTGCCCGCGC CCCGGTTCGG GATGGCGACC AGCTTCGACC AGGCCCGCCG CATCGCCGCC
GAGATCGGCT ACCCGGTGCT GGTGCGGCCG TCTTATGTGC TGGGCGGGCG CGGCATGGAG
ATCGTCTACG ACGAGGACAC CCTCGAGGGC TACATCACCC GGGCAACCCA ACTCTCGCCC
GAGCACCCGG TGCTCGTGGA CCGCTTCCTC GAAGACGCGA TCGAGATCGA CGTCGACGCG
CTGTGCGACG GCACCGAGGT CTACATCGGC GGCGTGATGG AGCACATCGA GGAGGCCGGC
ATCCACTCCG GTGACTCGGC GTGCGCGCTG CCCCCTGTGA CGCTGGGCCG CAGCGACATC
GAGGCGGTGC GGCGCGCGAC CGAGGCGATC GCGTTCGGGG TCGGCGTGGT CGGCCTGCTC
AATGTGCAGT ACGCGCTGAA GGACGACGTC CTCTATGTCC TGGAGGCCAA TCCGCGCGCA
TCGCGCACCG TCCCCTTCGT CTCCAAGGCG ACCGCGGTAC CGCTGGCCAA GGCGTGCGCG
CGGATCATGC TGGGCGCCAG CATCGCCGAG CTCCGCGAGG AGGGCGTGCT GGCCAGGACC
GGTGACGGTG CGGCGACCGC GCGCAACGCG CCCGTGGCCG TGAAGGAAGC CGTCCTTCCC
TTCCACCGGT TCCGCAAGGC GGACGGCGCG CAGATCGACT CGCTGCTCGG GCCGGAGATG
AAGTCCACCG GCGAGGTGAT GGGCATCGCC CACGATTTCG GCAGCGCGTT CGCCAAGAGC
CAGACCGCCG CCTACGGCTC GCTGCCCGCC AGCGGGACCG TGTTCGTCTC GGTCGCCAAC
CGCGACAAGC GGTCCCTGGT GTTTCCGGTC AAGCGGCTCG CCGACCTCGG GTTCAAGATC
CTGGCCACCG AAGGCACCGC GGAGATGCTG CGGCGCAACG GAATCCCGTG TGAAGAAGTG
CGCAAGCACT TTGAAGAACC CAGTGCGGAC CGCCCACTGC GCTCTGCGGT CGAGGCGATC
AAGGCCGGCG ACGTCGACAT GGTGCTCAAC ACCCCGTACG GCAATTCGGG GCCGCGCATC
GACGGCTATG AGATCCGGTC GGCCGCGGTG TCGATGAACA TTCCGTGCGT GACCACCGTG
CAGGGCGCGT CGGCTGCGGT GCAGGGCATC GAGGCGGGGA TCCGCGGTGA CATCGGCGTG
ATGTCGCTGC AGGAACTGCA TTCCACGCTG GTCTCGTGA
 
Protein sequence
MPRRSDLNHV LVIGSGPILI GQAAEFDYSG TQACRVLRAE GLQVTLINSN PATIMTDPEY 
ADHTYVEPIT ADFVEKVIAQ QAERGNKIDA LLPTLGGQTA LNTAVKLYEN GALERYDVEL
IGANFDAIQR GEDRQKFKDI VTKVGGESAK SRVCFTMDEV RDTVAELGLP VVVRPSFTMG
GLGSGMAYSA DDVERMAGEG LAASPSANVL IEESIYGWKE YELELMRDGR DNVVVVCSIE
NFDPMGVHTG DSVTVAPAMT LTDREYQKMR TLGIEILREV GVDTGGCNIQ FAVNPKDGRL
IVIEMNPRVS RSSALASKAT GFPIAKIAAK LAIGYTLDEI VNDITKETPA CFEPTLDYVV
VKAPRFAFEK FPGADATLTT TMKSVGEAMS LGRNFIEALG KVMRSLETGR AGFWTGEDPV
GELGEVLARL RTPTDGRLYD IEYALRIGAT VEEVAEASGV DPWFVDQIGG LVELRAELTD
APVLGEELLR RSKHHGLSDR QIAALRPELA GEMGVRALRQ RLGIHPVFKT VDTCAAEFEA
KTPYHYSSYE MDPAAETEVA PQTERGKVLI LGSGPNRIGQ GIEFDYSCVH AATTLSEAGF
ETVMINCNPE TVSTDYDTAD RLYFEPLTFE DVLEIYYAEQ RSGEGGPGVI GVIVQLGGQT
PLGLAERLEK AGVPIVGTKP EAIDLAEDRG EFGEVLRRAG LPAPRFGMAT SFDQARRIAA
EIGYPVLVRP SYVLGGRGME IVYDEDTLEG YITRATQLSP EHPVLVDRFL EDAIEIDVDA
LCDGTEVYIG GVMEHIEEAG IHSGDSACAL PPVTLGRSDI EAVRRATEAI AFGVGVVGLL
NVQYALKDDV LYVLEANPRA SRTVPFVSKA TAVPLAKACA RIMLGASIAE LREEGVLART
GDGAATARNA PVAVKEAVLP FHRFRKADGA QIDSLLGPEM KSTGEVMGIA HDFGSAFAKS
QTAAYGSLPA SGTVFVSVAN RDKRSLVFPV KRLADLGFKI LATEGTAEML RRNGIPCEEV
RKHFEEPSAD RPLRSAVEAI KAGDVDMVLN TPYGNSGPRI DGYEIRSAAV SMNIPCVTTV
QGASAAVQGI EAGIRGDIGV MSLQELHSTL VS