Gene Information |
Locus tag | Tmz1t_1732 |
Symbol | carB |
ID | 7085698 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1951572 |
End bp | 1954802 |
Gene Length | 3231 bp |
Protein Length | 1076 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643698751 |
Product | carbamoyl phosphate synthase large subunit |
Protein accession | YP_002355381 |
Protein GI | 217970147 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit |
|
|
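The coordinates and lengths above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene ends in a single stop codon; the function names are hypothetical and not part of the record.

```python
# Consistency check for the coordinates listed in the record.
# Assumption: Start bp and End bp are 1-based, inclusive positions.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length: int) -> int:
    """Number of amino acids for an unspliced CDS ending in one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # the stop codon is not translated


length = gene_length_bp(1951572, 1954802)
print(length)                           # 3231
print(expected_protein_length(length))  # 1076
```

Both values agree with the Gene Length and Protein Length fields of the record.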
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.215987 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGAAAC GTACAGACAT CAAGACCATC CTGATCATCG GCGCCGGCCC GATCATCATC GGCCAGGCCT GCGAGTTCGA CTATTCCGGC GCCCAGGCCT GCAAGGCCCT GCGCGAGGAA GGCTACAAGG TCATCCTGGT CAACTCGAAC CCGGCGACCA TCATGACCGA CCCGGAGACG GCCGACGTCA CCTACATCGA ACCGATCACC TGGCAGGTGG TCGAGCGCAT CATCGACAAG GAGCGTCCCG ACGCGATCCT GCCGACCATG GGCGGCCAGA CCGCGCTCAA CTGCGCGCTC GACCTCGGTC GCCACGGCGT GCTCGCCAAG TACGGCGTCG AGCTCATCGG CGCTTCGGAA GAGGCGATCG ACAAGGCGGA GGACCGCCTC AAGTTCAAGG ACGCGATGAC CAGGATCGGT CTCGGCTCCG CGCGCTCGGG CATCGCCCAC AGCATGGAAG AGGCGCTCCA GGTCCAGGGT GGCATCGGCT TCCCGGTGAT CATCCGCCCG AGCTTCACGC TCGGCGGCAC CGGCGGCGGC ATCGCCTACA ACATGGAAGA GTTCCAGGAG ATCTGCAAGC GCGGCCTCGA GGCGAGTCCG ACCAACGAGC TGCTGATCGA GGAGTCGCTG CTGGGCTGGA AGGAATACGA GATGGAGGTC GTGCGCGATC GCGCCGACAA CTGCATCATC GTGTGCTCGA TCGAGAACCT CGACCCGATG GGCGTGCACA CCGGCGACTC GATCACCGTC GCGCCGGCGC AGACGCTCAC CGACAAGGAA TACCAGATCA TGCGCAACGC CTCGATCGCG GTGCTGCGCG AGATCGGGGT GGACACGGGC GGCTCGAACG TGCAGTTCGC CATCAATCCC CAGGACGGTC GCATGATCGT CATCGAGATG AACCCGCGCG TGTCGCGTTC GTCGGCGCTG GCCTCCAAGG CCACCGGTTT CCCGATCGCA AAGGTCGCGG CCAAGCTGGC GGTGGGCTAC ACCCTCGACG AGCTGCGCAA CGACATCACC GGTGGCGCCA CCCCGGCGTC CTTCGAGCCT TCGATCGACT ACGTCGTCAC CAAGATCCCG CGCTTCGCGT TCGAGAAATT CCCCCAGGCC AACGACCGCC TGACCACCCA GATGAAATCG GTGGGCGAGG TCATGGCGAT GGGCCGCAGC TTCCAGGAAT CCTTCCAGAA GGCGCTGCGT GGGCTGGAGG TCGGCGTGTA TGGCCTGGAC GAGGTCGAGG CCGACCGCGA GGATCTCGAG CACGAGATCG CCAGCCCGGG CGCGCAGCGC ATCTGGTACG TGGGCCAGGC TTTCCGCGAG GGTATGAGCC TCGAGCAGGT TTTCAACCTG TCCAAGATCG ACCCCTGGTT CCTCGCCCAG ATCGAGGACA TCGTGCTCTC CGAGAAGGCG CTCGCCGGGC GCTCGCTGAA GGCGCTGCAG GCCGCCGAGC TGCGCGAGCT CAAGAAGAAG GGCTTCTCCG ACCGCCGCCT GGCCAAGCTG CTCAATGCTG ACGAGACCGC GGTGCGCCTG CATCGGCATA CGCTGGGCGT GCGCCCGGTG TTCAAGCGCG TCGACACCTG CGCGGCCGAG TTCGCCACCT CCACCGCCTA CCTGTATTCG TCCTACGAGG ACGAGTGCGA GGCGATGCCG ACCGAGCGCA AGAAGATCAT GGTGCTCGGC GGTGGTCCCA ACCGCATCGG CCAGGGCATC GAGTTCGACT ACTGCTGCGT GCATGCGGCG CTCGCCCTGC GCGAGGATGG CTACGAGACC ATCATGGTCA ACTGCAACCC GGAGACGGTG 
TCCACCGACT ACGACACCTC GGACCGCCTG TACTTCGAGC CGATCACGCT CGAAGACCTG CTCGAGATCG TGCACATCGA GAAGCCGGTC GGCGTGATCG TGCAGTTCGG CGGCCAGACT CCGCTCAAGC GCGCGCAGGC GCTCGAAGCC AACGGCGTGC CGATCATCGG CACCAGCCCC GACATGATCG ACGCCGCCGA GGACCGCGAG CGTTTCCAGC AACTGCTCGT CGAGCTCGGT CTGCGTCAGC CGCCCAACCG CACCGCGCGG ACGCCCGCCG AAGCCGTGCG CCTCGCGGCC GAGATCGGCT ATCCGCTGGT GGTACGCCCG TCCTACGTGC TGGGTGGCCG CGCGATGGAG ATCGTGCACG AGCAGAAGGA CCTCGAGCGC TACATGCGCG AGGCGGTCAA GGTCTCGAAC GAGTCGCCGG TGCTGCTCGA CCGCTTCCTC AACGACGCCA CGGAGGTGGA CGTCGATGCG CTGTCCGACG GCAAGCAGGT GATGATCGGC GGCATCATGG AGCACATCGA GCAGGCCGGC GTGCATTCGG GCGACTCGGC CTGCTCGCTG CCACCCTATA CCCTGTCGGC GAGGCTGCAG GACGAGTTGC GCCGCCAGAC CGAGGCCATG GCGCGCGCGC TGAAGGTCGT CGGCCTGATG AACGTGCAGT TCGCGATCCA GGGCGAGGGC GACAATGCGG TCGTCTATGT GCTCGAGGTG AACCCGCGCG CCTCGCGCAC CGTGCCCTTC GTCTCCAAGG CCTGCTCGCT GCCGCTGGCC AAGATCGCCG CGCGCTGCAT GGCGGGTCAG AGCCTGGCCG AGCAGGGCGT CAGCGGTGAG ATCGTTCCGC CTTACTACTC GGTCAAGGAG GCGGTGTTCC CGTTCGTGAA GTTCCCCGGC GTGGATACCA TCCTCGGGCC CGAGATGAAG TCCACCGGCG AGGTCATGGG CGTGGGCCGC AGCTTCGCCG AGGCCTTCGT GAAGAGCCAG ATGGGCGCGG GCGTGCGCCT GCCAACCGCG GGCACGGTGT TCATCAGCGT GCGTCCGACC GACAAGCCGG TCGCGATCGA GGTCGCGCGC GAGCTGCACG AGATGGGCTT CGCGCTGGTG GCCACCCGCG GCACCGCAGC GGCGATCGAG GCCGCCGGGA TCCCGGTGGG CGTGGTCAAC AAGGTCAACG AGGGCCGTCC GCACATCGTC GACATCATCA AGAACGAGGA GATCGTGCTG GTGATCAACA CCGTCGACGA GAAGCGCCAG GCGATCGCTG ACTCGCGCTC GATCCGCACC AGCGCGCTCG CCGCAAAGGT GTCGATCTTC ACCACCATCG AGGGCGCGCG CGCCGCGTGC ATGGGCATGC GCCACCTTTC GAACGGTCTC GAGGTGTATT CCGTGCAGGG CCTGCACGCC GAACTGAAGC AGCAAGCATG A
|
Protein sequence | MPKRTDIKTI LIIGAGPIII GQACEFDYSG AQACKALREE GYKVILVNSN PATIMTDPET ADVTYIEPIT WQVVERIIDK ERPDAILPTM GGQTALNCAL DLGRHGVLAK YGVELIGASE EAIDKAEDRL KFKDAMTRIG LGSARSGIAH SMEEALQVQG GIGFPVIIRP SFTLGGTGGG IAYNMEEFQE ICKRGLEASP TNELLIEESL LGWKEYEMEV VRDRADNCII VCSIENLDPM GVHTGDSITV APAQTLTDKE YQIMRNASIA VLREIGVDTG GSNVQFAINP QDGRMIVIEM NPRVSRSSAL ASKATGFPIA KVAAKLAVGY TLDELRNDIT GGATPASFEP SIDYVVTKIP RFAFEKFPQA NDRLTTQMKS VGEVMAMGRS FQESFQKALR GLEVGVYGLD EVEADREDLE HEIASPGAQR IWYVGQAFRE GMSLEQVFNL SKIDPWFLAQ IEDIVLSEKA LAGRSLKALQ AAELRELKKK GFSDRRLAKL LNADETAVRL HRHTLGVRPV FKRVDTCAAE FATSTAYLYS SYEDECEAMP TERKKIMVLG GGPNRIGQGI EFDYCCVHAA LALREDGYET IMVNCNPETV STDYDTSDRL YFEPITLEDL LEIVHIEKPV GVIVQFGGQT PLKRAQALEA NGVPIIGTSP DMIDAAEDRE RFQQLLVELG LRQPPNRTAR TPAEAVRLAA EIGYPLVVRP SYVLGGRAME IVHEQKDLER YMREAVKVSN ESPVLLDRFL NDATEVDVDA LSDGKQVMIG GIMEHIEQAG VHSGDSACSL PPYTLSARLQ DELRRQTEAM ARALKVVGLM NVQFAIQGEG DNAVVYVLEV NPRASRTVPF VSKACSLPLA KIAARCMAGQ SLAEQGVSGE IVPPYYSVKE AVFPFVKFPG VDTILGPEMK STGEVMGVGR SFAEAFVKSQ MGAGVRLPTA GTVFISVRPT DKPVAIEVAR ELHEMGFALV ATRGTAAAIE AAGIPVGVVN KVNEGRPHIV DIIKNEEIVL VINTVDEKRQ AIADSRSIRT SALAAKVSIF TTIEGARAAC MGMRHLSNGL EVYSVQGLHA ELKQQA
|
| |
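The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the sequence is pasted as shown in the record (blocks of 10 bases separated by spaces) and uses the standard codon assignments, which translation table 11 shares for elongation codons; it does not handle alternative start codons. The names `translate` and `gc_fraction` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Standard codon table in TCAG order; translation table 11 uses the same
# codon-to-amino-acid assignments (it differs in permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence in the record.
prefix = "ATGCCGAAAC GTACAGACAT CAAGACCATC"
print(translate(prefix))  # MPKRTDIKTI, the first 10 residues of the protein
```

Applying `gc_fraction` to the full gene sequence would allow a comparison with the GC content field of the record.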