Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gmet_0661 |
Symbol | carB |
ID | 3738310 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter metallireducens GS-15 |
Kingdom | Bacteria |
Replicon accession | NC_007517 |
Strand | - |
Start bp | 721123 |
End bp | 724323 |
Gene Length | 3201 bp |
Protein Length | 1066 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637777939 |
Product | carbamoyl phosphate synthase large subunit |
Protein accession | YP_383628 |
Protein GI | 78221881 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00149325 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.138255 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCAGGC GCGACGACGT CAAAAAGGTT CTCATCATCG GTTCCGGCCC CATCATCATC GGCCAGGCTT GCGAGTTCGA CTACTCCGGC ACCCAGGCCT GCAAGGCCCT GCGCAAGCTC GGCTACCGGA TCGTGCTGGT GAACTCCAAC CCCGCCACCA TCATGACCGA CCCCGGCATG GCCGACGCCA CCTATATCGA ACCCCTCAAT GTGGACACCC TGACCGAGAT CATCCGCAAG GAGCGCCCCG ACGCCCTCCT CCCCAACCTG GGTGGCCAGA CCGGTCTCAA CCTCTCGTCG GCCCTGGCCC AGGCGGGGGT GCTGGACCGG TACGGGGTCA GGGTCATCGG CGTCAACCTC GACGCCATCA AGCGGGGGGA AGACCGTGAA ACTTTCAAGG AGACCATGAC GCGGTTGGGC ATCGAGACCG CACGGAGCGA GATTGCCACA ACCATGGAAG GGGCCCTGGA CGTGGTGTCC CGCATCGGGC TGCCGGTGGT GATCCGCCCC GCCTACACCA TGGGGGGGAC CGGCGGCGGC TTCGCCTACA ACATGGAGGA GTTCCGGACC ATCGTTGCTC GGGGACTCGC CGCCAGCCCG GTGAGCCAGA TCCTCGTGGA GGAATCGGTG CTGGGGTGGG AGGAGCTGGA GCTGGAGGTG GTGCGGGACG CCAAAAACCA GAAGATCACC GTCTGTTTCA TCGAGAACGT GGACGCCATG GGGGTTCACA CTGGCGACTC CTACTGCACG GCCCCAATGC TCACCATCTC TTCTGAACTC CAGGCCCGGC TCCAGGACTA CGCCTACCGG ATCGTCGACG CCATCGAGGT GATCGGCGGC ACCAACGTCC AGTTCGCCCA CGACCCCAAA ACCGGGCGGG TGGTGGTCAT CGAGATCAAC CCCCGCACCT CCCGCTCCTC GGCCCTGGCC TCCAAGGCCA CCGGGTTCCC CATTGCCATG GTGTCATCGC TCCTGGCCGC GGGGCTTACC CTGGACGAAA TCCCCTACTG GCGGGACGGC TCCCTGGAAA AGTACACCCC GAGCGGCGAC TACGTGGTGG TGAAGTTCGC CCGCTGGGCC TTCGAGAAGT TCAAGGGGGT CGAGGACAAG CTCGGCACCC AGATGCGGGC CGTGGGCGAA GTCATGAGCA TCGGCAAGAA CTACAAGGAG GCGCTCCAGA AGGCGATCCG CTCACTGGAG ATCGGCCGCC ACGGCCTCGG CTTTGCCCGG GACTTCAACG CCACATCGCT CCCCAAGCTC CTGGAGATGC TGGCCGAGCC ATCCAGTGAG CGTCAGTTCA TCCTCTACGA GGCCATCCGC AAGGGGGCCG ACCTGGACCA GCTCCACCAG CTGACCCATA TCAAGCTCTG GTTCCTGCAG CAGATGAAGG AGCTGGTGGA ACTGGAGGAG GAGATCCTCC GGCACCGGGG GAGCCTCCCC CCGGACGAAC TGCTCCTCCA GGCCAAGCGG GACGGCTTTG CGGACCGCTA CCTAGCGAAG CTTTTGGAGA TCCCGGAAAC CCAAGTCCGC GAAAAGCGCC TCGCCCTGGG GCTTACCGAG GCGTGGGAAG CGGTGCCGGT CAGCGGCGTG GAAAACGCCG CCTACTACTT CTCCACCTAC AACGCGCCGG ACACGGTGCC GGTCAGCGAC CGGAAGAAGA TCATGGTGCT GGGGGGCGGC CCCAACCGGA TCGGCCAGGG GATCGAGTTC GACTACTGCT GCGTCCACAC CGCCTTAGCC CTGCGGGAGG CCGGTTACGA GACCATCATG GTCAACTGCA ACCCGGAGAC GGTCTCCACC GACTACGACA CCTCGGACAA GCTCTACTTC GAGCCCCTCA CCGTGGAGGA TGTCCTCTCC ATCCATGCCA AGGAGAAGCC CGAAGGGGTG GTGGTCCAGT TCGGGGGGCA GACCCCCCTC AACATCGCCG TCGAACTGGA GGCGGCCGGA GTGCGAATCA TCGGCACCAC GCCGGAGACC ATCGACCTGG CCGAGGACCG GCAGCGCTTC GCCAGGGTGA TGACGGAGCT GGGGATACCC CAGCCCGAAT CGGGGATGGC AAGCACCCTG GATGAGGCCC TGGCCGTTGC GGGCCGGATC GACTACCCCC TCATGGTGCG CCCCTCCTAC GTACTGGGGG GCAGAGCCAT GGAGGTGGTC CACGACGAGG AGATGCTCCG GGAGTACGTC ACCAAAGCCG TGGATGTCTC CCCGGAGCGA CCGATCCTCA TCGACCGGTT CCTGGAGAAC GCCATCGAGG CCGAGGCAGA CGCCATCAGC GACGGGACCG ACGCCTTCGT GCCGGCGGTC ATGGAGCATA TCGAACTGGC TGGAGTCCAC TCGGGGGACT CGGCCTGCGT CATCCCCCCG GTCAGCATTC CCCAAAAACA CATCGATACG ATTGATGAGT ACACCCGCAA GATCGCTGTG GCCATGGGGG TGGTGGGGCT CATGAACATC CAGTACGCCA TCGCTGACAA CACCGTCTAC ATCCTGGAGG CGAACCCCCG GGCAAGCCGC ACCGTCCCCC TGGTCTCCAA GGTCTGCAAC ATCCCCATGC CCCGCATCGC CGTGCAGGTC ATGCTCGGGG CGAAGCTGAA GGATATGGGG CTCGCCCGCC GCACCTTCCC CCACTTCGGG GTCAAGGAGG CGGTCTTCCC CTTCCCCATG TTCCCGGAGG TGGACCCGGT GCTCGGCCCG GAGATGCGCT CCACCGGCGA GGTGCTGGGG CTGGCGGGCA ATTACGGCCT CGCCTTCTAC AAGGCACAGG AAGGGGCCAA CGCCCAACTC CCCCTCTCCG GTTCGGTCCT CTTCACCATC GCCGACCGGG ACAAGGAGGG TGCCCTGGCG GCGGCCCGCC GCTTCGCGGA ACTGGGCTTC ACCATCCGGG CCACGGAAGG AACCTGCCGG TTCCTGGCCG GCCACGGCAT CGCCGCGACG CCGGTCACCA AGCTTCACGA GGGGCGCCCC AACCTGGTGG ACGCCATCAA GAACCGGGAG ATCCATCTGG TAGTCAACAC TCCTGCCGGC AAGCAGAGCG CCCATGACGA CTCCTACATA CGCAAGGCCG CCATCGCCAA CAAGATCCCC CACATCACCA CCGTGGCCGC CGCCGTAGCC GCCGCCGAGG GGATCGCGGC CCGCCGCAAC GGCAAGGAAC CGGTCATGAG CCTGCAGGAG TACCACGCAG GCATCCGCTG A
|
Protein sequence | MPRRDDVKKV LIIGSGPIII GQACEFDYSG TQACKALRKL GYRIVLVNSN PATIMTDPGM ADATYIEPLN VDTLTEIIRK ERPDALLPNL GGQTGLNLSS ALAQAGVLDR YGVRVIGVNL DAIKRGEDRE TFKETMTRLG IETARSEIAT TMEGALDVVS RIGLPVVIRP AYTMGGTGGG FAYNMEEFRT IVARGLAASP VSQILVEESV LGWEELELEV VRDAKNQKIT VCFIENVDAM GVHTGDSYCT APMLTISSEL QARLQDYAYR IVDAIEVIGG TNVQFAHDPK TGRVVVIEIN PRTSRSSALA SKATGFPIAM VSSLLAAGLT LDEIPYWRDG SLEKYTPSGD YVVVKFARWA FEKFKGVEDK LGTQMRAVGE VMSIGKNYKE ALQKAIRSLE IGRHGLGFAR DFNATSLPKL LEMLAEPSSE RQFILYEAIR KGADLDQLHQ LTHIKLWFLQ QMKELVELEE EILRHRGSLP PDELLLQAKR DGFADRYLAK LLEIPETQVR EKRLALGLTE AWEAVPVSGV ENAAYYFSTY NAPDTVPVSD RKKIMVLGGG PNRIGQGIEF DYCCVHTALA LREAGYETIM VNCNPETVST DYDTSDKLYF EPLTVEDVLS IHAKEKPEGV VVQFGGQTPL NIAVELEAAG VRIIGTTPET IDLAEDRQRF ARVMTELGIP QPESGMASTL DEALAVAGRI DYPLMVRPSY VLGGRAMEVV HDEEMLREYV TKAVDVSPER PILIDRFLEN AIEAEADAIS DGTDAFVPAV MEHIELAGVH SGDSACVIPP VSIPQKHIDT IDEYTRKIAV AMGVVGLMNI QYAIADNTVY ILEANPRASR TVPLVSKVCN IPMPRIAVQV MLGAKLKDMG LARRTFPHFG VKEAVFPFPM FPEVDPVLGP EMRSTGEVLG LAGNYGLAFY KAQEGANAQL PLSGSVLFTI ADRDKEGALA AARRFAELGF TIRATEGTCR FLAGHGIAAT PVTKLHEGRP NLVDAIKNRE IHLVVNTPAG KQSAHDDSYI RKAAIANKIP HITTVAAAVA AAEGIAARRN GKEPVMSLQE YHAGIR
|
| |