Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2308 |
Symbol | |
ID | 8137648 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 2685068 |
End bp | 2688316 |
Gene Length | 3249 bp |
Protein Length | 1082 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644869922 |
Product | carbamoyl-phosphate synthase, large subunit |
Protein accession | YP_003022114 |
Protein GI | 253700925 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 78 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTAAAC GCACAGACAT CAAGAAGATC CTCATTATCG GCGCGGGCCC GATCGTCATC GGCCAGGCGT GCGAGTTCGA CTACTCCGGT ACCCAGGCCT GCAAGGCGCT CAAGGAAGAG GGGTTCGAGG TGGTGCTCCT GAACTCCAAC CCGGCTACCA TCATGACGGA CCCTGATTTC GCCGACTTCA CTTATATCGA ACCGGTCACG CCCGAGATCC TCGCGGCGAT CATCGAGAAA GAGCGCCCTG ACGCGCTGCT ACCGACCCTG GGGGGGCAGA CGGCGCTGAA CACGGCCGTG GCGGTAGCGG AAAACGGCAC TCTGGAGAAG TTCGGCGTGG AGCTGATCGG CGCGAAGCTG CCGGCCATCA AAAAGGCCGA GGACCGCACC CTGTTCAAGG AAGCGATGGT CAAGATCGGC CTCGACGTCC CGAGATCGGG TCTCGCCCAC AACTATCAGG AGGCGATGGA GGTCATCAAG GTCGTCGGCT TCCCTGCCAT CATCCGTCCC TCATTCACCC TAGGCGGCAC CGGCGGCGGC ATCGCCTACA ACATGGAAGA GTACGAGCGT ATGTCCATGG CCGGCATCGA GGCGTCGCCC ACCGACGAGA TCCTGGTCGA GGAGTCGCTG ATCGGCTGGA AGGAGTACGA GCTGGAGGTG ATGAGGGATA CCGCCGACAA CGTGGTCATC ATCTGCTCCA TCGAAAACTT CGACGCCATG GGCGTGCACA CCGGCGACTC CATCACGGTT GCGCCCGCCC AGACCTTGAC CGACAAGGAA TACCAGATCC TGCGCGACGC CTCGCTGAAG ATCATCCGCG AGATCGGCGT CGACACCGGC GGCTCCAACA TCCAGTTCGG CACCAACCCG AAAAACGGCC GCCTCATCGT CATCGAGATG AACCCGCGCG TCTCCCGCTC CTCGGCGCTC GCCTCGAAGG CCACCGGCTT CCCGATCGCG AAGATCGCCG CCAAGCTTGC CGTCGGCTAC ACGCTGGACG AGATCACCAA CGACATCACC AAGGAGACGC CGGCCTGCTT CGAGCCGACC ATCGACTACG TGGTCACCAA GATCCCGCGC TTCACCTTCG AGAAGTTCCC GGCCGCCGAC GCCACCCTCA CCACCCAGAT GAAATCGGTG GGCGAGGTGA TGTCCATCGG CCGCACCTTC AAGGAGTCCT TCCAGAAGGC GCTCCGCTCG CTGGAGATCG GCTCCTGCGG CTTCGAGTCC AAATTTTTCG GCGTAGGAGG CGACACCCGC CGCGCACTCA CCGAAAAAGA GAGGAACCTC TTAAACGACA AGCTGAGGAC CCCCAACTGC GACCGCCTCT GGTACGTCGG CGACGCCTTC CGCTGCGGCA TGACCGTGGA AGAGATCTAC GCCCTCACCG CCATCGACCC CTGGTTCCTG AACAACATCC GCCAGATCAT CGAGATGGAG GAGGAACTGA AGCCGGTAAA TATCAAGAAG GAATCAGGCG AGAAGCTGCA CGACATCCTC TGGGACGCTA AACGCTACGG CTTCTCCGAC AAATACCTCG GGCAACTCTG GAAAATCCCG GAAGCCGAGG TGCGCGAGTT GCGCCTGTCC GTCGGGGTCA AGCCGGTCTT TAAAAGGGTG GATACCTGCG CCGCCGAGTT CGTGGCGCAC ACCCCGTACC TCTACTCCAC TTACGAGGAG GAGTGCGAGG CGGAGCCGAC CGACAGGAAG AAAATCATCA TCCTCGGTGG CGGACCCAAC CGCATCGGCC AGGGGATCGA GTTCGACTAC TGCTGCGTGC ACGGTGTTTT CGCCCTCTCC GAGGACGGCT ACGAGACCAT CATGGTCAAC TGCAACCCGG AGACCGTTTC CACCGACTAC GACACCTCGG ACCGCCTCTA CTTCGAGCCG CTCACCTTCG AGGACGTGCT GCACATCGTG GACGTCGAGA AGCCGACCGG CGTCATCGTG CAGTTCGGCG GCCAGACCCC GCTGAAACTC GCCGTGGCGC TTGAGAAGGC GGGCGTTCCC ATCATCGGCA CCTCGCCCGA CGCCATCGAC CGCGCCGAGG ACCGCGAGCG CTTCCAGGAG ATGCTGCAAA AGCTCAAGCT CAGGCAGCCT GAAAACGGCA CCGCCCGCTC CTTCGAGGAG TCCGAGGTGG TCGCCGAGCG TATCGGCTAC CCGGTGGTGG TGCGCCCCTC CTACGTCCTT GGCGGGCGCG CCATGGAGAT CGTCTACGAC GTGGACAACC TGCGCCGCTA CATGCACACC GCGGTTCAGG CCTCCCCGGA GCACCCGATC CTGATCGACA AGTTCCTGGA CGAGGCGATC GAGATCGACG TCGACGCCCT TTGCGACGGC CAAGTCGCCG TCATCGGCGG CATCATGGAG CACATCGAGG AGGCGGGTAT CCACTCCGGC GACTCGGCCT GCTCGCTGCC GCCTTACTCC ATCTCCAAGG AGATCGTCGA GGAGATCAGG CGCCAGACCA AGATGATGGC GCTGGAGTTG AACGTGAAGG GGCTCATGAA CGTGCAGTTC GCCGTCAAGG GGAACGACAT CTACATCATC GAGGTCAACC CCCGCGCCTC GCGCACCTCC CCCTTCGTCT CCAAGGCGAC CGGAAGGCCC CTGGCGAAGA TCGCCGCGCG CGTCATGGCG GGCAAGACCC TGGCCGAGCT CGGGGTTACC GAGGAGATCG TCCCGGTCCA CATCTCGGTC AAGGAATCGG TCTTCCCCTT CGCCAAGTTC CCCGGCGTCG ACACCATCCT GGGGCCGGAG ATGAAGTCGA CCGGCGAGGT CATGGGGATC GGCGACACCT TCGCCAAGGC GTACGCCAAG GCCCAGATGG GGGCCAACGT GAAGCTCCCG GCCTCGGGTA AAGTGTTCAT TTCAGTGAAG GACACGGACA AAAAACATAT TGTCAGCGCT GCAAAAAGAC TGTATGATCA GGGCTTCGAA TTGGTTGCTA CGCGCGGCAC GGCGAGCTAT CTGCAGGAAA AAGGGATCCC GGTTCAGGTA GTAAACAAGG TAATCGAAGG ACGCCCCCAC ATAGTCGATG CGATCAAGAA CAACGAGATC TGCATGGTCA TCAACACCAC CCACGGTGCA CAGGCCGTTG CCGATTCCTA CTCGATCCGC AGGAACACCC TGATCAACAA CGTCGCTTAC TACACCACAG CCTCCGGCGC GAGAGCGGCC GTAGACGGTA TCATAGCGAT GTCAAAGTCG AAGCTGGAGG TCAACTCGAT CCAGCACTAC CTGAAGTAA
|
Protein sequence | MPKRTDIKKI LIIGAGPIVI GQACEFDYSG TQACKALKEE GFEVVLLNSN PATIMTDPDF ADFTYIEPVT PEILAAIIEK ERPDALLPTL GGQTALNTAV AVAENGTLEK FGVELIGAKL PAIKKAEDRT LFKEAMVKIG LDVPRSGLAH NYQEAMEVIK VVGFPAIIRP SFTLGGTGGG IAYNMEEYER MSMAGIEASP TDEILVEESL IGWKEYELEV MRDTADNVVI ICSIENFDAM GVHTGDSITV APAQTLTDKE YQILRDASLK IIREIGVDTG GSNIQFGTNP KNGRLIVIEM NPRVSRSSAL ASKATGFPIA KIAAKLAVGY TLDEITNDIT KETPACFEPT IDYVVTKIPR FTFEKFPAAD ATLTTQMKSV GEVMSIGRTF KESFQKALRS LEIGSCGFES KFFGVGGDTR RALTEKERNL LNDKLRTPNC DRLWYVGDAF RCGMTVEEIY ALTAIDPWFL NNIRQIIEME EELKPVNIKK ESGEKLHDIL WDAKRYGFSD KYLGQLWKIP EAEVRELRLS VGVKPVFKRV DTCAAEFVAH TPYLYSTYEE ECEAEPTDRK KIIILGGGPN RIGQGIEFDY CCVHGVFALS EDGYETIMVN CNPETVSTDY DTSDRLYFEP LTFEDVLHIV DVEKPTGVIV QFGGQTPLKL AVALEKAGVP IIGTSPDAID RAEDRERFQE MLQKLKLRQP ENGTARSFEE SEVVAERIGY PVVVRPSYVL GGRAMEIVYD VDNLRRYMHT AVQASPEHPI LIDKFLDEAI EIDVDALCDG QVAVIGGIME HIEEAGIHSG DSACSLPPYS ISKEIVEEIR RQTKMMALEL NVKGLMNVQF AVKGNDIYII EVNPRASRTS PFVSKATGRP LAKIAARVMA GKTLAELGVT EEIVPVHISV KESVFPFAKF PGVDTILGPE MKSTGEVMGI GDTFAKAYAK AQMGANVKLP ASGKVFISVK DTDKKHIVSA AKRLYDQGFE LVATRGTASY LQEKGIPVQV VNKVIEGRPH IVDAIKNNEI CMVINTTHGA QAVADSYSIR RNTLINNVAY YTTASGARAA VDGIIAMSKS KLEVNSIQHY LK
|
| |