Gene GM21_2308 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2308 
Symbol 
ID8137648 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2685068 
End bp2688316 
Gene Length3249 bp 
Protein Length1082 aa 
Translation table11 
GC content62% 
IMG OID644869922 
Productcarbamoyl-phosphate synthase, large subunit 
Protein accessionYP_003022114 
Protein GI253700925 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit
[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones78 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTAAAC GCACAGACAT CAAGAAGATC CTCATTATCG GCGCGGGCCC GATCGTCATC 
GGCCAGGCGT GCGAGTTCGA CTACTCCGGT ACCCAGGCCT GCAAGGCGCT CAAGGAAGAG
GGGTTCGAGG TGGTGCTCCT GAACTCCAAC CCGGCTACCA TCATGACGGA CCCTGATTTC
GCCGACTTCA CTTATATCGA ACCGGTCACG CCCGAGATCC TCGCGGCGAT CATCGAGAAA
GAGCGCCCTG ACGCGCTGCT ACCGACCCTG GGGGGGCAGA CGGCGCTGAA CACGGCCGTG
GCGGTAGCGG AAAACGGCAC TCTGGAGAAG TTCGGCGTGG AGCTGATCGG CGCGAAGCTG
CCGGCCATCA AAAAGGCCGA GGACCGCACC CTGTTCAAGG AAGCGATGGT CAAGATCGGC
CTCGACGTCC CGAGATCGGG TCTCGCCCAC AACTATCAGG AGGCGATGGA GGTCATCAAG
GTCGTCGGCT TCCCTGCCAT CATCCGTCCC TCATTCACCC TAGGCGGCAC CGGCGGCGGC
ATCGCCTACA ACATGGAAGA GTACGAGCGT ATGTCCATGG CCGGCATCGA GGCGTCGCCC
ACCGACGAGA TCCTGGTCGA GGAGTCGCTG ATCGGCTGGA AGGAGTACGA GCTGGAGGTG
ATGAGGGATA CCGCCGACAA CGTGGTCATC ATCTGCTCCA TCGAAAACTT CGACGCCATG
GGCGTGCACA CCGGCGACTC CATCACGGTT GCGCCCGCCC AGACCTTGAC CGACAAGGAA
TACCAGATCC TGCGCGACGC CTCGCTGAAG ATCATCCGCG AGATCGGCGT CGACACCGGC
GGCTCCAACA TCCAGTTCGG CACCAACCCG AAAAACGGCC GCCTCATCGT CATCGAGATG
AACCCGCGCG TCTCCCGCTC CTCGGCGCTC GCCTCGAAGG CCACCGGCTT CCCGATCGCG
AAGATCGCCG CCAAGCTTGC CGTCGGCTAC ACGCTGGACG AGATCACCAA CGACATCACC
AAGGAGACGC CGGCCTGCTT CGAGCCGACC ATCGACTACG TGGTCACCAA GATCCCGCGC
TTCACCTTCG AGAAGTTCCC GGCCGCCGAC GCCACCCTCA CCACCCAGAT GAAATCGGTG
GGCGAGGTGA TGTCCATCGG CCGCACCTTC AAGGAGTCCT TCCAGAAGGC GCTCCGCTCG
CTGGAGATCG GCTCCTGCGG CTTCGAGTCC AAATTTTTCG GCGTAGGAGG CGACACCCGC
CGCGCACTCA CCGAAAAAGA GAGGAACCTC TTAAACGACA AGCTGAGGAC CCCCAACTGC
GACCGCCTCT GGTACGTCGG CGACGCCTTC CGCTGCGGCA TGACCGTGGA AGAGATCTAC
GCCCTCACCG CCATCGACCC CTGGTTCCTG AACAACATCC GCCAGATCAT CGAGATGGAG
GAGGAACTGA AGCCGGTAAA TATCAAGAAG GAATCAGGCG AGAAGCTGCA CGACATCCTC
TGGGACGCTA AACGCTACGG CTTCTCCGAC AAATACCTCG GGCAACTCTG GAAAATCCCG
GAAGCCGAGG TGCGCGAGTT GCGCCTGTCC GTCGGGGTCA AGCCGGTCTT TAAAAGGGTG
GATACCTGCG CCGCCGAGTT CGTGGCGCAC ACCCCGTACC TCTACTCCAC TTACGAGGAG
GAGTGCGAGG CGGAGCCGAC CGACAGGAAG AAAATCATCA TCCTCGGTGG CGGACCCAAC
CGCATCGGCC AGGGGATCGA GTTCGACTAC TGCTGCGTGC ACGGTGTTTT CGCCCTCTCC
GAGGACGGCT ACGAGACCAT CATGGTCAAC TGCAACCCGG AGACCGTTTC CACCGACTAC
GACACCTCGG ACCGCCTCTA CTTCGAGCCG CTCACCTTCG AGGACGTGCT GCACATCGTG
GACGTCGAGA AGCCGACCGG CGTCATCGTG CAGTTCGGCG GCCAGACCCC GCTGAAACTC
GCCGTGGCGC TTGAGAAGGC GGGCGTTCCC ATCATCGGCA CCTCGCCCGA CGCCATCGAC
CGCGCCGAGG ACCGCGAGCG CTTCCAGGAG ATGCTGCAAA AGCTCAAGCT CAGGCAGCCT
GAAAACGGCA CCGCCCGCTC CTTCGAGGAG TCCGAGGTGG TCGCCGAGCG TATCGGCTAC
CCGGTGGTGG TGCGCCCCTC CTACGTCCTT GGCGGGCGCG CCATGGAGAT CGTCTACGAC
GTGGACAACC TGCGCCGCTA CATGCACACC GCGGTTCAGG CCTCCCCGGA GCACCCGATC
CTGATCGACA AGTTCCTGGA CGAGGCGATC GAGATCGACG TCGACGCCCT TTGCGACGGC
CAAGTCGCCG TCATCGGCGG CATCATGGAG CACATCGAGG AGGCGGGTAT CCACTCCGGC
GACTCGGCCT GCTCGCTGCC GCCTTACTCC ATCTCCAAGG AGATCGTCGA GGAGATCAGG
CGCCAGACCA AGATGATGGC GCTGGAGTTG AACGTGAAGG GGCTCATGAA CGTGCAGTTC
GCCGTCAAGG GGAACGACAT CTACATCATC GAGGTCAACC CCCGCGCCTC GCGCACCTCC
CCCTTCGTCT CCAAGGCGAC CGGAAGGCCC CTGGCGAAGA TCGCCGCGCG CGTCATGGCG
GGCAAGACCC TGGCCGAGCT CGGGGTTACC GAGGAGATCG TCCCGGTCCA CATCTCGGTC
AAGGAATCGG TCTTCCCCTT CGCCAAGTTC CCCGGCGTCG ACACCATCCT GGGGCCGGAG
ATGAAGTCGA CCGGCGAGGT CATGGGGATC GGCGACACCT TCGCCAAGGC GTACGCCAAG
GCCCAGATGG GGGCCAACGT GAAGCTCCCG GCCTCGGGTA AAGTGTTCAT TTCAGTGAAG
GACACGGACA AAAAACATAT TGTCAGCGCT GCAAAAAGAC TGTATGATCA GGGCTTCGAA
TTGGTTGCTA CGCGCGGCAC GGCGAGCTAT CTGCAGGAAA AAGGGATCCC GGTTCAGGTA
GTAAACAAGG TAATCGAAGG ACGCCCCCAC ATAGTCGATG CGATCAAGAA CAACGAGATC
TGCATGGTCA TCAACACCAC CCACGGTGCA CAGGCCGTTG CCGATTCCTA CTCGATCCGC
AGGAACACCC TGATCAACAA CGTCGCTTAC TACACCACAG CCTCCGGCGC GAGAGCGGCC
GTAGACGGTA TCATAGCGAT GTCAAAGTCG AAGCTGGAGG TCAACTCGAT CCAGCACTAC
CTGAAGTAA
 
Protein sequence
MPKRTDIKKI LIIGAGPIVI GQACEFDYSG TQACKALKEE GFEVVLLNSN PATIMTDPDF 
ADFTYIEPVT PEILAAIIEK ERPDALLPTL GGQTALNTAV AVAENGTLEK FGVELIGAKL
PAIKKAEDRT LFKEAMVKIG LDVPRSGLAH NYQEAMEVIK VVGFPAIIRP SFTLGGTGGG
IAYNMEEYER MSMAGIEASP TDEILVEESL IGWKEYELEV MRDTADNVVI ICSIENFDAM
GVHTGDSITV APAQTLTDKE YQILRDASLK IIREIGVDTG GSNIQFGTNP KNGRLIVIEM
NPRVSRSSAL ASKATGFPIA KIAAKLAVGY TLDEITNDIT KETPACFEPT IDYVVTKIPR
FTFEKFPAAD ATLTTQMKSV GEVMSIGRTF KESFQKALRS LEIGSCGFES KFFGVGGDTR
RALTEKERNL LNDKLRTPNC DRLWYVGDAF RCGMTVEEIY ALTAIDPWFL NNIRQIIEME
EELKPVNIKK ESGEKLHDIL WDAKRYGFSD KYLGQLWKIP EAEVRELRLS VGVKPVFKRV
DTCAAEFVAH TPYLYSTYEE ECEAEPTDRK KIIILGGGPN RIGQGIEFDY CCVHGVFALS
EDGYETIMVN CNPETVSTDY DTSDRLYFEP LTFEDVLHIV DVEKPTGVIV QFGGQTPLKL
AVALEKAGVP IIGTSPDAID RAEDRERFQE MLQKLKLRQP ENGTARSFEE SEVVAERIGY
PVVVRPSYVL GGRAMEIVYD VDNLRRYMHT AVQASPEHPI LIDKFLDEAI EIDVDALCDG
QVAVIGGIME HIEEAGIHSG DSACSLPPYS ISKEIVEEIR RQTKMMALEL NVKGLMNVQF
AVKGNDIYII EVNPRASRTS PFVSKATGRP LAKIAARVMA GKTLAELGVT EEIVPVHISV
KESVFPFAKF PGVDTILGPE MKSTGEVMGI GDTFAKAYAK AQMGANVKLP ASGKVFISVK
DTDKKHIVSA AKRLYDQGFE LVATRGTASY LQEKGIPVQV VNKVIEGRPH IVDAIKNNEI
CMVINTTHGA QAVADSYSIR RNTLINNVAY YTTASGARAA VDGIIAMSKS KLEVNSIQHY
LK