Gene Information |
Locus tag | Avin_42930 |
Symbol | carB |
ID | 7763167 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 4330755 |
End bp | 4333976 |
Gene Length | 3222 bp |
Protein Length | 1073 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643807149 |
Product | carbamoyl phosphate synthase large subunit |
Protein accession | YP_002801390 |
Protein GI | 226946317 |
COG category | [E] Amino acid transport and metabolism; [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit; [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
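The coordinate and length fields in the record above can be cross-checked against each other. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based, inclusive coordinates and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information fields above.
start_bp = 4330755
end_bp = 4333976
gene_length_bp = 3222
protein_length_aa = 1073

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the final codon is a stop codon and is not translated.
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # coordinate span matches Gene Length
print(remainder == 0)                                # length is a multiple of 3
print(expected_protein_length == protein_length_aa)  # matches Protein Length
```

Under these assumptions all three checks agree with the record: 3222 bp corresponds to 1074 codons, one of which is the stop codon, leaving 1073 amino acids.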
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAAAAC GTACAGACAT CAAAAGCATC CTGATCCTCG GCGCCGGCCC CATCGTCATC GGTCAGGCCT GCGAGTTCGA CTACTCGGGC GCCCAGGCTT GCAAGGCACT GAAAGAAGAA GGCTTTCGCG TCATTTTGGT GAACTCCAAC CCGGCCACCA TCATGACTGA CCCGACCATG GCCGACGCCA CCTACATCGA GCCGATCAAG TGGCAGACCG TGGCCAAGAT CATCGAGAAG GAACGACCCG ACGCCCTGCT GCCGACCATG GGCGGCCAGA CCGCGCTGAA CTGCGCCCTG GCGCTGGAGA AACACGGCGT GTTGACCAAG TTCGGCGTCG AGATGATCGG TGCCAATGCC GATACCATCG ACAAGGCCGA GGACCGCTCG CGCTTCGACA AGGCCATGCG CGCCATCGGC CTGGAATGCC CGCGCTCCGG CATCGCCCAC AGCATGGACG AAGCCTATGG CGTATTGGAC AAGGTCGGCT TCCCCTGCAT CATCCGTCCG TCCTTCACCA TGGGTGGCAC CGGTGGCGGT ATCGCCTACA ACCGCGAAGA GTTCGAGGAA ATCTGTACCC GCGGCCTGGA CCTGTCGCCG ACCAGCGAGC TTTTGATCGA CGAATCCCTG ATCGGCTGGA AGGAGTACGA GATGGAGGTG GTCCGCGACA AGAAGGACAA CTGCATCATC GTCTGCTCGA TCGAGAACTT CGATCCGATG GGCGTGCATA CCGGTGACTC GATCACCGTG GCTCCGGCGC AGACCCTGAC CGACAAGGAA TACCAGATCA TGCGCAACGC ATCGCTTGCG GTGCTGCGCG AGATCGGCGT GGAGACCGGC GGCTCCAATG TGCAGTTCGG CATCAATCCG GTCGACGGCC GCATGGTGGT GATCGAGATG AACCCGCGCG TGTCGCGCTC CTCGGCGCTG GCCTCGAAGG CTACCGGCTT CCCGATCGCC AAGATCGCCG CCAAGCTGGC GGTGGGCTAT ACCCTCGACG AGTTGTCCAA CGACATCACC GGCGGCCGCA CGCCGGCCTC CTTCGAGCCG GCCATCGACT ACGTGGTTAC CAAGGTGCCA CGCTTCGCCT TCGAGAAGTT TCCCAAGGCC GACGCTCGCC TGACCACCCA GATGAAATCC GTGGGCGAGG TGATGGCCAT CGGTCGGACC TTCCAGGAGT CCGTACAGAA GGCCCTGCGT GGCCTGGAGG TTGGGGTCAG TGGTTTCGAT CCCAAACTGG ATCCGGGCAA CCCGGAAGCC GGGAGCATTC TCAAGCGCGA GCTGACCGTG CCGGGCGCCG AGCGCATTTG GTATGTGGCC GATGCCTTCC GTGCCGGCAA GAGCGTCGAC GATGTGTTCG CGATGACCAG GATCGACCCC TGGTTCCTGG TGCAGATCGA GGATCTGGTC AAGGAAGAGG AGCGGGTCAA GACTCTCGGC CTTTCCAGCA TCGATCGCAA CCTGATGTGG CGGCTCAAGC GCAAGGGCTT TTCCGACGCA CGCCTGGCCA AGCTGCTCGG CGTGACCGAG AAGAACCTGC GCAGCCACCG GCAGAAGCTC AAGGTGCAGC CGGTGTACAA GCGTGTCGAT ACCTGCGCCG CCGAGTTCGC CAGCGATACC GCCTACATGT ACTCGACTTA CGAGGAGGAG TGCGAGGCCA AGCCGTCCAG CCGCGACAAG ATCATGGTCA TCGGCGGCGG TCCGAACCGT ATCGGCCAGG GCATCGAGTT CGACTATTGC TGCGTGCATG CTGCACTGGC CATGCGCGAA GACGGTTACG AGACCATCAT GGTCAACTGC 
AACCCAGAAA CCGTCTCCAC CGACTACGAC ACGTCCGATC GCCTGTATTT CGAGCCGGTG ACCCTGGAGG ACGTGTTGGA AATCGTCCGT GTCGAACAGC CCAAGGGCGT CATCGTGCAG TACGGCGGTC AGACCCCGCT GAAGATCTGC CGCGCGCTGG AGGAAGCCGG TGTGCCGATC ATCGGCACCA GCCCGGACGC CATCGACCGC GCCGAGGACC GCGAGCGCTT CCAGCACATG GTCGAGCGCC TCAACCTGCG CCAGCCGCCG AACGCCACTG CCCGCAGCGA GGACGAGGCC ATCGCCGCCT CGAAGGCGAT CGGCTATCCG CTGGTGGTGC GCCCGTCTTA TGTGCTGGGT GGCCGGGCCA TGGAGATCGT CTATGAGGAA GACGAACTCA AGCGCTACAT GCGCGAGGCC GTGCAGGTCT CCAACGACAG CCCGGTACTG CTCGACCACT TCCTCAATTG CGCCATCGAG GTCGATATCG ATGCCGTCTG TGACGGCGAG GACGTAGTGA TCGGTGCGAT CATGCAGCAC ATCGAGCAGG CCGGCGTGCA TTCCGGCGAT TCTGCCTGTT CGCTGCCGCC CTATTCGCTG CCGGCGCACA TCCAGGACGA TATCCGCGAA CAGGTCAAGA AGATGGCCCT GGAGCTCGGC GTCGTCGGTC TGATGAACGT CCAGATGGCG GTGCAGGGCG AAGACATCTA CGTCCTGGAG GTGAACCCGC GCGCCTCGCG CACCGTGCCT TTCGTCTCCA AGTGCATCGG TCAGTCCTTG GCCAAGATCG CCGCGCGCGT GATGGCCGGC AAGACGCTCA AGGAGATCGG TTTCACCCGC GAGATCATCC CGACGTACTT CAGCGTGAAG GAAGCGGTGT TCCCGTTTGC CAAATTCCCC GGCGTCGACC CCATTCTCGG CCCGGAGATG AAGTCCACCG GCGAGGTGAT GGGGGTCGGC GACAGTTTCG CCGAGGCTTT TGCCAAGGCC CAACTGGGGG CCAGCGAGAC CCTGCCAGCC GGTGGTTGCG CCTTCATCAG CGTGCGAGAA GACGACAAGC CGTTCGTCGC CGAGGTGGCG CGCAACCTGG TCGCCCTCGG CTTCGAGGTG GTGGCCACTG CCGGTACCGC TCGAATAATC GAAGCGGCCG GCCTGCCGGT TCGCCGGGTG AACAAGGTGA CCGAGGGGCG TCCCCATGTG GTCGACATGA TCAAGAACGA TGAGGTCACC CTGATCATCA ACACTACCGA GGGGCGCCAG TCGATCGCCG ACTCCTTCTC GATCCGTCGC AACGCTCTGC AGCACAAGAT CTGCATCACC ACCACCATCG CCGGTGGCCA GGCGATCTGT GAGGCGCTCA GGTTCGGTCC CGAGAAGACC GTGCGCCGTC TGCAGGATCT CCATGCAGGA ATCAACGCAT GA
|
Protein sequence | MPKRTDIKSI LILGAGPIVI GQACEFDYSG AQACKALKEE GFRVILVNSN PATIMTDPTM ADATYIEPIK WQTVAKIIEK ERPDALLPTM GGQTALNCAL ALEKHGVLTK FGVEMIGANA DTIDKAEDRS RFDKAMRAIG LECPRSGIAH SMDEAYGVLD KVGFPCIIRP SFTMGGTGGG IAYNREEFEE ICTRGLDLSP TSELLIDESL IGWKEYEMEV VRDKKDNCII VCSIENFDPM GVHTGDSITV APAQTLTDKE YQIMRNASLA VLREIGVETG GSNVQFGINP VDGRMVVIEM NPRVSRSSAL ASKATGFPIA KIAAKLAVGY TLDELSNDIT GGRTPASFEP AIDYVVTKVP RFAFEKFPKA DARLTTQMKS VGEVMAIGRT FQESVQKALR GLEVGVSGFD PKLDPGNPEA GSILKRELTV PGAERIWYVA DAFRAGKSVD DVFAMTRIDP WFLVQIEDLV KEEERVKTLG LSSIDRNLMW RLKRKGFSDA RLAKLLGVTE KNLRSHRQKL KVQPVYKRVD TCAAEFASDT AYMYSTYEEE CEAKPSSRDK IMVIGGGPNR IGQGIEFDYC CVHAALAMRE DGYETIMVNC NPETVSTDYD TSDRLYFEPV TLEDVLEIVR VEQPKGVIVQ YGGQTPLKIC RALEEAGVPI IGTSPDAIDR AEDRERFQHM VERLNLRQPP NATARSEDEA IAASKAIGYP LVVRPSYVLG GRAMEIVYEE DELKRYMREA VQVSNDSPVL LDHFLNCAIE VDIDAVCDGE DVVIGAIMQH IEQAGVHSGD SACSLPPYSL PAHIQDDIRE QVKKMALELG VVGLMNVQMA VQGEDIYVLE VNPRASRTVP FVSKCIGQSL AKIAARVMAG KTLKEIGFTR EIIPTYFSVK EAVFPFAKFP GVDPILGPEM KSTGEVMGVG DSFAEAFAKA QLGASETLPA GGCAFISVRE DDKPFVAEVA RNLVALGFEV VATAGTARII EAAGLPVRRV NKVTEGRPHV VDMIKNDEVT LIINTTEGRQ SIADSFSIRR NALQHKICIT TTIAGGQAIC EALRFGPEKT VRRLQDLHAG INA
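The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such text might be normalised before analysis follows. It assumes the spaces are display formatting only, and the helper names (`clean_sequence`, `ends_with_stop`, `gc_fraction`) are hypothetical. Only the first and last blocks of the gene sequence are used here, so no value for the full gene is computed.

```python
# Stop codons under translation table 11 (the table listed in the record).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def ends_with_stop(cds: str) -> bool:
    """True if the sequence is a whole number of codons and ends in a stop codon."""
    return len(cds) % 3 == 0 and cds[-3:] in STOP_CODONS


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First two and last two blocks copied from the gene sequence above
# (a partial excerpt for illustration, not the full gene).
head = clean_sequence("ATGCCAAAAC GTACAGACAT")
tail = clean_sequence("ATCAACGCAT GA")

print(head.startswith("ATG"))  # the gene begins with an ATG start codon
print(ends_with_stop(tail))    # the gene ends with the stop codon TGA
```

Applying `clean_sequence` to the full Gene sequence field and passing the result to `gc_fraction` would give a value to compare with the GC content field.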
|
| |