Gene Avin_42930 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_42930 
SymbolcarB 
ID7763167 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp4330755 
End bp4333976 
Gene Length3222 bp 
Protein Length1073 aa 
Translation table11 
GC content63% 
IMG OID643807149 
Productcarbamoyl phosphate synthase large subunit 
Protein accessionYP_002801390 
Protein GI226946317 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit
[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAAAAC GTACAGACAT CAAAAGCATC CTGATCCTCG GCGCCGGCCC CATCGTCATC 
GGTCAGGCCT GCGAGTTCGA CTACTCGGGC GCCCAGGCTT GCAAGGCACT GAAAGAAGAA
GGCTTTCGCG TCATTTTGGT GAACTCCAAC CCGGCCACCA TCATGACTGA CCCGACCATG
GCCGACGCCA CCTACATCGA GCCGATCAAG TGGCAGACCG TGGCCAAGAT CATCGAGAAG
GAACGACCCG ACGCCCTGCT GCCGACCATG GGCGGCCAGA CCGCGCTGAA CTGCGCCCTG
GCGCTGGAGA AACACGGCGT GTTGACCAAG TTCGGCGTCG AGATGATCGG TGCCAATGCC
GATACCATCG ACAAGGCCGA GGACCGCTCG CGCTTCGACA AGGCCATGCG CGCCATCGGC
CTGGAATGCC CGCGCTCCGG CATCGCCCAC AGCATGGACG AAGCCTATGG CGTATTGGAC
AAGGTCGGCT TCCCCTGCAT CATCCGTCCG TCCTTCACCA TGGGTGGCAC CGGTGGCGGT
ATCGCCTACA ACCGCGAAGA GTTCGAGGAA ATCTGTACCC GCGGCCTGGA CCTGTCGCCG
ACCAGCGAGC TTTTGATCGA CGAATCCCTG ATCGGCTGGA AGGAGTACGA GATGGAGGTG
GTCCGCGACA AGAAGGACAA CTGCATCATC GTCTGCTCGA TCGAGAACTT CGATCCGATG
GGCGTGCATA CCGGTGACTC GATCACCGTG GCTCCGGCGC AGACCCTGAC CGACAAGGAA
TACCAGATCA TGCGCAACGC ATCGCTTGCG GTGCTGCGCG AGATCGGCGT GGAGACCGGC
GGCTCCAATG TGCAGTTCGG CATCAATCCG GTCGACGGCC GCATGGTGGT GATCGAGATG
AACCCGCGCG TGTCGCGCTC CTCGGCGCTG GCCTCGAAGG CTACCGGCTT CCCGATCGCC
AAGATCGCCG CCAAGCTGGC GGTGGGCTAT ACCCTCGACG AGTTGTCCAA CGACATCACC
GGCGGCCGCA CGCCGGCCTC CTTCGAGCCG GCCATCGACT ACGTGGTTAC CAAGGTGCCA
CGCTTCGCCT TCGAGAAGTT TCCCAAGGCC GACGCTCGCC TGACCACCCA GATGAAATCC
GTGGGCGAGG TGATGGCCAT CGGTCGGACC TTCCAGGAGT CCGTACAGAA GGCCCTGCGT
GGCCTGGAGG TTGGGGTCAG TGGTTTCGAT CCCAAACTGG ATCCGGGCAA CCCGGAAGCC
GGGAGCATTC TCAAGCGCGA GCTGACCGTG CCGGGCGCCG AGCGCATTTG GTATGTGGCC
GATGCCTTCC GTGCCGGCAA GAGCGTCGAC GATGTGTTCG CGATGACCAG GATCGACCCC
TGGTTCCTGG TGCAGATCGA GGATCTGGTC AAGGAAGAGG AGCGGGTCAA GACTCTCGGC
CTTTCCAGCA TCGATCGCAA CCTGATGTGG CGGCTCAAGC GCAAGGGCTT TTCCGACGCA
CGCCTGGCCA AGCTGCTCGG CGTGACCGAG AAGAACCTGC GCAGCCACCG GCAGAAGCTC
AAGGTGCAGC CGGTGTACAA GCGTGTCGAT ACCTGCGCCG CCGAGTTCGC CAGCGATACC
GCCTACATGT ACTCGACTTA CGAGGAGGAG TGCGAGGCCA AGCCGTCCAG CCGCGACAAG
ATCATGGTCA TCGGCGGCGG TCCGAACCGT ATCGGCCAGG GCATCGAGTT CGACTATTGC
TGCGTGCATG CTGCACTGGC CATGCGCGAA GACGGTTACG AGACCATCAT GGTCAACTGC
AACCCAGAAA CCGTCTCCAC CGACTACGAC ACGTCCGATC GCCTGTATTT CGAGCCGGTG
ACCCTGGAGG ACGTGTTGGA AATCGTCCGT GTCGAACAGC CCAAGGGCGT CATCGTGCAG
TACGGCGGTC AGACCCCGCT GAAGATCTGC CGCGCGCTGG AGGAAGCCGG TGTGCCGATC
ATCGGCACCA GCCCGGACGC CATCGACCGC GCCGAGGACC GCGAGCGCTT CCAGCACATG
GTCGAGCGCC TCAACCTGCG CCAGCCGCCG AACGCCACTG CCCGCAGCGA GGACGAGGCC
ATCGCCGCCT CGAAGGCGAT CGGCTATCCG CTGGTGGTGC GCCCGTCTTA TGTGCTGGGT
GGCCGGGCCA TGGAGATCGT CTATGAGGAA GACGAACTCA AGCGCTACAT GCGCGAGGCC
GTGCAGGTCT CCAACGACAG CCCGGTACTG CTCGACCACT TCCTCAATTG CGCCATCGAG
GTCGATATCG ATGCCGTCTG TGACGGCGAG GACGTAGTGA TCGGTGCGAT CATGCAGCAC
ATCGAGCAGG CCGGCGTGCA TTCCGGCGAT TCTGCCTGTT CGCTGCCGCC CTATTCGCTG
CCGGCGCACA TCCAGGACGA TATCCGCGAA CAGGTCAAGA AGATGGCCCT GGAGCTCGGC
GTCGTCGGTC TGATGAACGT CCAGATGGCG GTGCAGGGCG AAGACATCTA CGTCCTGGAG
GTGAACCCGC GCGCCTCGCG CACCGTGCCT TTCGTCTCCA AGTGCATCGG TCAGTCCTTG
GCCAAGATCG CCGCGCGCGT GATGGCCGGC AAGACGCTCA AGGAGATCGG TTTCACCCGC
GAGATCATCC CGACGTACTT CAGCGTGAAG GAAGCGGTGT TCCCGTTTGC CAAATTCCCC
GGCGTCGACC CCATTCTCGG CCCGGAGATG AAGTCCACCG GCGAGGTGAT GGGGGTCGGC
GACAGTTTCG CCGAGGCTTT TGCCAAGGCC CAACTGGGGG CCAGCGAGAC CCTGCCAGCC
GGTGGTTGCG CCTTCATCAG CGTGCGAGAA GACGACAAGC CGTTCGTCGC CGAGGTGGCG
CGCAACCTGG TCGCCCTCGG CTTCGAGGTG GTGGCCACTG CCGGTACCGC TCGAATAATC
GAAGCGGCCG GCCTGCCGGT TCGCCGGGTG AACAAGGTGA CCGAGGGGCG TCCCCATGTG
GTCGACATGA TCAAGAACGA TGAGGTCACC CTGATCATCA ACACTACCGA GGGGCGCCAG
TCGATCGCCG ACTCCTTCTC GATCCGTCGC AACGCTCTGC AGCACAAGAT CTGCATCACC
ACCACCATCG CCGGTGGCCA GGCGATCTGT GAGGCGCTCA GGTTCGGTCC CGAGAAGACC
GTGCGCCGTC TGCAGGATCT CCATGCAGGA ATCAACGCAT GA
 
Protein sequence
MPKRTDIKSI LILGAGPIVI GQACEFDYSG AQACKALKEE GFRVILVNSN PATIMTDPTM 
ADATYIEPIK WQTVAKIIEK ERPDALLPTM GGQTALNCAL ALEKHGVLTK FGVEMIGANA
DTIDKAEDRS RFDKAMRAIG LECPRSGIAH SMDEAYGVLD KVGFPCIIRP SFTMGGTGGG
IAYNREEFEE ICTRGLDLSP TSELLIDESL IGWKEYEMEV VRDKKDNCII VCSIENFDPM
GVHTGDSITV APAQTLTDKE YQIMRNASLA VLREIGVETG GSNVQFGINP VDGRMVVIEM
NPRVSRSSAL ASKATGFPIA KIAAKLAVGY TLDELSNDIT GGRTPASFEP AIDYVVTKVP
RFAFEKFPKA DARLTTQMKS VGEVMAIGRT FQESVQKALR GLEVGVSGFD PKLDPGNPEA
GSILKRELTV PGAERIWYVA DAFRAGKSVD DVFAMTRIDP WFLVQIEDLV KEEERVKTLG
LSSIDRNLMW RLKRKGFSDA RLAKLLGVTE KNLRSHRQKL KVQPVYKRVD TCAAEFASDT
AYMYSTYEEE CEAKPSSRDK IMVIGGGPNR IGQGIEFDYC CVHAALAMRE DGYETIMVNC
NPETVSTDYD TSDRLYFEPV TLEDVLEIVR VEQPKGVIVQ YGGQTPLKIC RALEEAGVPI
IGTSPDAIDR AEDRERFQHM VERLNLRQPP NATARSEDEA IAASKAIGYP LVVRPSYVLG
GRAMEIVYEE DELKRYMREA VQVSNDSPVL LDHFLNCAIE VDIDAVCDGE DVVIGAIMQH
IEQAGVHSGD SACSLPPYSL PAHIQDDIRE QVKKMALELG VVGLMNVQMA VQGEDIYVLE
VNPRASRTVP FVSKCIGQSL AKIAARVMAG KTLKEIGFTR EIIPTYFSVK EAVFPFAKFP
GVDPILGPEM KSTGEVMGVG DSFAEAFAKA QLGASETLPA GGCAFISVRE DDKPFVAEVA
RNLVALGFEV VATAGTARII EAAGLPVRRV NKVTEGRPHV VDMIKNDEVT LIINTTEGRQ
SIADSFSIRR NALQHKICIT TTIAGGQAIC EALRFGPEKT VRRLQDLHAG INA