Gene Anae109_2072 details


Gene Information

Locus tag: Anae109_2072
Symbol: carB
ID: 5374325
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. Fw109-5
Kingdom: Bacteria
Replicon accession: NC_009675
Strand:
Start bp: 2348970
End bp: 2352221
Gene Length: 3252 bp
Protein Length: 1083 aa
Translation table: 11
GC content: 70%
IMG OID: 640843585
Product: carbamoyl phosphate synthase large subunit
Protein accession: YP_001379259
Protein GI: 153004934
COG category: [E] Amino acid transport and metabolism; [F] Nucleotide transport and metabolism
COG ID: [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ)
TIGRFAM ID: [TIGR01369] carbamoyl-phosphate synthase, large subunit
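
The start and end coordinates, gene length, and protein length in this record can be cross-checked against each other. The following is a minimal Python sketch, assuming 1-based inclusive coordinates (the record does not state its convention) and a CDS that ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length(2348970, 2352221)
print(length, protein_length(length))
```

Under these assumptions the computed values agree with the listed gene length (3252 bp) and protein length (1083 aa).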


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value: 0.471557
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGCGCC GCACCGACAT CCGGAAGATC ATGATCGTGG GCTCCGGGCC GATCGTCATC 
GGCCAGGCCT GCGAGTTCGA CTACTCGGGC ACCCAGGCCT GCAAGGCGCT GAAGGAGGAG
GGCTACGAGA TCGTCCTCCT CAACTCGAAC CCGGCCACGA TCATGACGGA CCCGGGCTTC
GCCGACCGGA CCTACGTCGA GCCCATCACC CCCGCCGTCG CCGAGCAGAT CCTCGCCCGC
GAGAAGCCGG ACGTGCTCCT CCCGACCCTC GGCGGCCAGA CGGCGCTCAA CCTCGCCGTG
GCGCTCGCGA AGAACGGGGC GCTGGCGCGG CACGGGGTCG AGCTCATCGG GGCCCAGCTC
GAGGCGATCG AGAAGGCCGA GGACCGGCTC CTCTTCAAGG CCGCGATGGA GCGCGTCGGC
GTCGAGCTGC CGAAGTCCGG CTACGCGACG AGCTGGGAGG AGGCGCGCGC CATCGCCGAG
GACATCGGCT TCCCCATCAT CATCCGCCCC TCGTTCACGA TGGGCGGCGA GGGCGGCGGC
GTCGCCTACA ACCGCGAGGA GTTCGAGCCG CTCGCGCGGC GCGCGCTCAC CCTCTCCCCG
ACGCACACGA TCCTGTGCGA GGAGTCGATC ATCGGGTGGA AGGAGTACGA GCTCGAGGTG
ATGCGCGACC GCAACGACAA CGTCGTCATC ATCTGCTCGA TCGAGAACTT CGACCCGATG
GGCGTCCACA CCGGCGACTC GATCACCGTC GCGCCGGCGC AGACGCTCAC CGACAAGGAG
TACCAGCGGA TGCGCGACGC GGGCATCCGC ATCATCCGCG AGATCGGCGT CGAGACCGGC
GGCTCCAACA TCCAGTTCGG CGTGCACCCC CGCACCGGGC GGATGGTGGT CATCGAGATG
AACCCGCGCG TGTCGCGCTC CTCCGCGCTC GCCTCCAAGG CGACCGGCTT CCCGATCGCC
AAGATCGCGG CGAAGCTCGC GGTGGGCTAC ACGCTCGACG AGCTCAAGAA CGACATCACC
CGCTACACGC CGGCGTCCTT CGAGCCGACC ATCGACTACG TGGTCACGAA GGTGCCGCGC
TTCGCGTTCG AGAAGTTCAA GGGCGCGAAC GACACGCTCA CCACGCAGAT GAAGTCGGTC
GGCGAGGTGA TGGCGATCGG GCGGACCTTC CAGGAGAGCC TGCAGAAGGC GATCCGCGGC
CTCGAGATAG ACCGCTGCGG GCTGGAGTCG CCCCTCGGCA AGCGCCCGGG CGACGCGTAC
GCGTCGGAGG AGCTCGAGCG GATCAAGGCG GAGGTGCGCG TGCCGCGCGA CCGCCGGGTG
TTCTGGGTCG CCGAGGCGCT CCGCGCCGGG CTCTCCGTCG ACGACGTCCA CGCGCTCACC
TACATCGACC CCTGGTTCCT GCGGGAGATG GAGGAGCTCG TCCACGCCGA GGAGGCGCTC
GCGAAGGGCG TCCCGCAGGG CGCGGAGCCG CTCCGCGCCG TGAAGCGGAT GGGCTTCTCC
GACAAGCGCA TCGCGCAGCT CGCGGGGACG ACCGAGAAGG CCGTGCGCGA GGCGCGCTGG
CAGGCGGGCG TGCGGCCGGT CTTCAAGCGC GTCGACACCT GCGCCGCCGA GTTCGAGGCC
TACACGCCGT ACCTGTACTC CACCTACGAG GAGGAGTGCG AGGCGACGCC GACCGACCGC
CGGAAGGTGA TGATCCTCGG CGGCGGCCCG AACCGCATCG GACAGGGCAT CGAGTTCGAC
TACTGCTGCG TGCACGCGTC GTTCGCGCTG TCCAGGGCCG GGTACGAGAC CATCATGGTC
AACTGCAACC CGGAGACGGT CTCGACCGAC TACGACACCT CCGACCGGCT CTACTTCGAG
CCGGTCACGC TCGAGGACGT GCTCGAGATC GTGCACGTCG AGAAGCCGGA GGGGCTCATC
GTCCAGTACG GCGGGCAGAC GCCGCTCAAG CTCGCGGTGC CGCTCCACGA GCTCGGGGTG
CCGATCTTCG GGACGACGCC GGACGCCATC GATCGGGCCG AGGACCGCGA GCGCTTCGCG
GCGCTCATCG AGAAGCTGGG GCTCCGCCAG CCGCAGAACG GCGTGGCGCG CAGCGCCGAC
GAGGCGTTCG CGGTGGCGCG GCGCATCGGC TACCCGGTGA TGGTGCGCCC CTCGTACGTG
CTCGGCGGGC GCGCCATGGA GGTCGTCTAC GACGACAAGG ACCTCGACAC CTACCTCCGC
GAGGCGGTGC AGGCCTCGAA CGAGCGCCCG GTGCTCGTGG ACCGCTTCCT CAGGGACGCG
GCCGAGGTGG ACGTGGACGT CGTGTCGGAC GGCGAGGACG TCGTCGTGGG CGGCGTCATG
GAGCACATCG AGGAGGCGGG CATCCACTCC GGCGACTCCG CCTGCGCGCT GCCGCCCTTC
AGTCTGGCGC CGGAGAGGGT CGCGGAGATC GAGCAGCAGT CGATCGCGCT GGCGAAGGAG
CTGGGCGTCG TCGGCCTCAT GAACGTCCAG TTCGCCATCC AGGGGAACGA CGTCTACGTG
CTCGAGGTGA ACCCGCGCGC GAGCCGCACC GTGCCTTTCG TCGGCAAGGC GACGGGCCTG
CCGCTCGCCA AGGCGGGCTC GCTGTGCATG GTGGGCAAGA GCCTCGAGGA GGCCGGGGCC
CTCGTCGACG GCCGGCGGGG CCACATCTCC GTGAAGGAGG CGGTGTTCCC GTTCGCGCGC
TTCCCTGGCG TGGACACGAT GCTCGGGCCC GAGATGCGCT CGACGGGCGA GGTGATGGGG
ATCGACCAGG ACTTCTACCG CGCGTTCTTC AAGGCGCAGA CCGCGGCCGG GAACACGCTG
CCGGCCTCCG GCGCGGGCCG CCGCGCGTTC GTCTCGGTGA AGGACTCGGA CAAGCCCGCC
ATCGCCGAGC TCGCCCGGCG GCTCGTCGCG CTGGGCTTCG AGGTGCTCGC CACCGCCGGT
ACGAGCGCCT ACCTCGGCGC CCGCGGCGTC CCCACCACCC TGGTCCTGAA GGTGCACGAG
GGGCGCCCCT CCGTGGTGGA CCGCATCAAG GACGGGGACG TGCACCTCGT CTTCAACACC
ACGGCCGGCA AGCAGGAGAT CGCGGACAGC TACTCCATCC GGCGCGAGAC GCTCATGAAG
GGCCTGCCGT ACTTCACGAC CCTGACCGGC GCCCGCGCGG CGGTGGGGGC GATGGAGGCC
GCGCACGGGG GCGCACCGAG CGTGCGCTCG ATCCAGGAAT ACCATGGTGA GGGGCGCCAG
GCGTCCCGTT GA
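
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any analysis. The sketch below, in Python, strips the block formatting and computes the GC fraction. The helper names are hypothetical, and the example input is only the first two blocks of the sequence above, not the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence, for illustration only.
first_blocks = "ATGCCGCGCC GCACCGACAT"
seq = clean_sequence(first_blocks)
print(len(seq), gc_fraction(seq))
```

Passing the full sequence text through the same two functions would give a value to compare with the GC content reported in the gene information.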
 
Protein sequence
MPRRTDIRKI MIVGSGPIVI GQACEFDYSG TQACKALKEE GYEIVLLNSN PATIMTDPGF 
ADRTYVEPIT PAVAEQILAR EKPDVLLPTL GGQTALNLAV ALAKNGALAR HGVELIGAQL
EAIEKAEDRL LFKAAMERVG VELPKSGYAT SWEEARAIAE DIGFPIIIRP SFTMGGEGGG
VAYNREEFEP LARRALTLSP THTILCEESI IGWKEYELEV MRDRNDNVVI ICSIENFDPM
GVHTGDSITV APAQTLTDKE YQRMRDAGIR IIREIGVETG GSNIQFGVHP RTGRMVVIEM
NPRVSRSSAL ASKATGFPIA KIAAKLAVGY TLDELKNDIT RYTPASFEPT IDYVVTKVPR
FAFEKFKGAN DTLTTQMKSV GEVMAIGRTF QESLQKAIRG LEIDRCGLES PLGKRPGDAY
ASEELERIKA EVRVPRDRRV FWVAEALRAG LSVDDVHALT YIDPWFLREM EELVHAEEAL
AKGVPQGAEP LRAVKRMGFS DKRIAQLAGT TEKAVREARW QAGVRPVFKR VDTCAAEFEA
YTPYLYSTYE EECEATPTDR RKVMILGGGP NRIGQGIEFD YCCVHASFAL SRAGYETIMV
NCNPETVSTD YDTSDRLYFE PVTLEDVLEI VHVEKPEGLI VQYGGQTPLK LAVPLHELGV
PIFGTTPDAI DRAEDRERFA ALIEKLGLRQ PQNGVARSAD EAFAVARRIG YPVMVRPSYV
LGGRAMEVVY DDKDLDTYLR EAVQASNERP VLVDRFLRDA AEVDVDVVSD GEDVVVGGVM
EHIEEAGIHS GDSACALPPF SLAPERVAEI EQQSIALAKE LGVVGLMNVQ FAIQGNDVYV
LEVNPRASRT VPFVGKATGL PLAKAGSLCM VGKSLEEAGA LVDGRRGHIS VKEAVFPFAR
FPGVDTMLGP EMRSTGEVMG IDQDFYRAFF KAQTAAGNTL PASGAGRRAF VSVKDSDKPA
IAELARRLVA LGFEVLATAG TSAYLGARGV PTTLVLKVHE GRPSVVDRIK DGDVHLVFNT
TAGKQEIADS YSIRRETLMK GLPYFTTLTG ARAAVGAMEA AHGGAPSVRS IQEYHGEGRQ
ASR
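
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below is a plain-Python illustration, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which start codons are permitted), and it does not handle alternative start codons or ambiguous bases. All names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments,
# which table 11 shares); '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(blocked: str) -> str:
    """Translate a (possibly block-formatted) DNA sequence up to the first stop."""
    seq = "".join(blocked.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene, then its final 12 bases (stop codon included).
print(translate("ATGCCGCGCC GCACCGACAT CCGGAAGATC"))
print(translate("GCGTCCCGTT GA"))
```

The two calls cover the start and the end of the gene, so their results can be compared with the first ten and the last three residues of the protein sequence above.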