Gene Sfum_3911 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_3911 
SymbolcarB 
ID4457766 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp4761715 
End bp4764915 
Gene Length3201 bp 
Protein Length1066 aa 
Translation table11 
GC content61% 
IMG OID639704684 
Productcarbamoyl phosphate synthase large subunit 
Protein accessionYP_848015 
Protein GI116751328 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.123492 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAAAC GCAACGACAT TCAAAAGGTG ATGATCATCG GCTCGGGGCC GATCATCATC 
GGCCAGGCCT GTGAATTCGA CTATTCGGGA ACCCAGGCGT GCAAGGCGCT TCGAAAACTG
GGTTATTCCA TTCTGTTGGT CAACTCCAAC CCGGCGACCA TCATGACGGA TCCGGGAATG
GCCGATGTGA CGTACATCGA GCCGCTGACC TTCGAATCGG TGAAAAAGAT CATCGAAAAG
GAACGCCCGG ACGCCATCCT GCCCAATCTC GGCGGTCAGA CCGGGCTGAA CCTCACGGCC
GAGCTTCACC GCAAGGGCGT CCTGGATGCC TGCGGGGTCA AAATCATCGG CGTCCAGGCG
GATGCCATCG AACGCGGGGA AGACCGGATC GCCTTCAAGG AGACCATGAA CCGGCTCAAC
ATTGAAATGC CGAGGAGCGA GCCGGCTCTG TCGGTGAAGG AGGCCGAGGA GATCGCCGCC
CGGCTGGGAT ATCCGGTGGT GATACGGCCG GCCTACACGC TCGGGGGCAC CGGCGGAGGA
CTCGTCTACA ACGTGGAGGA ATTGCGCACC ATTGCCGGCC GGGGCATTTC CGCGAGCCTG
GTGGGACAGG TGCTCGTCGA AGAATCGGTG CTCGGGTGGG AGGAATTGGA GCTCGAAGTG
GTGCGTGACG CCAAGAACCG GCGAATCACC GTGTGTTTCA TTGAAAACGT GGACGCCATG
GGGATTCATA CGGGGGATTC GTTCTGCACC GCCCCCATGC TCACCATCTC CCCTGCTTTG
CAGGAGAAGC TGCAGAAGTA TTCCTATGAC ATTGTCGAAG CCATCGAGGT CATCGGCGGA
ACCAACATCC AGTTCGCGCA CGATCCGAAA ACCGGCCGCG TGGTGGTGAT CGAGATCAAC
CCGAGAACAT CCCGCTCGTC GGCACTCGCC TCCAAGGCGA CCGGTTTCCC CATCGCCATG
ATTTCCTCGA TGCTGGCCGG AGGCCTCACC CTCGATGAAA TCCCCTACTG GCGCGAAGGC
ACCCTGGACA AATACACGCC CTGGGGGGAC TACGTGGTCG TGAAGTTCGC CCGCTGGGCG
TTCGAGAAAT TCAAGGGCGT CGAGGACCGC CTCGGCACGC AGATGCGTGC GGTGGGAGAG
GTGATGAGCC TGGGCAAGAA CTACAAGGAA GCCCTTCAGA AGTCCATCCG CTCCCTGGAA
ATCGGCCGGC ACGGATTCGG TTTTGCCGGG GATTTCCGTG GCAGGCCGCT GGAAGAACTG
ATGGAAGCAC TCTGCACGGC CACCAGCGAA CGCCAGTTCC TCATGTACGA AGCCTTGAGG
AAAGGCGCCG ACGTCGGGGC AATCTACGCC AAGACCCACA TCAAGCCCTG GTTCATCGAA
CAGATGAAGG AACTGGTCGA ACTTGAGGAA ACGATCCTTT CCCACAAGGG GCGCGAGCTG
CCCGATGATC TGCTCGCCCG GGCCAAGCGG GACGGCTTTG CCGACAAGTA CCTGGCGCAA
CTGCTGCAAG TGCCCGAAGC CCGGATCCGC AATCGAAGAA AAGAGCTTGG AATCGTCCAG
GGGTGGCAGG CGGTGCCGGT CAGCGGGGTC GAGAACGCCG CTTACTTCTA TTCGACCTAT
AACGCTCCCG ATGCGGTCGA AACCAGCAGC CGCCGCAAGA TCATGGTCCT CGGGGGAGGT
CCGAACCGGA TCGGCCAGGG AATAGAGTTC GACTACTGCT GCGTGCACGC CGCCTTCGCC
CTCAGAGACG AAGGTTTCGA ATCCATCATG GTGAACTGCA ACCCGGAAAC GGTTTCCACC
GACTACGACA CCTCGGACAA ACTGTATTTC GAACCGGTGA CCGTGGAAGA CGTCCTGTCG
ATCTATGAAA AGGAGAAACC GGAGGGCGTC ATCGTTCAGT TCGGCGGGCA GACCCCGTTG
AACATCGCGC GGGAGCTTTG CGACGAGGGG GTGAACATCC TGGGAACGAC GGTGGACACC
ATCGATCTTG CGGAAGACCG GGATCGATTC CGCGGGATGA TGAAGAAGCT CGGCATACCG
ATGCCGGAAT CGGGCATGGC GAGCACCCTG GAAGAAGCCC TGGAGGTGGC GCGCAAAATC
GGCTACCCGC TCATGGTGAG GCCGTCCTAC GTGCTCGGCG GACGCGGCAT GGAGATCGTC
CATGACGAGG AGATGCTCGA GCGTTACGTG GCGGCGGCGA CGGGCGTCAC CCCCGATCGG
CCCATCCTCA TCGACAAGTT CCTCGAGAAC GCCATCGAGG CCGAGGCGGA CGCCATTTCC
GACGGGACCG ACGCTTTCGT GCCCGCCGTC ATGGAACACA TCGAGCTGGC GGGGATTCAT
TCCGGCGATT CGGCGTGCGT CATTCCACCC ATCAGCATCC CGCCGCGTCA CCTCGAAACC
ATCTATGAAT ACACGCGCAA GATCGCCGTC GAGCTCAACG TGGTCGGCCT CATGAACATT
CAGTACGCCA TTGCGAGGGA CACGGTCTAC GTCTTGGAAG CCAACCCGAG GGCATCCCGC
ACCGTTCCCC TGGTGTCCAA GGTGTGTAAT ATCCCCATGG CCCGCATGGC CACCTGGATC
ATGCTCGGCC GAAAACTCGG CGAGCTTGGC GTCAAGAGCC GGCACATACC GCATTTCGGA
GTCAAGGAAG CGGTGTTTCC CTTCAACATG TTTCCGGAAG TGGATCCCGT GCTCGGGCCG
GAAATGCGCT CAACCGGCGA AGTCCTCGGA ATGGCGGATT CTTTCGGCCT GGCCTACTTC
AAGGCTCAGG AGGCCACACT CCAGGGCCTC CCGCCGACCG GAACGGTGCT CATCACGGTC
GCGGACGAGG ACAAGGAAGC CGCACTGCAG GTGGCCAAGC ATTTCGAAGG GCTTGGATTC
AAGCTCATGA CCACGCGCGG CACCCACAGA TTCATGCGGG AAAACGGCAT CCGGACGGAG
CGCATCGACA AGCTGCACGA GGGACGGCCC AATATCGTGG ATGCCATCAA GAACAAGGAG
ATCCACCTGG TCATCAACAC ACCCGACGGC AAGGTCAGCC TCCACGACGA TTCCTACATC
CGCAAGGCGG CTATCACTTA CAAAGTGCCC TACATCACGA CCATCGCCGC CGCCGTTGCC
GCGGCCAGGG GGATCGAAGC GTTCCGGAAG GGCTGGAAGG CCGCCAAATC ACTCCAGGAA
TACCACGCGG ACATAAAGTG A
 
Protein sequence
MPKRNDIQKV MIIGSGPIII GQACEFDYSG TQACKALRKL GYSILLVNSN PATIMTDPGM 
ADVTYIEPLT FESVKKIIEK ERPDAILPNL GGQTGLNLTA ELHRKGVLDA CGVKIIGVQA
DAIERGEDRI AFKETMNRLN IEMPRSEPAL SVKEAEEIAA RLGYPVVIRP AYTLGGTGGG
LVYNVEELRT IAGRGISASL VGQVLVEESV LGWEELELEV VRDAKNRRIT VCFIENVDAM
GIHTGDSFCT APMLTISPAL QEKLQKYSYD IVEAIEVIGG TNIQFAHDPK TGRVVVIEIN
PRTSRSSALA SKATGFPIAM ISSMLAGGLT LDEIPYWREG TLDKYTPWGD YVVVKFARWA
FEKFKGVEDR LGTQMRAVGE VMSLGKNYKE ALQKSIRSLE IGRHGFGFAG DFRGRPLEEL
MEALCTATSE RQFLMYEALR KGADVGAIYA KTHIKPWFIE QMKELVELEE TILSHKGREL
PDDLLARAKR DGFADKYLAQ LLQVPEARIR NRRKELGIVQ GWQAVPVSGV ENAAYFYSTY
NAPDAVETSS RRKIMVLGGG PNRIGQGIEF DYCCVHAAFA LRDEGFESIM VNCNPETVST
DYDTSDKLYF EPVTVEDVLS IYEKEKPEGV IVQFGGQTPL NIARELCDEG VNILGTTVDT
IDLAEDRDRF RGMMKKLGIP MPESGMASTL EEALEVARKI GYPLMVRPSY VLGGRGMEIV
HDEEMLERYV AAATGVTPDR PILIDKFLEN AIEAEADAIS DGTDAFVPAV MEHIELAGIH
SGDSACVIPP ISIPPRHLET IYEYTRKIAV ELNVVGLMNI QYAIARDTVY VLEANPRASR
TVPLVSKVCN IPMARMATWI MLGRKLGELG VKSRHIPHFG VKEAVFPFNM FPEVDPVLGP
EMRSTGEVLG MADSFGLAYF KAQEATLQGL PPTGTVLITV ADEDKEAALQ VAKHFEGLGF
KLMTTRGTHR FMRENGIRTE RIDKLHEGRP NIVDAIKNKE IHLVINTPDG KVSLHDDSYI
RKAAITYKVP YITTIAAAVA AARGIEAFRK GWKAAKSLQE YHADIK