Gene Cwoe_3204 details


Gene Information

Locus tag: Cwoe_3204
Symbol:
ID: 8733653
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 3410363
End bp: 3413491
Gene Length: 3129 bp
Protein Length: 1042 aa
Translation table: 11
GC content: 71%
IMG OID: 646503822
Product: carbamoyl-phosphate synthase, large subunit
Protein accession: YP_003394998
Protein GI: 284044658
COG category: [E] Amino acid transport and metabolism; [F] Nucleotide transport and metabolism
COG ID: [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ)
TIGRFAM ID: [TIGR01369] carbamoyl-phosphate synthase, large subunit
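
The coordinate and length fields above are consistent with each other, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(3410363, 3413491)
print(length, protein_length(length))  # prints "3129 1042"
```

Under those assumptions the result matches the reported Gene Length (3129 bp) and Protein Length (1042 aa).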


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.610402
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.762296
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCAAGC GCACCGACAT CAGAAAGATC CTCATCATCG GCTCCGGGCC GATCGTGATC 
GGACAGGCGG CCGAGTTCGA CTACTCCGGC ACGCAGGCCT GCAAGGTCCT CATGGAGGAG
GGCTACGAGG TCGTGCTCGT CAACTCGAAT CCGGCGACGA TCATGACCGA CCCGGAGATC
GCGACCGCGA CCTACGTCGA GCCGCTGCTG CCCGGCCCGG TCGCGCAGGT GATCGAGCGC
GAGCGGCCCG ACGCGCTGCT GCCGACGCTC GGCGGCCAGA CCGCGCTCAA CCTCGCCAAG
GCGCTGCACG AGGACGGCAC CCTCACGAGA TACGACGTCG AGCTGATCGG CGCGAACTAC
GAGGCGATCG ACCGCGCCGA GGACCGCGAC CGCTTCCGCG AGACGATGGA GACGGCGAGG
CTGCGCGTCC CGCGCTCCGC GATCGCCACG ACGCTGGAGG AGGCGCGCGG CGCCCTCCAG
GACATCGGCC TGCCGATGAT CATCCGCCCG GCGTTCACGC TCGGCGGCCG CGGCGGCGGC
ATCGCCCGCA CCGAGGCCGA GTTCGAGGCG ATCTGCGCGC GCGGCATCGA GGCGTCGCCG
ATCGACCAGA TCCTGATCGA CGAGTCGGTC CTCGGCTGGG GCGAGTTCGA GCTGGAGGTG
ATGCGCGACC ACGCCGACAA CGTCGTGATC ATCTGCTCGA TCGAGAACCT CGACCCGATG
GGCGTCCACA CGGGCGACTC CGTCTGCGTC GCGCCGCAGC AGACGCTCAC GGACAAGCAG
TACCAGAAGC TCCGCGACCA GGCGATCGCG GTGATCCGCG CGGTCGGCGT CGAGACCGGC
GGCTCCAACG TCCAGTTCGC CGTCAACCCG GAGACCGACG AGATCATCGT CATCGAGATG
AACCCGCGCG TCTCCCGGTC GAGCGCGCTC GCGTCGAAGG CGACCGGCTT CCCGATCGCG
AAGATCGCCG CGCGGCTGGC GGTCGGCTAC ACGCTGCAGG AGATCGACAA CGACATCACG
CGCGCCACGC CGGCGAGCTT CGAGCCGACG ATCGACTACT GCGTCGTGAA GTGGCCGCGC
TTCGCGTTCG AGAAGTTCCC CGGCTCCGAC GCCGGGCTGA CGACGCACAT GAAGTCGGTC
GGCGAGGCGA TGGCGATCGG CCGCACCTTC AAGCAGGCGT TCGCGAAGGC GCTGCGCTCG
CGCGAGCTGG ACTCGCCCGG CGTCCCGCAC GACGACCTGG AGGAGCTGCT GCTCTCGCTG
GAGCAGGGCG GACCGGACCG CTTCGACCTC GTGCTGGAGG CGTTCCGGCG CGGCGTTGAG
GTCGAGACAC TGCACGCGCG CACGCAGATC GACCCGTGGT TCCTGCGCGA GCTGCAGGAG
CTGGCGCTCG ATCCGGCGGC GGCCGAGGCC GGCGAGCGGA CGTTCAAGTC GGTCGACACC
TGCGCGGCCG AGTTCGCTGC GCGCACGCCG TACTACTACT CCGCCCGCGA GCGGCCGCGC
AGATCGGGCG TGGTCGAGAA CGAGGTCGTG CGCGGCGATC GCGCCAGCGT CGTGATCCTC
GGCGCAGGCC CGAACCGGAT CGGCCAGGGG ATCGAGTTCG ACTACTGCTG CGTGCACGCC
GCGATGACGG TGCGCGAGTC CGGCAAGGAC GCGGTGATGG TCAACTGCAA TCCCGAGACG
GTCTCGACCG ACTACGACAC CTCCGACCGG CTCTACTTCG AGCCGCTGAC GCTGGAGGAC
GTGCTCGGCG TCTGCGAGAT CGAGAAGCCC GAGGGCGTGA TCGTGCAGTT CGGCGGCCAG
ACGCCGCTGC GGCTCGCGGC CGGCCTGGAG GCGGCGGGCG TGCCGATCCT CGGCACGAGC
ATCGACGCGA TCGACCACGC GGAGGACCGC GGCCGCTTCG GCAGGCTGCT GGAGCAGCTC
GGCTTCAGCG CGCCGCCGTA CGCGACGGCG CACTCGCCCG AGGAGGCGCT GGCGAAGGCG
CCCGGCGTCG GCTTCCCTCT GCTCGTGCGG CCGAGCTACG TGCTCGGCGG CCGCGCGATG
GAGATCGTCT ACTCGCTCGA CGGCCTGCAG GACTACCTGA CCCGTGTCGG CGCGGCGCAC
GGCTCGGGCA AGGAGATCTT CCTCGACCGC TTCCTGGAGG ACTCGATCGA GGTCGACGTC
GACGCGCTCT GCGACGGCAC CGACGTCTGG ATCGGCGGCA TCATGCAGCA CGTCGAGGAG
GCCGGGATCC ACTCCGGCGA CTCCGCCTGC GTGCTGCCGC CGCACTCGCT CGGCCCCGAC
GCGCTCGCGC AGATCCGCGC GCACACCGAG GGGATCGCGA AGGCGCTCGG CGTCGTCGGC
CTGCTGAACG TCCAGTACGC GGTCGACAAG TCCGGTCAGC TGTACGTGAT CGAGGCGAAC
CCGCGCGCCT CGCGCACGGT CCCGTTCGTC TCGAAGGCGA TCGGCCTCCC GCTCGCGAAG
CTCGCCTGCC GCATCATGCT CGGCGAGAAG ATCGCTGACC TCGGCCTGCC GGAGGACCCG
GTCGGCGACG TCGTCTGCGT CAAGGAGGCG GTGATGCCGT TCGATCGCTT CGCCGGCGCG
GACTCGCTGC TCGGCCCCGA GATGCGCTCG ACCGGCGAGG TGATGGGGAT CGCCCACGAC
TTCCCGACCG CGTTCGCGAA GGCGCAGGCG GCGGCCGGCT CGGTGCTGCC GTCCGAGGGC
ACCGTCTTCA TCACCGTCAC CGACGGCGAC AAGCCGGCCG CGGCGGGCGT CGCGATGGCG
CTGCACGGGC TCGGCTTCAG AATCGTCGCG ACCGCCGGGA CCGCGCAGGC GATCAAACGG
ATGGGCATCC CCGTCGAGGC GCTGGAGAAG ATCGGCTCGG GCTCGCCGAA CGTGCTGGAG
CTGATCGAGC GCGGCGAGGT CAAGCTCGTC GTCAACACGC CCGTCGGGAC CGGCGCGCGG
ATCGACGGCT GGGAGATCCG CTCCGCCGCG ATCGCCGCCG GGATCCCCTG CATCACGACG
ATGACGGGCG CGATGGCCGC CGCGCAGGCG ATCGCCGCAG GCGCGCGGGG CGTGCCGGCC
GTGATCGCGC TGCAAGAGCT GCAGCGGGTC GGAGACGGCT CGCCCGCGGC CGGAGCGCCG
GTCGCGTGA
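
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be cleaned before its length or GC content can be compared with the reported values (3129 bp, 71%). The following is a minimal sketch of that check; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the example string is only the first two blocks of the sequence, not the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Only the first two blocks of the gene, used here as a short illustration.
fragment = clean_sequence("ATGCCCAAGC GCACCGACAT")
print(len(fragment), round(gc_percent(fragment), 1))
```

Pasting the full gene sequence into `clean_sequence` should give a length and GC percentage that can be checked against the Gene Information fields.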
 
Protein sequence
MPKRTDIRKI LIIGSGPIVI GQAAEFDYSG TQACKVLMEE GYEVVLVNSN PATIMTDPEI 
ATATYVEPLL PGPVAQVIER ERPDALLPTL GGQTALNLAK ALHEDGTLTR YDVELIGANY
EAIDRAEDRD RFRETMETAR LRVPRSAIAT TLEEARGALQ DIGLPMIIRP AFTLGGRGGG
IARTEAEFEA ICARGIEASP IDQILIDESV LGWGEFELEV MRDHADNVVI ICSIENLDPM
GVHTGDSVCV APQQTLTDKQ YQKLRDQAIA VIRAVGVETG GSNVQFAVNP ETDEIIVIEM
NPRVSRSSAL ASKATGFPIA KIAARLAVGY TLQEIDNDIT RATPASFEPT IDYCVVKWPR
FAFEKFPGSD AGLTTHMKSV GEAMAIGRTF KQAFAKALRS RELDSPGVPH DDLEELLLSL
EQGGPDRFDL VLEAFRRGVE VETLHARTQI DPWFLRELQE LALDPAAAEA GERTFKSVDT
CAAEFAARTP YYYSARERPR RSGVVENEVV RGDRASVVIL GAGPNRIGQG IEFDYCCVHA
AMTVRESGKD AVMVNCNPET VSTDYDTSDR LYFEPLTLED VLGVCEIEKP EGVIVQFGGQ
TPLRLAAGLE AAGVPILGTS IDAIDHAEDR GRFGRLLEQL GFSAPPYATA HSPEEALAKA
PGVGFPLLVR PSYVLGGRAM EIVYSLDGLQ DYLTRVGAAH GSGKEIFLDR FLEDSIEVDV
DALCDGTDVW IGGIMQHVEE AGIHSGDSAC VLPPHSLGPD ALAQIRAHTE GIAKALGVVG
LLNVQYAVDK SGQLYVIEAN PRASRTVPFV SKAIGLPLAK LACRIMLGEK IADLGLPEDP
VGDVVCVKEA VMPFDRFAGA DSLLGPEMRS TGEVMGIAHD FPTAFAKAQA AAGSVLPSEG
TVFITVTDGD KPAAAGVAMA LHGLGFRIVA TAGTAQAIKR MGIPVEALEK IGSGSPNVLE
LIERGEVKLV VNTPVGTGAR IDGWEIRSAA IAAGIPCITT MTGAMAAAQA IAAGARGVPA
VIALQELQRV GDGSPAAGAP VA
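
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration rather than the database's own method: it builds the standard codon table from the usual TCAG ordering, which assigns the same amino acids to sense codons as translation table 11; table 11 differs in its set of permitted start codons, which does not matter here because the gene begins with ATG. The function name `translate` is hypothetical, and input containing characters other than A, C, G, T would raise a `KeyError`.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(blocked_dna: str) -> str:
    """Translate a blocked DNA sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(blocked_dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGCCCAAGC GCACCGACAT CAGAAAGATC"))  # prints "MPKRTDIRKI"
```

The same call on the last nine bases of the gene, `GTCGCGTGA`, yields `VA`, which matches the end of the protein sequence.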