Gene Information |
Locus tag | Cwoe_3204 |
Symbol | |
ID | 8733653 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 3410363 |
End bp | 3413491 |
Gene Length | 3129 bp |
Protein Length | 1042 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 646503822 |
Product | carbamoyl-phosphate synthase, large subunit |
Protein accession | YP_003394998 |
Protein GI | 284044658 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit |
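The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3410363, 3413491)
print(length)                     # 3129
print(protein_length_aa(length))  # 1042
```

Both results match the Gene Length and Protein Length fields of this record.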
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.610402 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.762296 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCAAGC GCACCGACAT CAGAAAGATC CTCATCATCG GCTCCGGGCC GATCGTGATC GGACAGGCGG CCGAGTTCGA CTACTCCGGC ACGCAGGCCT GCAAGGTCCT CATGGAGGAG GGCTACGAGG TCGTGCTCGT CAACTCGAAT CCGGCGACGA TCATGACCGA CCCGGAGATC GCGACCGCGA CCTACGTCGA GCCGCTGCTG CCCGGCCCGG TCGCGCAGGT GATCGAGCGC GAGCGGCCCG ACGCGCTGCT GCCGACGCTC GGCGGCCAGA CCGCGCTCAA CCTCGCCAAG GCGCTGCACG AGGACGGCAC CCTCACGAGA TACGACGTCG AGCTGATCGG CGCGAACTAC GAGGCGATCG ACCGCGCCGA GGACCGCGAC CGCTTCCGCG AGACGATGGA GACGGCGAGG CTGCGCGTCC CGCGCTCCGC GATCGCCACG ACGCTGGAGG AGGCGCGCGG CGCCCTCCAG GACATCGGCC TGCCGATGAT CATCCGCCCG GCGTTCACGC TCGGCGGCCG CGGCGGCGGC ATCGCCCGCA CCGAGGCCGA GTTCGAGGCG ATCTGCGCGC GCGGCATCGA GGCGTCGCCG ATCGACCAGA TCCTGATCGA CGAGTCGGTC CTCGGCTGGG GCGAGTTCGA GCTGGAGGTG ATGCGCGACC ACGCCGACAA CGTCGTGATC ATCTGCTCGA TCGAGAACCT CGACCCGATG GGCGTCCACA CGGGCGACTC CGTCTGCGTC GCGCCGCAGC AGACGCTCAC GGACAAGCAG TACCAGAAGC TCCGCGACCA GGCGATCGCG GTGATCCGCG CGGTCGGCGT CGAGACCGGC GGCTCCAACG TCCAGTTCGC CGTCAACCCG GAGACCGACG AGATCATCGT CATCGAGATG AACCCGCGCG TCTCCCGGTC GAGCGCGCTC GCGTCGAAGG CGACCGGCTT CCCGATCGCG AAGATCGCCG CGCGGCTGGC GGTCGGCTAC ACGCTGCAGG AGATCGACAA CGACATCACG CGCGCCACGC CGGCGAGCTT CGAGCCGACG ATCGACTACT GCGTCGTGAA GTGGCCGCGC TTCGCGTTCG AGAAGTTCCC CGGCTCCGAC GCCGGGCTGA CGACGCACAT GAAGTCGGTC GGCGAGGCGA TGGCGATCGG CCGCACCTTC AAGCAGGCGT TCGCGAAGGC GCTGCGCTCG CGCGAGCTGG ACTCGCCCGG CGTCCCGCAC GACGACCTGG AGGAGCTGCT GCTCTCGCTG GAGCAGGGCG GACCGGACCG CTTCGACCTC GTGCTGGAGG CGTTCCGGCG CGGCGTTGAG GTCGAGACAC TGCACGCGCG CACGCAGATC GACCCGTGGT TCCTGCGCGA GCTGCAGGAG CTGGCGCTCG ATCCGGCGGC GGCCGAGGCC GGCGAGCGGA CGTTCAAGTC GGTCGACACC TGCGCGGCCG AGTTCGCTGC GCGCACGCCG TACTACTACT CCGCCCGCGA GCGGCCGCGC AGATCGGGCG TGGTCGAGAA CGAGGTCGTG CGCGGCGATC GCGCCAGCGT CGTGATCCTC GGCGCAGGCC CGAACCGGAT CGGCCAGGGG ATCGAGTTCG ACTACTGCTG CGTGCACGCC GCGATGACGG TGCGCGAGTC CGGCAAGGAC GCGGTGATGG TCAACTGCAA TCCCGAGACG GTCTCGACCG ACTACGACAC CTCCGACCGG CTCTACTTCG AGCCGCTGAC GCTGGAGGAC GTGCTCGGCG TCTGCGAGAT CGAGAAGCCC GAGGGCGTGA TCGTGCAGTT CGGCGGCCAG 
ACGCCGCTGC GGCTCGCGGC CGGCCTGGAG GCGGCGGGCG TGCCGATCCT CGGCACGAGC ATCGACGCGA TCGACCACGC GGAGGACCGC GGCCGCTTCG GCAGGCTGCT GGAGCAGCTC GGCTTCAGCG CGCCGCCGTA CGCGACGGCG CACTCGCCCG AGGAGGCGCT GGCGAAGGCG CCCGGCGTCG GCTTCCCTCT GCTCGTGCGG CCGAGCTACG TGCTCGGCGG CCGCGCGATG GAGATCGTCT ACTCGCTCGA CGGCCTGCAG GACTACCTGA CCCGTGTCGG CGCGGCGCAC GGCTCGGGCA AGGAGATCTT CCTCGACCGC TTCCTGGAGG ACTCGATCGA GGTCGACGTC GACGCGCTCT GCGACGGCAC CGACGTCTGG ATCGGCGGCA TCATGCAGCA CGTCGAGGAG GCCGGGATCC ACTCCGGCGA CTCCGCCTGC GTGCTGCCGC CGCACTCGCT CGGCCCCGAC GCGCTCGCGC AGATCCGCGC GCACACCGAG GGGATCGCGA AGGCGCTCGG CGTCGTCGGC CTGCTGAACG TCCAGTACGC GGTCGACAAG TCCGGTCAGC TGTACGTGAT CGAGGCGAAC CCGCGCGCCT CGCGCACGGT CCCGTTCGTC TCGAAGGCGA TCGGCCTCCC GCTCGCGAAG CTCGCCTGCC GCATCATGCT CGGCGAGAAG ATCGCTGACC TCGGCCTGCC GGAGGACCCG GTCGGCGACG TCGTCTGCGT CAAGGAGGCG GTGATGCCGT TCGATCGCTT CGCCGGCGCG GACTCGCTGC TCGGCCCCGA GATGCGCTCG ACCGGCGAGG TGATGGGGAT CGCCCACGAC TTCCCGACCG CGTTCGCGAA GGCGCAGGCG GCGGCCGGCT CGGTGCTGCC GTCCGAGGGC ACCGTCTTCA TCACCGTCAC CGACGGCGAC AAGCCGGCCG CGGCGGGCGT CGCGATGGCG CTGCACGGGC TCGGCTTCAG AATCGTCGCG ACCGCCGGGA CCGCGCAGGC GATCAAACGG ATGGGCATCC CCGTCGAGGC GCTGGAGAAG ATCGGCTCGG GCTCGCCGAA CGTGCTGGAG CTGATCGAGC GCGGCGAGGT CAAGCTCGTC GTCAACACGC CCGTCGGGAC CGGCGCGCGG ATCGACGGCT GGGAGATCCG CTCCGCCGCG ATCGCCGCCG GGATCCCCTG CATCACGACG ATGACGGGCG CGATGGCCGC CGCGCAGGCG ATCGCCGCAG GCGCGCGGGG CGTGCCGGCC GTGATCGCGC TGCAAGAGCT GCAGCGGGTC GGAGACGGCT CGCCCGCGGC CGGAGCGCCG GTCGCGTGA
|
Protein sequence | MPKRTDIRKI LIIGSGPIVI GQAAEFDYSG TQACKVLMEE GYEVVLVNSN PATIMTDPEI ATATYVEPLL PGPVAQVIER ERPDALLPTL GGQTALNLAK ALHEDGTLTR YDVELIGANY EAIDRAEDRD RFRETMETAR LRVPRSAIAT TLEEARGALQ DIGLPMIIRP AFTLGGRGGG IARTEAEFEA ICARGIEASP IDQILIDESV LGWGEFELEV MRDHADNVVI ICSIENLDPM GVHTGDSVCV APQQTLTDKQ YQKLRDQAIA VIRAVGVETG GSNVQFAVNP ETDEIIVIEM NPRVSRSSAL ASKATGFPIA KIAARLAVGY TLQEIDNDIT RATPASFEPT IDYCVVKWPR FAFEKFPGSD AGLTTHMKSV GEAMAIGRTF KQAFAKALRS RELDSPGVPH DDLEELLLSL EQGGPDRFDL VLEAFRRGVE VETLHARTQI DPWFLRELQE LALDPAAAEA GERTFKSVDT CAAEFAARTP YYYSARERPR RSGVVENEVV RGDRASVVIL GAGPNRIGQG IEFDYCCVHA AMTVRESGKD AVMVNCNPET VSTDYDTSDR LYFEPLTLED VLGVCEIEKP EGVIVQFGGQ TPLRLAAGLE AAGVPILGTS IDAIDHAEDR GRFGRLLEQL GFSAPPYATA HSPEEALAKA PGVGFPLLVR PSYVLGGRAM EIVYSLDGLQ DYLTRVGAAH GSGKEIFLDR FLEDSIEVDV DALCDGTDVW IGGIMQHVEE AGIHSGDSAC VLPPHSLGPD ALAQIRAHTE GIAKALGVVG LLNVQYAVDK SGQLYVIEAN PRASRTVPFV SKAIGLPLAK LACRIMLGEK IADLGLPEDP VGDVVCVKEA VMPFDRFAGA DSLLGPEMRS TGEVMGIAHD FPTAFAKAQA AAGSVLPSEG TVFITVTDGD KPAAAGVAMA LHGLGFRIVA TAGTAQAIKR MGIPVEALEK IGSGSPNVLE LIERGEVKLV VNTPVGTGAR IDGWEIRSAA IAAGIPCITT MTGAMAAAQA IAAGARGVPA VIALQELQRV GDGSPAAGAP VA
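The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, plus a GC-fraction calculation of the kind that would underlie the GC content field. The helper names are hypothetical, and the example runs only on the first two blocks of the gene sequence rather than the full gene.

```python
def clean_sequence(grouped: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence shown above.
first_blocks = "ATGCCCAAGC GCACCGACAT"
seq = clean_sequence(first_blocks)
print(len(seq), round(gc_fraction(seq), 2))  # 20 0.6
```

Applying the same two functions to the complete gene sequence would give a value to compare against the reported GC content.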