Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_5538 |
Symbol | |
ID | 8736013 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 5931736 |
End bp | 5933526 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 646506168 |
Product | Carbamoyl-phosphate synthase L chain ATP- binding protein |
Protein accession | YP_003397318 |
Protein GI | 284046978 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit |
TIGRFAM ID | [TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0551401 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTCACGA AGATCCTGAT CGCCAACCGC GGTGAGATCG CGGTTCGCGT GATCCGCGCC TGCGAGGAGA TGGGCATCGC GTCCGTCGCC GTCTATTCGG AGCTCGATCG CGACGCGCTC CACGTCCGCC GCGCCGACGA GGCGTACCTG ATCGGGCCCG GCCCGGCGGC CGAGTCCTAC CTGAGAGTCG ACAAGATCCT CGAGGTCGCC AAGAGATCCG GCGCCGAGGC GATTCACCCC GGCTACGGCT TCCTGGCGGA GAACGCGGCG TTCGCCGCGG CCTGCGAGGA AGCCGGCATC ACGTTCATCG GCCCGCCCGC CAGCGCGATC GACGCGATGG GCTCGAAGAC CGCCGCGCGC GACCTGATGA AGAAGGCCGG CGTCCCGATC GTGCCCGGCA CGACCGAGCC GGTGGCGGAC GTGAAATCAG CGCGCAGAAT CATCGAGAGA ACGATCGGCT TCCCGGTCGC GGTGAAGGCG GCGGGCGGCG GCGGGGGCAA GGGCTTCCGC GTCGCGCTGA CCGACGACGA GCTGGAGGCC GCCTTCGAGG GCGCCGCGCG CGAAGGCGAG AAGTTCTTCT CCGATGCGAC CGTCTACCTC GAGCGCTATC TGCCCGACCC GCGCCACGTC GAGGTGCAGG TGCTGGCCGA CCGTCACGGC ACCGTGATCC ACCTCGGCGA GCGCGACTGC TCGGTCCAGC GCCGCCACCA GAAGCTGATC GAGGAGTCTC CCGCCCCGGC CGTGGACGAG GAACTCCGCC AGAAGATCGG CAAGATCGCG ACCGACGCGG CCGCCGCCGT CCACTACGTC GGTGCCGGCA CGATCGAGGG CCTGCTGCAG GACGGGGAGT ACTACTTCCT CGAGATGAAC ACGCGCGTCC AGGTCGAGCA CTGCGTGACC GAGATGACGA CGGGCGTCGA CATCGTCAAG GAGGGCATCC GCGCCGCCGC CGGCGAGCCG CTGTCGATCG CGCAGGAGGA CGTGCAGCTG CGCGGCCACG CGATCGAGTG CCGCATCAAC GCCGAGGACG CGTCGAAGAA CTTCGCGCCC GCGCCGGGCA GAATCGGCGC CTACCGCGAG CCGTCGGGAC CGGGCGTGCG CGTCGACTCG GGCGTCGGCC CGGGCGGCGA GGTCTCGCCG ATGTACGACC CGATGGTGGC GAAGCTGATC GTCTGGGACG TCGACCGCGA GTCGGCGACG AGACGGATGC TGCGCGCGCT GTCGGAGTAC GAGATCACCG AGCTGAAGAC GCTGATCCCG TTCCACACGG CGCTGCTCGC GACGAGACAG TGGGGCAACG CGGAGACGTG CCGCGACCTC GTCGAGGACC GCAAGTGGCT CAGAGAGCTG GCGTTCCCGC CGCCGACGCC GAGCGACGAC GAGGACGACC CGAAGGTCGA GCAGACCTAC ACGGTCGAGG TCTCCGGCCG CCGCTTCGAC GTCAGAGTGA TCGGCGCGCC GTTCGCGGGC GGCGGCGCAG GGTCGCTGAA CGGCAGCGGC CCGGCGGGCG CCGCGAAGAA GCCGCGCCGC GAGCGCAAGA GCGGCGGTGG CGGCGGTGGC GCGGACACGC TCCCCTCACC GATGCAGGGC AACATGTGGA GAGTCAAGGT GAAGCAGGGC GACACGGTCG AGGAGGGCCA GCTGCTCTGC ATCATCGAGG CGATGAAGAT GGAGAACGAG ATCACCGCCC ACAAGGCCGG CGTGATCGCC GAGATCCCCA TCACCGAGGG CGCCGCGATC GGCGCGGGCG ACACGATCGC GGTCATCAGA TCGCCGCCCG CGGCGGAGTA G
|
Protein sequence | MFTKILIANR GEIAVRVIRA CEEMGIASVA VYSELDRDAL HVRRADEAYL IGPGPAAESY LRVDKILEVA KRSGAEAIHP GYGFLAENAA FAAACEEAGI TFIGPPASAI DAMGSKTAAR DLMKKAGVPI VPGTTEPVAD VKSARRIIER TIGFPVAVKA AGGGGGKGFR VALTDDELEA AFEGAAREGE KFFSDATVYL ERYLPDPRHV EVQVLADRHG TVIHLGERDC SVQRRHQKLI EESPAPAVDE ELRQKIGKIA TDAAAAVHYV GAGTIEGLLQ DGEYYFLEMN TRVQVEHCVT EMTTGVDIVK EGIRAAAGEP LSIAQEDVQL RGHAIECRIN AEDASKNFAP APGRIGAYRE PSGPGVRVDS GVGPGGEVSP MYDPMVAKLI VWDVDRESAT RRMLRALSEY EITELKTLIP FHTALLATRQ WGNAETCRDL VEDRKWLREL AFPPPTPSDD EDDPKVEQTY TVEVSGRRFD VRVIGAPFAG GGAGSLNGSG PAGAAKKPRR ERKSGGGGGG ADTLPSPMQG NMWRVKVKQG DTVEEGQLLC IIEAMKMENE ITAHKAGVIA EIPITEGAAI GAGDTIAVIR SPPAAE
|
| |