Gene Cwoe_5538 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_5538 
Symbol 
ID8736013 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp5931736 
End bp5933526 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content70% 
IMG OID646506168 
ProductCarbamoyl-phosphate synthase L chain ATP- binding protein 
Protein accessionYP_003397318 
Protein GI284046978 
COG category[I] Lipid transport and metabolism 
COG ID[COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit 
TIGRFAM ID[TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0551401 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTCACGA AGATCCTGAT CGCCAACCGC GGTGAGATCG CGGTTCGCGT GATCCGCGCC 
TGCGAGGAGA TGGGCATCGC GTCCGTCGCC GTCTATTCGG AGCTCGATCG CGACGCGCTC
CACGTCCGCC GCGCCGACGA GGCGTACCTG ATCGGGCCCG GCCCGGCGGC CGAGTCCTAC
CTGAGAGTCG ACAAGATCCT CGAGGTCGCC AAGAGATCCG GCGCCGAGGC GATTCACCCC
GGCTACGGCT TCCTGGCGGA GAACGCGGCG TTCGCCGCGG CCTGCGAGGA AGCCGGCATC
ACGTTCATCG GCCCGCCCGC CAGCGCGATC GACGCGATGG GCTCGAAGAC CGCCGCGCGC
GACCTGATGA AGAAGGCCGG CGTCCCGATC GTGCCCGGCA CGACCGAGCC GGTGGCGGAC
GTGAAATCAG CGCGCAGAAT CATCGAGAGA ACGATCGGCT TCCCGGTCGC GGTGAAGGCG
GCGGGCGGCG GCGGGGGCAA GGGCTTCCGC GTCGCGCTGA CCGACGACGA GCTGGAGGCC
GCCTTCGAGG GCGCCGCGCG CGAAGGCGAG AAGTTCTTCT CCGATGCGAC CGTCTACCTC
GAGCGCTATC TGCCCGACCC GCGCCACGTC GAGGTGCAGG TGCTGGCCGA CCGTCACGGC
ACCGTGATCC ACCTCGGCGA GCGCGACTGC TCGGTCCAGC GCCGCCACCA GAAGCTGATC
GAGGAGTCTC CCGCCCCGGC CGTGGACGAG GAACTCCGCC AGAAGATCGG CAAGATCGCG
ACCGACGCGG CCGCCGCCGT CCACTACGTC GGTGCCGGCA CGATCGAGGG CCTGCTGCAG
GACGGGGAGT ACTACTTCCT CGAGATGAAC ACGCGCGTCC AGGTCGAGCA CTGCGTGACC
GAGATGACGA CGGGCGTCGA CATCGTCAAG GAGGGCATCC GCGCCGCCGC CGGCGAGCCG
CTGTCGATCG CGCAGGAGGA CGTGCAGCTG CGCGGCCACG CGATCGAGTG CCGCATCAAC
GCCGAGGACG CGTCGAAGAA CTTCGCGCCC GCGCCGGGCA GAATCGGCGC CTACCGCGAG
CCGTCGGGAC CGGGCGTGCG CGTCGACTCG GGCGTCGGCC CGGGCGGCGA GGTCTCGCCG
ATGTACGACC CGATGGTGGC GAAGCTGATC GTCTGGGACG TCGACCGCGA GTCGGCGACG
AGACGGATGC TGCGCGCGCT GTCGGAGTAC GAGATCACCG AGCTGAAGAC GCTGATCCCG
TTCCACACGG CGCTGCTCGC GACGAGACAG TGGGGCAACG CGGAGACGTG CCGCGACCTC
GTCGAGGACC GCAAGTGGCT CAGAGAGCTG GCGTTCCCGC CGCCGACGCC GAGCGACGAC
GAGGACGACC CGAAGGTCGA GCAGACCTAC ACGGTCGAGG TCTCCGGCCG CCGCTTCGAC
GTCAGAGTGA TCGGCGCGCC GTTCGCGGGC GGCGGCGCAG GGTCGCTGAA CGGCAGCGGC
CCGGCGGGCG CCGCGAAGAA GCCGCGCCGC GAGCGCAAGA GCGGCGGTGG CGGCGGTGGC
GCGGACACGC TCCCCTCACC GATGCAGGGC AACATGTGGA GAGTCAAGGT GAAGCAGGGC
GACACGGTCG AGGAGGGCCA GCTGCTCTGC ATCATCGAGG CGATGAAGAT GGAGAACGAG
ATCACCGCCC ACAAGGCCGG CGTGATCGCC GAGATCCCCA TCACCGAGGG CGCCGCGATC
GGCGCGGGCG ACACGATCGC GGTCATCAGA TCGCCGCCCG CGGCGGAGTA G
 
Protein sequence
MFTKILIANR GEIAVRVIRA CEEMGIASVA VYSELDRDAL HVRRADEAYL IGPGPAAESY 
LRVDKILEVA KRSGAEAIHP GYGFLAENAA FAAACEEAGI TFIGPPASAI DAMGSKTAAR
DLMKKAGVPI VPGTTEPVAD VKSARRIIER TIGFPVAVKA AGGGGGKGFR VALTDDELEA
AFEGAAREGE KFFSDATVYL ERYLPDPRHV EVQVLADRHG TVIHLGERDC SVQRRHQKLI
EESPAPAVDE ELRQKIGKIA TDAAAAVHYV GAGTIEGLLQ DGEYYFLEMN TRVQVEHCVT
EMTTGVDIVK EGIRAAAGEP LSIAQEDVQL RGHAIECRIN AEDASKNFAP APGRIGAYRE
PSGPGVRVDS GVGPGGEVSP MYDPMVAKLI VWDVDRESAT RRMLRALSEY EITELKTLIP
FHTALLATRQ WGNAETCRDL VEDRKWLREL AFPPPTPSDD EDDPKVEQTY TVEVSGRRFD
VRVIGAPFAG GGAGSLNGSG PAGAAKKPRR ERKSGGGGGG ADTLPSPMQG NMWRVKVKQG
DTVEEGQLLC IIEAMKMENE ITAHKAGVIA EIPITEGAAI GAGDTIAVIR SPPAAE