Gene Information |
Locus tag | Cwoe_4972 |
Symbol | |
ID | 8735438 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 5300677 |
End bp | 5301846 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 646505599 |
Product | Capsule synthesis protein, CapA |
Protein accession | YP_003396758 |
Protein GI | 284046418 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation) |
TIGRFAM ID | |
|
|
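The coordinates and lengths in the Gene Information table can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the table: Start bp, End bp.
length = gene_length_bp(5300677, 5301846)
print(length, protein_length_aa(length))  # prints "1170 389"
```

Under these assumptions the result matches the listed Gene Length (1170 bp) and Protein Length (389 aa).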
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.162067 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCGCCC AACCCCCTCC CCCGCCGCCG GCTCCCGCTC CGCCACCGCT CGTGCTGCGC GCACCGCGGC TGGTCGCCAC CGGCGTCGCG TTCGAGCTGC GCGGGCGCGG GGCGAGCGCC GGCGAGCGCG TGCGGGTGCA GCTGCGGCTC GACGGCCGCT GGCGCGAGCT GGCGGCGGCG CGCGCCGGCA GGGACGGCCG CGTCACGACG CGTGTCGCGC CGCGCACGCC GCGGCCCGCG TACCGCCTGC GGATCGCGAC GCGCGACGGG CGCCTCTCGG ACGGCGTCGT CGTGCGCACG CGCGACGTCA CGCTCGCGGC GGTCGGCGAC ATCAACCTCG GCGACGCGAC GGCCGGGGCG ATCGCGGCCG GCGGCGTCAA CTATCCGTGG ACGAGCGTCG CGCCGGCGCT GCGCGGCGCC GACGTCGCGT TCGGCAACCT CGAGTGCGCC ATCTCGACGC GCGGCGCGCC GGTGCAGAAG CAGTACACCT TCCGCGGCAG CCCGGCGGCA CTGCGGGCGA TGCGCGACTA CGCCGGCTTC GACGTGCTCA ACCTCGCCAA CAACCACGTC GGCGACTACG GCACCGCGGC GCTGCTCGAC ACCGTCGAGC ACGTCCGCGC CGGCGGTATG GAGGCGGTCG GCGCCGGCGG CTCGCTCGCC TCCGCCGCGG CGCCGCGCGT CGTCGAGCGG CTGGGGCTGC GGATCGCCTT CGTCGGCTTC TCGAACATCC TCCCGAGCGA GTTCTTCGCG ACGCCGTCAC GGGCCGGCAC GCAGCCGGCG ACGACGGCGC AGATCCGCGC CTCCGTCGCC GCCGCCAGAC GCCGCGCCGA CGTCGTGATC GCGACCTTCC ACTGGGGCGT CGAGCTGGAC CCGGTCGAGA ACGGCGCCGA GCAGGCGTTC GCCGCGACCG CGCTGGCGGC CGGCGCGACC GCCGTGATCG GCGGTCACCC GCACGTGCTG CAGCCGATCC GCATGCTCGA CGGCGGCCGC CGCCTCGTCG CGTACAGCCT CGGCAACTTC GTCTTCGCCT CCCACCGCGC GGCGACCGTC CGTACCGGCG TCCTGCACCT CGACCTGTCG GCCCGCGGCG TCGAGCGGAC GCGCTTCCAG CACGCCCGCA TCGACGGCGT CAAGCCGCTG CTGACGGGGC GCTGGACGCG CGTCGGCTGA
|
Protein sequence | MVAQPPPPPP APAPPPLVLR APRLVATGVA FELRGRGASA GERVRVQLRL DGRWRELAAA RAGRDGRVTT RVAPRTPRPA YRLRIATRDG RLSDGVVVRT RDVTLAAVGD INLGDATAGA IAAGGVNYPW TSVAPALRGA DVAFGNLECA ISTRGAPVQK QYTFRGSPAA LRAMRDYAGF DVLNLANNHV GDYGTAALLD TVEHVRAGGM EAVGAGGSLA SAAAPRVVER LGLRIAFVGF SNILPSEFFA TPSRAGTQPA TTAQIRASVA AARRRADVVI ATFHWGVELD PVENGAEQAF AATALAAGAT AVIGGHPHVL QPIRMLDGGR RLVAYSLGNF VFASHRAATV RTGVLHLDLS ARGVERTRFQ HARIDGVKPL LTGRWTRVG
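The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup together with a simple GC-fraction calculation; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(grouped: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (ambiguity codes are not handled)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence, used here as a short example.
example = "ATGGTCGCCC AACCCCCTCC"
seq = clean_sequence(example)
print(len(seq), gc_fraction(seq))
```

To compare against the GC content listed in the Gene Information table, pass the complete gene sequence to `clean_sequence` instead of the two-block example.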