Gene Information |
Locus tag | Nmul_A0291 |
Symbol | |
ID | 3785537 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 312280 |
End bp | 313641 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637810367 |
Product | capsular polysaccharide biosynthesis protein CapK |
Protein accession | YP_410991 |
Protein GI | 82701425 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1541] Coenzyme F390 synthetase |
TIGRFAM ID | |
|
|
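The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are inclusive, 1-based positions and that the CDS ends in a single untranslated stop codon; neither assumption is stated in the record, but both agree with the listed values. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 312280
end_bp = 313641

# Assumption: the CDS ends with exactly one stop codon that is not translated.
stop_codons = 1

# Assumption: coordinates are inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; subtract the stop codon to get the residue count.
protein_length = gene_length // 3 - stop_codons

print(gene_length, protein_length)  # 1362 453
```

Both results match the Gene Length (1362 bp) and Protein Length (453 aa) rows of the record.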
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.326701 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAACA GGGACTGGTA TACATCGCTT GTATCGGGTC TTCTCTTTCC GCTGCAAGAG CGCCTCAAGG ATCACTCCAC GGTATCGGTA CGCAAGGCGC TGGAGCTGTC TCAATGGTGG AATCGGGAGC GCCTGGAGAA CTTGCAGCTG TTGAAGTTGC GCCATCTGCT GGCTGAAGCA GAAGCGCACG TACCTTATTA CCGCCGAATA TTTGCGGAGA TTGGCTTCAA GGCAGCCGAG GTGTCCAGCC TTGCCGATCT GGCGCGCTTG CCGCTTCTCG ATAAACCTGC GATCCGCGCC GATACGGAGG CATTGAAGTC CCAAAAAGCC AGAAGCCTGC GCTCTTTCAA CACTGGAGGG TCGAGCGGAG AACCGCTCAC CTTTTATATC GGCAGGGAGC GCGTCAGCCA TGATGTGGCG GCAAAGTGGC GTGCCACACG CTGGTGGGAC GTGGATATCG GCGATCCTGA GATGGTGGTC TGGGGCTCTC CCATCGAACT CGGGGCGCAG GATCGTCTTC GCATGCTTCG TGATCGGCTG CTCAGAACAA GGTTGTTTCC GGCGTTCGAA ATGTCGGAGC AAAAGCTCGA TCGCTTCCTG GGCGAACTGC GCGCCGCGCC CCCCAGAATG TTTTTCGGCT ACCCTTCGGC CTTGTCCCAT ATTGCCCGCC ATGCGCAGGC AAGAGGACAG CGAATGGACG ACCTGGGTAT CAACGTGGCA TTCGTTACTT CGGAACGACT TTACGATGAA CAGCGACAGC AGATCAGTAA AACCTTTGGA TGTCCTGTTG CCAATGGCTA TGGAGGACGC GATGCGGGTT TCATCGCCCA TGAATGCCCG GAGGGTGGCA TGCACATAAC GGCAGAGGAT ATTATCGTGG AGATCGTGGA TCGACAGGGG GTCCCGCTAC CCTGTGGCGA AGCAGGAGAA ATCATAGTCA CTCATCTCTC TACCGCAGAA TTTCCGTTCA TTCGCTATCG CACGGGGGAT ATAGGAATAC TGGATGATCG AATCTGTCGC TGCGGGCGAG GCCTTCCCCT TCTCCGCGAA ATTCAGGGTC GCAGCACGGA TTTTGTCGTC GCCCAGGATG GGACGGTCAT GCATGGTCTG GCCTTGATAT ACATCCTGCG AGAATTGCCG CAGATCAGTC ATTTCAAAAT CATCCAGGAG AGTCTGAACC TTATCCATAT ATGGGTGGTT TCCGGGGCAA AGCTCGATCG GGAGATCACT GCAAAAATCG AGGAAGAATT CAAGGCGCGG CTTGGGCAAT CCGTTGAGGT TCTGATCGAG GAGACAACTG AAATTCCAGC AGAAAAATCC GGCAAGTTTC GTTATGTGAT AAGTAAAATC GCCGGGGCTT GA
|
Protein sequence | MKNRDWYTSL VSGLLFPLQE RLKDHSTVSV RKALELSQWW NRERLENLQL LKLRHLLAEA EAHVPYYRRI FAEIGFKAAE VSSLADLARL PLLDKPAIRA DTEALKSQKA RSLRSFNTGG SSGEPLTFYI GRERVSHDVA AKWRATRWWD VDIGDPEMVV WGSPIELGAQ DRLRMLRDRL LRTRLFPAFE MSEQKLDRFL GELRAAPPRM FFGYPSALSH IARHAQARGQ RMDDLGINVA FVTSERLYDE QRQQISKTFG CPVANGYGGR DAGFIAHECP EGGMHITAED IIVEIVDRQG VPLPCGEAGE IIVTHLSTAE FPFIRYRTGD IGILDDRICR CGRGLPLLRE IQGRSTDFVV AQDGTVMHGL ALIYILRELP QISHFKIIQE SLNLIHIWVV SGAKLDREIT AKIEEEFKAR LGQSVEVLIE ETTEIPAEKS GKFRYVISKI AGA
|
| |
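The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a field might be cleaned up and summarized follows; the helper names `clean` and `gc_fraction` are hypothetical, and the inputs are only the first three and last two blocks of the gene sequence, so the GC value printed here is for that short prefix and is not expected to reproduce the 55% reported for the whole gene.

```python
def clean(seq: str) -> str:
    """Drop the block spacing and normalize case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


# First three and last two blocks of the gene sequence in the record.
prefix = "ATGAAAAACA GGGACTGGTA TACATCGCTT"
tail = "GCCGGGGCTT GA"

print(len(clean(prefix)))              # 30
print(clean(prefix).startswith("ATG")) # True: start codon
print(clean(tail).endswith("TGA"))     # True: stop codon
print(round(gc_fraction(prefix), 2))   # 0.4 for this prefix only
```

Passing the complete Gene sequence field to `gc_fraction` would be the way to compare against the GC content row.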