Gene Information |
Locus tag | Nmul_A0617 |
Symbol | |
ID | 3784413 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 698972 |
End bp | 700189 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637810699 |
Product | poly-gamma-glutamate synthesis protein (capsule biosynthesis protein) |
Protein accession | YP_411316 |
Protein GI | 82701750 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation) |
TIGRFAM ID | |
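The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive and that the gene length includes a single trailing stop codon; the record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 698972
END_BP = 700189
GENE_LENGTH_BP = 1218
PROTEIN_LENGTH_AA = 405


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(gene_length_bp):
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)      # True
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```

Under these assumptions the fields agree: 700189 - 698972 + 1 = 1218 bp, and 1218 / 3 - 1 = 405 aa.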
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGTAG GTCCGCTCTA CGCCCAAGCC GTGCGGGACG GTGTCCAGAT TTCTGTAAAG AATGGGGATC AGACCGCTCC TGGTCCTCGA TCGCCGGGCG GACATACTTC GATCAAGTTA TTTCTCTGTG GCGATGTCAT GACGGGCAGA GGCATAGACC AGGTCTTGCC TCACCCCGGC AATCCCATTC TTTTTGAGGG GTACATGAAA AGCGCAACAG GCTATGTTGA GCTTGCCGAA GAGGCAAACG GGCCGATTCC GCACCCAGTC CCTTTTTCCT ATATCTGGGG AGATGCGCTT GCCGAACTGG AGCGTAGAAA GCCCGATGTT CGCGTCATTA ACCTCGAAAC CGCGCTTACC CGTAGTGACG AAGTGCAGGA CAAGGCAGTG AACTACCGCA TGAATCCGGA TAATATCCCC TGTATCACTG CCGCGAAGAT CGATTGCTGC GTGCTGGCGA ACAACCATGT TCTGGACTGG GGGTATGAAG GTCTCGCCGA GACGTTGAAA ACGCTCAAAC GAGCCGACAT AAAAATAGCT GGCGCCGGCT TAAACGTTCA AGAGGCGCGG CAACCCGCGG AGATGACAGT TCCCGGAAAG GGACGTGTAC TGGTTTTCTC GCTCGGATCG GAAACAAGCG GCATACCCTG GAACTGGGCG GCTCGAACAG ACCGGGCAGG CGTGAATCTG TTGCCGGATT TTTCAGCAAA AACGGTTCGG GAAATTCGCG ACAGAATAAA GCAGGTCCGC CTGCCCGGTG ACATTGTGGT TGCTTCAATC CATTGGGGTA ATAACTGGGG TTATGCAATT CCAGTCGAGC AACAGGATTT TGCACACGGC CTGATCGATG AAGCGGGTGT TGACGTCATC CACGGCCATT CATCGCATCA TGTGAAGGGC ATCGAGGTTT ACAGGGGAAA GCTCATTCTT TATGGATGCG GCGACTTCCT GAATGATTAT GAAGGCATCT CAGGGCACGA GACTTATCGG GGTGACCTGA CGTTGATGTA TTTCGTGAGC GCGGAGCCGC AGACCGGCAA ACTCGTCAGC CTGTCGATGG TGCTCATGCA GGTCAGGCAT TTCAAATTGA ATCGGGCATC TGATGTCGAT GCTTCCTGGC TGAAGAATAT CCTGAACAGG GAGGGGAAGA AGCTGGGGAC GTCGGTGGAA CTGACGGCGG ATAATACCTT GATGCTCCGA TGGATGCTCC AAGGATAG
|
Protein sequence | MTVGPLYAQA VRDGVQISVK NGDQTAPGPR SPGGHTSIKL FLCGDVMTGR GIDQVLPHPG NPILFEGYMK SATGYVELAE EANGPIPHPV PFSYIWGDAL AELERRKPDV RVINLETALT RSDEVQDKAV NYRMNPDNIP CITAAKIDCC VLANNHVLDW GYEGLAETLK TLKRADIKIA GAGLNVQEAR QPAEMTVPGK GRVLVFSLGS ETSGIPWNWA ARTDRAGVNL LPDFSAKTVR EIRDRIKQVR LPGDIVVASI HWGNNWGYAI PVEQQDFAHG LIDEAGVDVI HGHSSHHVKG IEVYRGKLIL YGCGDFLNDY EGISGHETYR GDLTLMYFVS AEPQTGKLVS LSMVLMQVRH FKLNRASDVD ASWLKNILNR EGKKLGTSVE LTADNTLMLR WMLQG
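The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs mainly in the start codons it permits; this gene starts with ATG, so that difference does not matter here). The `translate` helper is hypothetical and is not part of any tool used to produce this record.

```python
# Compact codon table in NCBI order: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGACGGTAG GTCCGCTCTA CGCCCAAGCC"))  # MTVGPLYAQA
```

The output matches the first block of the protein sequence. Passing the full gene sequence to the same function should, under the same assumptions, reproduce the 405-residue protein.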