Gene Information |
Locus tag | Nmul_A2205 |
Symbol | |
ID | 3786230 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 2504798 |
End bp | 2505943 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637812292 |
Product | hypothetical protein |
Protein accession | YP_412889 |
Protein GI | 82703323 |
COG category | [S] Function unknown |
COG ID | [COG1432] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00288] conserved hypothetical protein TIGR00288 |
|
|
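The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal check, not part of the original record. It assumes that Start bp and End bp are 1-based and inclusive, and that the last codon of the gene is an untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information rows above.
start_bp = 2504798
end_bp = 2505943
gene_length_bp = 1146
protein_length_aa = 381

# Assumption: coordinates are 1-based and inclusive, so both ends count.
computed_gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
computed_protein_length = computed_gene_length // 3 - 1

print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length rows.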
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCACAC TTCAGGAAAA TGTCAGCATG GCGGTATTTT GTGACTTCGA AAATGTCGCT CTTGGCGTGC GTGATGCGAA ATACGAGAAG TTCGATATCA AACCGGTGCT GGAGCGATTG TTGCTCAAAG GCAGCATTGT CATAAAAAAG GCTTATTGCG ACTGGGATCG TTACAAGACC TTTAAAACTG CGCTACACGA AGCAAATTTC GAGCTGATCG AGATTCCGCA CATTCGTCAG TCCGGTAAAA ACTCAGCCGA CATCCGCCTG GTCGTGGATG CGCTCGATCT TTGTTACACG AAATCGCATG TGAACACGTT CGTCATTATC AGTGGCGATT CCGATTTCTC GCCCCTCGTC TCCAAGTTGC GCGAGAACGC TAAACAAGTA ATCGGTGTTG GAGTCAAGAA ATCAACTTCA GATCTGCTGA TAGCCAACTG CGATGAATTC ATTTTTTATG ACGATCTCGT GCGGGAAATC CAGCGTACTG CCGCGAAACA CGATACGGCA GAAGCACCGC CTGTGGTAAA ACAGGCGCCT GCAGAGGAAA AACGCCCAGG AGAGGAACTC GAAGCACGCA AGGGCCAGGC AATCAACATG GCGGTAGAGA CTTTCGACGC CCTGGTATCC GAGCGGGGCG ACAGCGGAAA AATCTGGGCA TCGACACTCA AGCAAGCGAT CAAGCGCCGC AAACCTGGTT TTAACGAGTC TTATTATGGC TTTCGTGCCT TCGGTAATTT GCTTGAGGAG GCGCAAGCCC GAGGCCTGCT TGAACTGGGC CGTGACGAGA AATCGGGCAC GTATGTTTAT CGCAGCAGCA GCGCGGCCGA TAGCGGTACA GCGGTGGAAG CGTCGGCGCC TGCTCCTGCT TCGCTGGAGA TTGCGGCCCC CTCAGCGCCG CTCGACGGGC TCATCTCCCT GGATGAAACA CATCAGCGGA GAAAGAGTCG TAAGGCCGGG GAAAAGAAAG CAGATTTGCT GCCAGCAACT CCTGTGCCCG GGGCAGAAAC TGCACAGACT GCCGGCAATC CGGCAGATCA AACAGTGCCG GAAAAACCAG GAACCAAGGG AAGGAAAAAG CCAGCCACAC GCCCTCCGCG CAAACCGGAA GCCGGGACAG ACAAGATCGA CAAGAAGCCT GGTTAG
|
Protein sequence | MATLQENVSM AVFCDFENVA LGVRDAKYEK FDIKPVLERL LLKGSIVIKK AYCDWDRYKT FKTALHEANF ELIEIPHIRQ SGKNSADIRL VVDALDLCYT KSHVNTFVII SGDSDFSPLV SKLRENAKQV IGVGVKKSTS DLLIANCDEF IFYDDLVREI QRTAAKHDTA EAPPVVKQAP AEEKRPGEEL EARKGQAINM AVETFDALVS ERGDSGKIWA STLKQAIKRR KPGFNESYYG FRAFGNLLEE AQARGLLELG RDEKSGTYVY RSSSAADSGT AVEASAPAPA SLEIAAPSAP LDGLISLDET HQRRKSRKAG EKKADLLPAT PVPGAETAQT AGNPADQTVP EKPGTKGRKK PATRPPRKPE AGTDKIDKKP G
|
| |
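The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content row. The helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of the source database. The sample uses only the first and last blocks of the gene sequence, so its GC value is not the value for the full gene.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that splits a sequence into 10-character blocks."""
    return "".join(grouped.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string (0.0 if empty)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First and last blocks of the gene sequence above, used as a small sample.
sample = clean_sequence("ATGGCCACAC GGTTAG")

# The gene should begin with a start codon and end with a stop codon.
print(sample.startswith("ATG"))
print(sample[-3:] in {"TAA", "TAG", "TGA"})

# GC fraction of the sample only, not of the full gene.
print(gc_fraction(sample))
```

To check the reported GC content, pass the full Gene sequence value to `clean_sequence` and then to `gc_fraction`.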