Gene Information |
Locus tag | Nmul_A1129 |
Symbol | |
ID | 3784242 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 1298315 |
End bp | 1299343 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637811214 |
Product | biotin synthase |
Protein accession | YP_411824 |
Protein GI | 82702258 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0502] Biotin synthase and related enzymes |
TIGRFAM ID | [TIGR00433] biotin synthetase |
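
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 1298315
end_bp = 1299343


def feature_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_from_cds(cds_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon


gene_length = feature_length(start_bp, end_bp)
protein_length = protein_length_from_cds(gene_length)
print(gene_length, protein_length)  # 1029 342
```

Both results agree with the Gene Length (1029 bp) and Protein Length (342 aa) fields.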
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCAAT CAGCTTCCGG TATCGTCAAC ATTGAAACGT CTTCGGGCGG AGCAGTCGAC AGTTTCGCTG ACAAACTTTC CGGCATCAGT GCTGATTTTC GGCGGTGGGA CGTTCGCCGC GTCGAAGCGT TGCTCGATCT GCCGTTCAGC GATCTGATTC ATCGGGCGCA ACTGGTTCAT CGTCAATATC ACGATCCCAA CGGAGTGCAA CTTTCCACAC TGATTTCCGT CAAGACCGGG GGGTGTCCTG AGGATTGCGG ATATTGTCCG CAGGCGGCGC GCTATCATAC GAATGTGGAA AATCAGGACA TGCTCACGGT TGAAGACGTG ATAGCAGCAG CGGCAGCCGC CAAAGCGCAG GGGGCCACGC GGTTTTGCAT GGGCGCAGCG TGGCGCGGTC CCAAGGAGCG GGATATGAAA AGCGTGGTGG AAATGGTGAG GGCTGTAAAA GCGTTGGGGC TTGAAGCGTG CACGACGTTG GGCATGTTGA AGCCGGGACA GGCGGAACAG CTCCAAGAGG CGGGTCTCGA CTATTACAAC CATAACCTCG ATACGTCACC CGAATTTTAC GGGGAGATCA TTACAACCCG CGATTACGAG GATCGGCTCG ACACCTTGCA AAAGGTTCGT CATGCGGGTA TCAATGTGTG CTGTGGAGGC ATTGTGGGCA TGGGCGAATC GCGTCGCGCC CGGGCTGGAC TCATCGCCCA GTTGGCTAAT CTCGATCCTT ATCCCGAGTC CGTTCCCATC AACCATCTGG TGCAGGTCGA AGGCACGCCG CTTCACGGGA CAGAGGCGCT GGATCCTCTG GAATTCGTGC GCACGATTGC GGCTGCGCGC ATTACCATGC CGAAGGCAAT GGTAAGGTTA TCAGCCGGCA GGAGAGAAAT GCCGGATGCA GTGCAGGCAC TGTGTTTTCT CGCTGGCGCC AATTCCATTT TTTACGGCGA TAAGCTGCTT ACTACCGGCA ATCCTGAAGC CGAACGGGAT AAAGCGTTAT TTGACAGGCT GGGGCTGCAT TCCATATAG
|
Protein sequence | MSQSASGIVN IETSSGGAVD SFADKLSGIS ADFRRWDVRR VEALLDLPFS DLIHRAQLVH RQYHDPNGVQ LSTLISVKTG GCPEDCGYCP QAARYHTNVE NQDMLTVEDV IAAAAAAKAQ GATRFCMGAA WRGPKERDMK SVVEMVRAVK ALGLEACTTL GMLKPGQAEQ LQEAGLDYYN HNLDTSPEFY GEIITTRDYE DRLDTLQKVR HAGINVCCGG IVGMGESRRA RAGLIAQLAN LDPYPESVPI NHLVQVEGTP LHGTEALDPL EFVRTIAAAR ITMPKAMVRL SAGRREMPDA VQALCFLAGA NSIFYGDKLL TTGNPEAERD KALFDRLGLH SI
|
| |
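
The protein sequence can be related back to the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own pipeline: it assumes the codon-to-amino-acid assignments of translation table 11, which match the standard code (table 11 differs in its permitted start codons), and it uses only short fragments of the gene sequence for illustration. The helper names `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments shared by
# translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and normalise case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 9 bases of the gene sequence above.
first_bases = "ATGAGCCAAT CAGCTTCCGG TATCGTCAAC"
last_bases = "TCCATATAG"

print(translate(first_bases))  # MSQSASGIVN
print(translate(last_bases))   # SI
```

The two fragments translate to the first ten and last two residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value to compare with the GC content field.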