Gene Information |
Locus tag | Nmul_A0867 |
Symbol | |
ID | 3784437 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 986247 |
End bp | 987602 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637810949 |
Product | hypothetical protein |
Protein accession | YP_411562 |
Protein GI | 82701996 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
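The coordinate and length fields in the record above are related by simple arithmetic, which can be checked with a few lines of Python. This is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(986247, 987602)  # Start bp and End bp of Nmul_A0867
    print(length, protein_length(length))
```

For Nmul_A0867 this gives 1356 bp and 451 aa, matching the Gene Length and Protein Length fields.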
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.886139 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACTGA ACCCATCATT GATCCCTCAT TCGTCAGAGC AGCACGTTGA CTGGGAGAAA GCGCTGCATG GCGGCAACGA TGCGCTGCTG ACGCAACTTG CAGCCGATGA TGATCCCGCT GCTGCTCTTC TGGAACTTGA AAGCTACATC AATGGGATCT ACCGCGGCAG CCCGTCTCCC TTATCCAAAC CGCTGCCGGA TATCCGCTTG GAGGTGCGAC GGGCGCTACC CTATGTGGAT GAAATTCGGG ATCGCCTGGG ATGGCAGGTG CGGGATTTCG ATCTTGGGTT GCTGCGCTTG ATTCGTGGCT CGAGCACGTT ATCGCCCGTG CTGGGGGCAG GGGTCTCGAT GGATGCAGGG GCGCCCTCCT GGCCCGAACT GGTGCGTTTG ATGCTGGAGG AAACGCTCGA CAAGGGTCTG GAGTTCTACG AGTCCGTTCC CGCCGCTGAC AATCCGGCTC AGCCGCCTAT CGAGTTTCTT CCCGACGGCA CGGTGCGCAC GGGCGGAACC GGAACCTGGC GTTTCGAGCA ACGCGTCAGC GAAGTGAAGC GCTATACGGC TGAGCAGGAA CAAACGGCGA GGAACGTCCT TGCAGAGGTC AAGGCCAAAG GTTCTTCAAC CGATGTCGAG ACGCTCATGC ACGGGGCGCA GGTCTGCTAC GATCTTTGCG GCCAGCATCT TTTTCGTCTG CTCACGAAAA TCATCTATAC ACGTGCGAAA GAGCCAAGCG AAGTCCATCG GGTTATTGCC GAACTGGCGC AGGCGCAAGA GGTACCCGTG CGCGGCCCTG GTTTGTTTCC CGGATGGGAT TCAATTATCA CCTACAATAT CGATGCACTA ATGTCGGAAG CGCTCGCGGA GCAGAAAATA CCGCACGCTG CCTGGGCGAT GAAGGGTGAC AAGCTGCGGG GCAATCCGGA TGAACTCGCT CAGAAGAGTT CATGGCATGA ACCTATCTTT CATCTTCACG GTTTTTCGCC GCGGCGGCTG TTCATGATCA CGAATGTGCG CTTTGTATTT TCCACTTCTC AGTACCTCAC GACGTACAAA GGGCCGCGAT CAAGGATATT GGAAGCGGTC TACGACGAAT TTCTGGCGAA TCCTGTGCGC ATTGCGCTTT ATATAGGGTG TTCGTTTGCC GATGAAGCGA TGAACGGCCT CCTGCGTGAG GCCTTCGCGG AATACCCGGG GCGCTATCAC TATGCATTGC TGAAATGGCC GCGAGACAGG AAAGGAAAGG AGCCGGACAG GAGCGAGATC GCCGCCGAGT CCGCAAAATA TCTCGAGTTT GGGGTGCGGC CGATTTGGTT CGATGATTTC GCCGAATTAC CCGGGTTGAT CCGGCAGCTG CAATGA
|
Protein sequence | MQLNPSLIPH SSEQHVDWEK ALHGGNDALL TQLAADDDPA AALLELESYI NGIYRGSPSP LSKPLPDIRL EVRRALPYVD EIRDRLGWQV RDFDLGLLRL IRGSSTLSPV LGAGVSMDAG APSWPELVRL MLEETLDKGL EFYESVPAAD NPAQPPIEFL PDGTVRTGGT GTWRFEQRVS EVKRYTAEQE QTARNVLAEV KAKGSSTDVE TLMHGAQVCY DLCGQHLFRL LTKIIYTRAK EPSEVHRVIA ELAQAQEVPV RGPGLFPGWD SIITYNIDAL MSEALAEQKI PHAAWAMKGD KLRGNPDELA QKSSWHEPIF HLHGFSPRRL FMITNVRFVF STSQYLTTYK GPRSRILEAV YDEFLANPVR IALYIGCSFA DEAMNGLLRE AFAEYPGRYH YALLKWPRDR KGKEPDRSEI AAESAKYLEF GVRPIWFDDF AELPGLIRQL Q
|
| |
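The sequences above are printed in space-separated blocks of 10 characters. The following is a minimal Python sketch of how such a field might be normalized and its GC fraction computed. The helper names are hypothetical, and the sample string is only the first two blocks of the gene sequence, not the full gene.

```python
def strip_blocks(seq_field: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(seq_field.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    sample = "ATGCAACTGA ACCCATCATT"  # first two blocks of the gene sequence only
    dna = strip_blocks(sample)
    print(len(dna), round(gc_fraction(dna), 2))
```

Applying the same two functions to the full Gene sequence field should give a value that can be compared against the GC content field.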