Gene Information |
Locus tag | Nmul_A2652 |
Symbol | |
ID | 3785263 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 3041957 |
End bp | 3042937 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637812741 |
Product | hypothetical protein |
Protein accession | YP_413331 |
Protein GI | 82703765 |
COG category | [S] Function unknown |
COG ID | [COG5563] Predicted integral membrane proteins containing uncharacterized repeats |
TIGRFAM ID | [TIGR02595] PEP-CTERM putative exosortase interaction domain; [TIGR02913] probable extracellular repeat, HAF family |
|
|
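The length fields above follow from the coordinates, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3041957
END_BP = 3042937
GENE_LENGTH_BP = 981
PROTEIN_LENGTH_AA = 326


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP))   # compare with Gene Length
    print(protein_length(GENE_LENGTH_BP))  # compare with Protein Length
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields of the record.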
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGTCGGCA CTTCGACAAC AGCCCAAGGC GAGACTCATG CCTTTCTAAC CGGGCCCAAC GGCATGGGAA TGAATAGAGT GGGCATAACC ACTGACCCCT TGTTAAGCTC CTTTAGTGGG GCTGTCGGAA TCAACAATAC TGGACAGGTT ATCGGAAATT ATCCGACTAC GTCCGATCCA GAGGGCTTTT TTACCTGGAA CCGTTCTTTC ATTACTGGCC CCAATGGCGT AGGTACAACC GAACTCGGAT TTGAAGCCAC CGGAATTAAC GATAGGGGGA CAGTAGTGGG GTGGGATTCC GCTTTCCCTA CCGAGTTTCT CTCCTCCGTT TATCCGGCTG TAGTATTCAG GGCCGGGAGA GAATATCAGG TCAGTGGTTT GCTTGGACCT TACGACAATT ACGATCAGTT TCTGGCTATT AATAACTTAG AACAAATTGT GGGAAAGGCA TCCGGAGTAC ATGCTGTTCT CGGCTTCTCC GATACCCCCT CGGGATTCTG GACTGACATA GGTGCCTTGA CTGGAAATTA CTATAGCGAG GCTGTTGGGA TTAACGATGC AGGACAGGTA ATAGGTTCTT ACCAGGCTAA CGGTGTCTCC CATGCTTTCA TAACCAATGC GAGTGCCACG GAACTGACCG ATCTCGGCGC ACTAGGAGGC TTTGGCAGCG AGGCTCTTGG AATTAACGAT ATCGGACAGG TAGTTGGATG GGCCAATACG TCGGATGGAG ATCGGCATGC GTTTTTTACC GGTCCTAACG GAGAAGGCAT GATAGACCTC AATTCATTGG TTCACCTGTC CGAGGGGGGC ATCCTCACCG CCGCTATGGG TATAAATAAT GAGGGACAAG TTATTGTTCT GGCTATCCCG GAATCGGAAA TCTACGCTCT GATGCTTGCT GGCCTAGGCT TGATCGGTTT CATGGTACGA CGCAAGAAAG AAGAAAATCT ACTGAGAAGG CAAAGGACAC ACGTCGTGTA G
|
Protein sequence | MVGTSTTAQG ETHAFLTGPN GMGMNRVGIT TDPLLSSFSG AVGINNTGQV IGNYPTTSDP EGFFTWNRSF ITGPNGVGTT ELGFEATGIN DRGTVVGWDS AFPTEFLSSV YPAVVFRAGR EYQVSGLLGP YDNYDQFLAI NNLEQIVGKA SGVHAVLGFS DTPSGFWTDI GALTGNYYSE AVGINDAGQV IGSYQANGVS HAFITNASAT ELTDLGALGG FGSEALGIND IGQVVGWANT SDGDRHAFFT GPNGEGMIDL NSLVHLSEGG ILTAAMGINN EGQVIVLAIP ESEIYALMLA GLGLIGFMVR RKKEENLLRR QRTHVV
|
| |
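The gene sequence starts with GTG while the protein starts with M, which is consistent with translation table 11 (bacterial, archaeal and plant plastid code), where GTG can act as an initiation codon and is then read as methionine. The following is a minimal sketch of that translation and of a GC-content calculation. It is not the pipeline that produced this record: the function names are hypothetical, the codon-to-residue map is the standard one that table 11 shares, and the start-codon set is taken from the NCBI definition of table 11.

```python
BASES = "TCAG"
# Standard codon-to-residue map (shared by table 11), in TCAG order.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons listed for NCBI translation table 11 (assumption of this sketch).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a CDS; a recognised start codon in first position becomes M."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residues.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First five codons of the gene sequence in this record.
    print(translate_cds("GTGGTCGGCA CTTCG"))
```

Passing the full Gene sequence field to `translate_cds` and comparing the result with the Protein sequence field (after removing its spaces) is one way to cross-check the two fields against each other.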