Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A1392 |
Symbol | |
ID | 3786422 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 1583745 |
End bp | 1585163 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637811480 |
Product | hypothetical protein |
Protein accession | YP_412087 |
Protein GI | 82702521 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000235247 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAGCCA CCATCCAGTT CAGCCAGCCA GACAAGAAAT TCGACATTTT ACAAAAGCTG TTCAGCTTCG TTAAAGGCTT CAAAAATCTG CGCCAGCACA TCCTGGAGCA GGGCATATTG CTGGAAAGGT TGAATTCTGG CGAAATAGAG AATGTGCAGC GAGCGCTTGC AGGTATAAAT TACCTGGAGG GCAGGGTTAT AGACAACTCG GTGCGTATTT TCGTCACCGA CGGGGAACTG CGCGCGCTGT TCGATCTGAT GATGCCCGTT TCCCGCAAAC AGAATGATTT TTCCCGGATC TTGTGGGAAC GTGGTTTTAC GATTGAAGAG CTGTCGCAGG ATCAGGCGGA AAACCTGAGG AATCAGTTTT CGGCCATCGC CACCGTGACC ATCGGGCCGG ATGTTCCACG GACCAGGATT TATACTGTTG GCGGGCAGAT TTTTCAGGAA GATGGAGTGC CGCTGTGCGC AAGGGGCTTT ACCGTTTGCG CATTCGATGC GCTCTCGGTC AATACATTTG TGCGGTGCGG CGCTATGGGC GCGGTTCAGG AGGATGGATT CTACCGTATC GACTATGCCT GGCGCTCGAA TGGACGAATA GGCCCTGATC TGTTTGTGCG GGTATTCGAT CCGGAGGGCG ACATTGTAGC TGAAGCAAGG AAGAATCCGG CTGCGGTTCA GGAATTTCTC GATATTACAG TTAAAACGCT TTGCATCGTT CGGGGCACAA TTCGCCAAGT GGATGACTTC CCGCTTCCTC ATCTCCTCGT CCGTGCATTT GATCGGGACA TGCGAGCGGA GACATTGCTG GGTCAGGCAA TGACGGATGC GGAAGGAAGT TATCAGATTA CATATGGCAC AAACAAGCTC CGGATGAAGG ACAAAGCGGA TCTGATCGTG CGCGTTTTCG AACCGGCCGA TAGCGAAGGC AAGGAAACAG GAGACGAAAT CGGAGCTTCA GAAATCATAT TCAATGCTCC GCTACAGCAA GCGGTCGATC TGGAGGTCAA ATCGGGAAAA TTCCGGGGAC CGTCCGAGTA TGAGCGATAT ATTACGGCCC TGAAGCTGCT CATTGAGGGA GAGCCTGTTC ACCAATTGAC CGATAAGGAT TTAAGTTTCC TTGGAGGTAA GACAGGCATT CCGCTGGAAC ACCTGAATTA TCTTCGGCTG GATGATCAAT GGTGTTTTCA TTACAGCATG GAACCGGCTG TGGTCTATAG TCTATTACGT CAGGGACTTC CTGCCGACCT CCACCACCTG TCGACTGAAA AACCAACCCG CCTGCATGAG GCGCTGCAGG CCTCCCTGGC GCACAACATC GCCCCTGCAG TACTTGCCGA TAAGGTTGAT CAGGCCATAA AGCCACTTCT CTCCCTTGCT GATTCGATGG TCTTTGAGCT TGAAAGAAGG GCAAAATAA
|
Protein sequence | MRATIQFSQP DKKFDILQKL FSFVKGFKNL RQHILEQGIL LERLNSGEIE NVQRALAGIN YLEGRVIDNS VRIFVTDGEL RALFDLMMPV SRKQNDFSRI LWERGFTIEE LSQDQAENLR NQFSAIATVT IGPDVPRTRI YTVGGQIFQE DGVPLCARGF TVCAFDALSV NTFVRCGAMG AVQEDGFYRI DYAWRSNGRI GPDLFVRVFD PEGDIVAEAR KNPAAVQEFL DITVKTLCIV RGTIRQVDDF PLPHLLVRAF DRDMRAETLL GQAMTDAEGS YQITYGTNKL RMKDKADLIV RVFEPADSEG KETGDEIGAS EIIFNAPLQQ AVDLEVKSGK FRGPSEYERY ITALKLLIEG EPVHQLTDKD LSFLGGKTGI PLEHLNYLRL DDQWCFHYSM EPAVVYSLLR QGLPADLHHL STEKPTRLHE ALQASLAHNI APAVLADKVD QAIKPLLSLA DSMVFELERR AK
|
| |