Gene Information |
Locus tag | Nmul_A1444 |
Symbol | |
ID | 3784637 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 1654339 |
End bp | 1656375 |
Gene Length | 2037 bp |
Protein Length | 678 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637811532 |
Product | hypothetical protein |
Protein accession | YP_412139 |
Protein GI | 82702573 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
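The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1654339
end_bp = 1656375

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = codon_count - 1

print(gene_length)     # 2037, matches "Gene Length"
print(remainder)       # 0
print(protein_length)  # 678, matches "Protein Length"
```

Under these assumptions the record is internally consistent: 2037 bp is 679 codons, one of which is the stop codon.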
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCGCAT TCAGGATTTC CGGATTCTCC GGGCTTGTTC CGCGGCTGGC AAAGCACTTG CTCAGCTCGA ACCAGGCGCA GACGGCGACC AATTGCAACC TTGCCGCAGG CGACTTGCGG CCCCGAAACG CGCCGCTACT TGTTTTTTCC CCCCAGATCG ATGGAGAAAT CCGGTCGATG TTCAGGATGG AAAAGGATGG GAGCGAAAAG TGGCTCGCCT GGAGCAGGGA TGTCGATGTA GCCCGTTCGC CTGTTGCAGG GGATACGCTT CAGCGATTCT ACTACACGGG CGACGGGGAG CCGCGCACTT CCAACTTTGA AATGGCAACA GCGGGCGCCA ATGCTCATCC GTCCGCCTGC TATGTACTTG GGGTTACACC CCCGGTCAGC GAACCGCTGG TGAGCGCGTC GGGTGGCAGC GGAGTAGCGA CTTCCCGTGC CTATGTATAT ACATTTGTCA CGCAATGGGG GGAAGAGTCG CAACCTTCCC CCGCTTCGAT AGTGACCAGC GGAAAGATAG ATGCAACCTG GATGATTTCA AATCTGGACG CAGCACCCCC CAATTCCGGG GCAATTACTG CTGTTTCCAG GAATTCTCCA GTTGCCGGGC AAATGGAAAT CAGCCTTGAT ACCGTCTTCG GTTTAAGAGC GCACGAGGAA ATCCAGTTTG AATCGGTGTC GGGCATGACT GACTTGAACG GTCGATTCAT TCTGATGAGC GTGGATCCGG TAACCAAAAA GGTGGCCATA TCCCTTTCTA CAGACCAGCT CTACGCCGGA GGTGGGAGGT GGGAGCGGCA GGCTCCACAT AATACCGGGG GGATGAACAA GCGCATCTAC CGGACGCTTA CCACGTCGTC AGGGACCGAA TATCGCTATG TCGCAACACT TTCCGGGATT ACAAAAAACT ACAGCGATAC TGTCCCTGAC ACGGTTATTG CGCTGGGAGA AACATTGCCC TCCACAAACT GGGAAATGCC CCCGGCAAAC ATGAGAGGCA TCGTTGTACT TGCGAATGGA ATTGCCGCAG CATTCGCGGG CAATGAGGTG CTTTTCTCGG AACCGTTCAA ACCTTATGCC TGGCCCACTT CGTATCGGCA AACATACGAC CAGGAAATTG TAGCCATCGC TGCAATGGGA ACCACACTCG TCGGCATGAC CAGGGGCAAT CCCTTCACCC TGACCGGCGT TGAGCCCGTA ACCATGGGCG GAGGAATGGA GAAGCTGGGA GTGGCGTGGC CCTGCATGTC GAAACGAGGA GTAGCGAATT TTGCATTTGG CATCGGGTAT CCCGCTCCGC AAGGGATGGT AATGATCGGG ACAAGCAGCG ATATTGTCAC AAAAGATTTG TTTACCCAGA AGGAGTGGTC CGAACTAAAC CCCGATACCT TTATCGCGAC CTCTGCCGAT AACCGCTATT ACTGCGGCTA TTCGGCTGGC GACAGCTCCC TGATGTTCGT GATCGATAAG GCGGAGGATG CATCCTTTAC AAAAATCAAC CAGAACATCA GTTGCATCTG GACGGATCCC ATCACCGGCA AGCTCTACAT CGCCACAAAC AAGAAAATCT ACGAATGGGA AGGGGATACG GGAACTAAGC TTTTCTATGA GTGGAAAAGC AAACGGTTCG TTACTGCGTC ACCGGTTAAT TATGGTGCGG GGAAGATCGA TGCCGATTTT AAAATGACGG AAGAAGAAAG AGCAGCGGCG CAATCCTCCT ATAAGGAGAC TATCGCTGCC AACCAGACAT TGATCAGTTC TTATTCCATG GATGACGGAC TGGCAGATAC ATGCCTCGGT 
GAATACGAGA TCGGGGGTGA TGCGACTCAG GATATTCCCC TTTTATCCAT AGATTCTCTG CAATTTCAAT TATGGAGCGA TGGCGTGCCG AAATTTACCA AACAGGTCAA GAATAACAGG GCATTTCGGC TTCCCGGTGG CTATAAGGCC GATAACGTAG AGTTTGTGCT ATCCGGCAAT GTGAAGGTAA ACAGCGTCGT CCTGGCTGAA ACAATGGATG GATTGAAGCA GGCATAG
|
Protein sequence | MSAFRISGFS GLVPRLAKHL LSSNQAQTAT NCNLAAGDLR PRNAPLLVFS PQIDGEIRSM FRMEKDGSEK WLAWSRDVDV ARSPVAGDTL QRFYYTGDGE PRTSNFEMAT AGANAHPSAC YVLGVTPPVS EPLVSASGGS GVATSRAYVY TFVTQWGEES QPSPASIVTS GKIDATWMIS NLDAAPPNSG AITAVSRNSP VAGQMEISLD TVFGLRAHEE IQFESVSGMT DLNGRFILMS VDPVTKKVAI SLSTDQLYAG GGRWERQAPH NTGGMNKRIY RTLTTSSGTE YRYVATLSGI TKNYSDTVPD TVIALGETLP STNWEMPPAN MRGIVVLANG IAAAFAGNEV LFSEPFKPYA WPTSYRQTYD QEIVAIAAMG TTLVGMTRGN PFTLTGVEPV TMGGGMEKLG VAWPCMSKRG VANFAFGIGY PAPQGMVMIG TSSDIVTKDL FTQKEWSELN PDTFIATSAD NRYYCGYSAG DSSLMFVIDK AEDASFTKIN QNISCIWTDP ITGKLYIATN KKIYEWEGDT GTKLFYEWKS KRFVTASPVN YGAGKIDADF KMTEEERAAA QSSYKETIAA NQTLISSYSM DDGLADTCLG EYEIGGDATQ DIPLLSIDSL QFQLWSDGVP KFTKQVKNNR AFRLPGGYKA DNVEFVLSGN VKVNSVVLAE TMDGLKQA
|
| |
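The gene and protein sequences above can be related to each other by translating the CDS codon by codon. The sketch below is an illustration only, not the method used to produce this record. It assumes that translation table 11 uses the standard codon-to-amino-acid assignments and that an alternative initiation codon such as GTG is read as methionine at the first position. The helper `translate_cds` and the set `ALT_STARTS` are hypothetical names, and `ALT_STARTS` lists only a few common bacterial start codons rather than every start codon allowed by table 11. Only the first 30 and last 18 bases of the gene sequence are used here.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a small, non-exhaustive set of bacterial initiation codons.
ALT_STARTS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in ALT_STARTS:
            aa = "M"  # an initiation codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
first = translate_cds("GTGAGCGCAT TCAGGATTTC CGGATTCTCC")
# Last 18 bases of the gene sequence, ending in the TAG stop codon.
last = translate_cds("GGATTGAAGCAGGCATAG")

print(first)  # MSAFRISGFS
print(last)   # GLKQA
```

Both fragments agree with the start (`MSAFRISGFS`) and end (`GLKQA`) of the listed protein sequence, which suggests the gene sequence is given in coding orientation even though the gene lies on the minus strand of the replicon.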