Gene Information |
Locus tag | Nmul_A1462 |
Symbol | |
ID | 3785553 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 1670005 |
End bp | 1671990 |
Gene Length | 1986 bp |
Protein Length | 661 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637811550 |
Product | putative phage portal protein |
Protein accession | YP_412157 |
Protein GI | 82702591 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
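The coordinate and length fields above are mutually consistent, which a little arithmetic can confirm. The sketch below assumes that the start and end positions are 1-based and inclusive (a common convention for such records, but not stated here) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(1670005, 1671990)
print(length)                  # 1986
print(protein_length(length))  # 661
```

Both results match the listed Gene Length (1986 bp) and Protein Length (661 aa), which is what one would expect for an unspliced, non-pseudo CDS.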
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.584586 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGAGG CAGGTGTTAC GGCAGATATT TCGATTGAGG CTTACGACAA AATATGCCGC GATATTCGGG ATCAACCGAA GTGGCGGTCG GACTCGGACA CGGACTGTGA TTATTACGAC GGCGCACAGA CCAGCTCGGA GGTGATCGGA CGACTGAAGA TGGCCGGTAT CCCGCCCCAG GATTCCAATC TGATCAAACC GACGATCAAC GCGGTATTGG GATTGGAAGC GCGTAGCCGA ACGGATTACA GGGTGACGGC GGATGATGAG AGTCAGGCAG AAATTGCGGA AGGTCTTTCC GCAAAAATCA AGGAGGCCGA AACCGAGTCA CGCGCTGACC GGGCCATGTC AGACGCGTAT TCCAGCATGA TTCGGGCGGG CATTGGTTGG GTAGAAATAT CCCGGGAATT TGATGCGCTC AAATATCCAT ATCGTGTCCG GGAGGTGCAC CGCAACGAAA TTTATTGGGA CTGGAGTTCA AGGGAGCCGG ATTTGTCCGA TGCGCGATAT TTGCGACGCG ATAAGTGGAT AGACCGGCTT CAGGCCGCCC TCATGTTCCC CGACCGGGCG GAAATCATTG CAAATAGCTG GAAGGGGTGG AATGGGACTG ACGTGTATGA AGGTTATGAT AACGGCTTGG CGCGAGCATA TGAAATCGAG CAGGCATGGA ATCGCACCCA GGAAGATTAC CTGAACCGGA ACTCGGGTAT GGTCAGGCTC TCCGAGTTGT GGTATCGGCA TTTCGAGGAT GCATACGTCC TGGTATTGAC CGATGGAAAG ATAATCGAAT ACCGCGAGGA TAACCCATAC CATCAGGCAG CGGTTGCCCA AGGTCTCGTG CAGGTCCAGA AATCGGTTCT CACCAGGATG CGGGTCTCGA TCTGGTTGGG GCCGCATAAA CTCATGGATG TTCCGAGTCC ATTACCCCAT TCAGATTTTC CATATGTTCC ATTCTGGTGC TTTCGCAAGG ATAGGAGTCG AGCTCCCTAC GGATTGATTC GTGACATGCG GGGACCGCAG GATCAGATTA TCGATCTGGA TATTCTTCTC TATGAAGTCC TCAATTCGGT AAAAGTCGAA GTGGACAATG ATGCGCTTGA TCTCAGCCAG AATTCCTATC AGGAGGTGGC GAATAATATA AGCAGTCTGC GCTCGATGAC CATCCTCAAC TCTCAGCGAA GGAATACCAG TGGCTTCAGG GTAATACGCG AGCACCAGCT TGCCGCTCAA GTGTTTCAGC TCGTGCAGGA ACGCAAGCGC AGGATCGAGG AAGTGGGCGG AATCTACCGC ACTATGCTGG GAGCGCATAC CTCAGCAAGC AGCGGTGTGG CGATCAACAG TCTCGTGGAA CAGGGGTCGA CCGTGTTGGC GGAGCCCAAT GATAATTTTC GACATGCTCG CCGACTTGTT GGCCAGCAAC TTCTTGCGCT GATCAAGGAG GACCTGATTG GAAGACCCGC GCAAATTACA GTCCAGCAAG GTAATAAACC AAAGGTGGTT TATTTCAATC GCCAGTTGGA TGACGGGTTG GTACACAACG ACATTGCTTC TGCAATGGTC AAAGTCGTGC TTGAGGATAT TCCTGCTACT CCGACATTTC GCGCGCAGCA GTTGCAAGCT ATGTCGCAAA TCGTACAAGC CGCGCCCCCT CGGTTTCAGG CCGTACTTTA TCCGGTAATG CTCGAGCTGT CCAATGTCCC GAACCGGCAT GAGTTGGCCC ATCAATTAAG GCAAGTGGCA GGCATTGACG ATAATTCCCC GTTTCAGGCT TTGCAGCAGC AAGTGCAGCA GATGATACAG 
GAGGCGCAAG GGCGGATTGG GGAGTTGCAG CAGAGATTGG GGGAGGCTGA GCTGCAGCTT AAGGCATGCG GACTTGACTT GAAAGCGCGT GCGCAGGCGC ATAAAGAGGA TATGGATGAT GCTAAATTCC GTCTGGAGGC GGAGAAGATT GCTGATTGTC AGATAGGTAA CGCATTATCC GCATAA
|
Protein sequence | MTEAGVTADI SIEAYDKICR DIRDQPKWRS DSDTDCDYYD GAQTSSEVIG RLKMAGIPPQ DSNLIKPTIN AVLGLEARSR TDYRVTADDE SQAEIAEGLS AKIKEAETES RADRAMSDAY SSMIRAGIGW VEISREFDAL KYPYRVREVH RNEIYWDWSS REPDLSDARY LRRDKWIDRL QAALMFPDRA EIIANSWKGW NGTDVYEGYD NGLARAYEIE QAWNRTQEDY LNRNSGMVRL SELWYRHFED AYVLVLTDGK IIEYREDNPY HQAAVAQGLV QVQKSVLTRM RVSIWLGPHK LMDVPSPLPH SDFPYVPFWC FRKDRSRAPY GLIRDMRGPQ DQIIDLDILL YEVLNSVKVE VDNDALDLSQ NSYQEVANNI SSLRSMTILN SQRRNTSGFR VIREHQLAAQ VFQLVQERKR RIEEVGGIYR TMLGAHTSAS SGVAINSLVE QGSTVLAEPN DNFRHARRLV GQQLLALIKE DLIGRPAQIT VQQGNKPKVV YFNRQLDDGL VHNDIASAMV KVVLEDIPAT PTFRAQQLQA MSQIVQAAPP RFQAVLYPVM LELSNVPNRH ELAHQLRQVA GIDDNSPFQA LQQQVQQMIQ EAQGRIGELQ QRLGEAELQL KACGLDLKAR AQAHKEDMDD AKFRLEAEKI ADCQIGNALS A
|
| |
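The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short script. This is a minimal sketch, not the tool that produced this record. It assumes the gene sequence is given in coding orientation (it begins with ATG even though the gene lies on the minus strand), uses the amino acid assignments of NCBI translation table 11, and ignores the alternative start codons that table 11 permits. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids of NCBI translation table 11, with codons ordered
# TTT, TTC, TTA, TTG, TCT, ... ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(seq)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence shown above.
first_blocks = "ATGACTGAGG CAGGTGTTAC GGCAGATATT"
print(translate(first_blocks))  # MTEAGVTADI
```

The output matches the first block of the protein sequence in this record. Passing the full gene sequence to `gc_percent` would be the way to compare against the listed GC content, under the same assumptions.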