Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_3038 |
Symbol | |
ID | 3704337 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 3433639 |
End bp | 3435177 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637739512 |
Product | virulence factor MVIN-like |
Protein accession | YP_345009 |
Protein GI | 77166484 |
COG category | [R] General function prediction only |
COG ID | [COG0728] Uncharacterized membrane protein, putative virulence factor |
TIGRFAM ID | [TIGR01695] integral membrane protein MviN |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00171259 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCAGCA CCCCCTTACT TAAATCGACT GCTGTAGTCG GCAGCGCTAC TCTCCTCTCA AGAGTGCTTG GTTTTATCCG CGATGTGGTC ATCGCCCAAA CTTTTGGGGC AGGAGCAGCT GCGGATTCTT TTTTTGTAGC CTTTAAAATT CCCAACTTCC TGCGGCGTTT ATTTGCGGAG GGGGCTTTTT CTCAAGCATT TGTGCCGGTA CTCTCAGCCT ATCAAGTACG TGGTGATTTC AACGAGATTC AGCAGCTCGT CAATCGGGTG GCGGGAACCT TGGGACTGGT TCTACTGCTG GTCACTCTCA CTGGGGTTAT AGGCGCCCCC TTCTTGGTAA TGGTCTTTGC TCCAGGTTTT ATAGAAGAGC AAGACAAATA CGCACTCACT GTCCATCTAC TGCGAATAAC CTTCCCCTAT TTATTATTCA TTTCCTTGAC GGCTTTTGCT GCCGGTATTC TCAATACCTA TAAACAATTT GGCGTACCTG CCATTACGCC TATTTTCCTC AATTTAGCTC TTATTGCCGC AGCCCTGTGG TTTGCTCCCC AGATGGAAAT TCCAGTGACT GCTCTTGCAT GGGGGGTCTT TTTTGCCGGT TTAATACAGC TATTATTCCA ATTTCCCTTT CTCGCCCGCT TAAATCTCCT GCCAAAATTC CGCCCCCGCT GGAAAGATCC TGGCGTGCAG CGGATCTTTA AGCTTATGTT ACCCGCCATC GTTGGAAGTT CAGTAGCTCA AATTAATCTG CTTATCGATA CCCTGCTTGC CTCATTTTTA GTCACCGGCA GTGTGTCCTG GCTTTATTAT TCGGATCGGC TGGTAGAGTT TCCCCTAGGC GTTTTCGGCA TTGCCTTAGC CACAGTTATC CTTCCTAGCC TTTCTGAAAA ACACGCTCGA GCATCAGGCG AGTCCTTTGC CCGCACGCTC GATTGGGCCT TGCGCTGGGT TTTTCTTATT GGTGCGCCAG CCGCAATAGG GCTAGCTATA CTTGCGGAAC CAATCCTTAC CACCTTGTTC CAATATGGCG AGTTCGAGAG CCACGATGTT ATCATGGCTT CCCGTAGTCT AATTGCCTAT AGCTTTGGCC TACTTCCTTT TATTTTGATT AAAATACTGG CGCCTGGATT TTATGCCCGG CAGAATACGA AAACGCCGGT GCGAATCGCT ATCATCGCCA TGATTGCTAA CATGGTATTA AACGGAGTCC TTATCTTTCC CCTGGCTCAT GCGGGGCTCG CTCTCGCTAC TTCCCTTTCC GCCTGGCTTA ACGCAAGCCT GCTCTTTTTC ACCTTAAAAC GGCAAGGAAT CTATCAACCT CAACCAGGCT GGTTGTGGTT TGGCTTACGG ATACTTATTG CTGGTAGTTT CATGGCCGTC ACTCTGCTTT GGCTCATGCC ATCGCTAACC AATTGGCTAA ACTGGGAAGC AGCCGTCCGT ACCGCGCACA TTATGCTGCT AATAGGAACT GCCGTGCTTG TTTATTTTGG CAGCTTACTC CTCATGGGCC TTCGTCCGCG AATGCTAACG TCCGCCTGA
|
Protein sequence | MRSTPLLKST AVVGSATLLS RVLGFIRDVV IAQTFGAGAA ADSFFVAFKI PNFLRRLFAE GAFSQAFVPV LSAYQVRGDF NEIQQLVNRV AGTLGLVLLL VTLTGVIGAP FLVMVFAPGF IEEQDKYALT VHLLRITFPY LLFISLTAFA AGILNTYKQF GVPAITPIFL NLALIAAALW FAPQMEIPVT ALAWGVFFAG LIQLLFQFPF LARLNLLPKF RPRWKDPGVQ RIFKLMLPAI VGSSVAQINL LIDTLLASFL VTGSVSWLYY SDRLVEFPLG VFGIALATVI LPSLSEKHAR ASGESFARTL DWALRWVFLI GAPAAIGLAI LAEPILTTLF QYGEFESHDV IMASRSLIAY SFGLLPFILI KILAPGFYAR QNTKTPVRIA IIAMIANMVL NGVLIFPLAH AGLALATSLS AWLNASLLFF TLKRQGIYQP QPGWLWFGLR ILIAGSFMAV TLLWLMPSLT NWLNWEAAVR TAHIMLLIGT AVLVYFGSLL LMGLRPRMLT SA
|
| |