Gene Noc_3038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_3038 
Symbol 
ID3704337 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp3433639 
End bp3435177 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content49% 
IMG OID637739512 
Productvirulence factor MVIN-like 
Protein accessionYP_345009 
Protein GI77166484 
COG category[R] General function prediction only 
COG ID[COG0728] Uncharacterized membrane protein, putative virulence factor 
TIGRFAM ID[TIGR01695] integral membrane protein MviN 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00171259 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCAGCA CCCCCTTACT TAAATCGACT GCTGTAGTCG GCAGCGCTAC TCTCCTCTCA 
AGAGTGCTTG GTTTTATCCG CGATGTGGTC ATCGCCCAAA CTTTTGGGGC AGGAGCAGCT
GCGGATTCTT TTTTTGTAGC CTTTAAAATT CCCAACTTCC TGCGGCGTTT ATTTGCGGAG
GGGGCTTTTT CTCAAGCATT TGTGCCGGTA CTCTCAGCCT ATCAAGTACG TGGTGATTTC
AACGAGATTC AGCAGCTCGT CAATCGGGTG GCGGGAACCT TGGGACTGGT TCTACTGCTG
GTCACTCTCA CTGGGGTTAT AGGCGCCCCC TTCTTGGTAA TGGTCTTTGC TCCAGGTTTT
ATAGAAGAGC AAGACAAATA CGCACTCACT GTCCATCTAC TGCGAATAAC CTTCCCCTAT
TTATTATTCA TTTCCTTGAC GGCTTTTGCT GCCGGTATTC TCAATACCTA TAAACAATTT
GGCGTACCTG CCATTACGCC TATTTTCCTC AATTTAGCTC TTATTGCCGC AGCCCTGTGG
TTTGCTCCCC AGATGGAAAT TCCAGTGACT GCTCTTGCAT GGGGGGTCTT TTTTGCCGGT
TTAATACAGC TATTATTCCA ATTTCCCTTT CTCGCCCGCT TAAATCTCCT GCCAAAATTC
CGCCCCCGCT GGAAAGATCC TGGCGTGCAG CGGATCTTTA AGCTTATGTT ACCCGCCATC
GTTGGAAGTT CAGTAGCTCA AATTAATCTG CTTATCGATA CCCTGCTTGC CTCATTTTTA
GTCACCGGCA GTGTGTCCTG GCTTTATTAT TCGGATCGGC TGGTAGAGTT TCCCCTAGGC
GTTTTCGGCA TTGCCTTAGC CACAGTTATC CTTCCTAGCC TTTCTGAAAA ACACGCTCGA
GCATCAGGCG AGTCCTTTGC CCGCACGCTC GATTGGGCCT TGCGCTGGGT TTTTCTTATT
GGTGCGCCAG CCGCAATAGG GCTAGCTATA CTTGCGGAAC CAATCCTTAC CACCTTGTTC
CAATATGGCG AGTTCGAGAG CCACGATGTT ATCATGGCTT CCCGTAGTCT AATTGCCTAT
AGCTTTGGCC TACTTCCTTT TATTTTGATT AAAATACTGG CGCCTGGATT TTATGCCCGG
CAGAATACGA AAACGCCGGT GCGAATCGCT ATCATCGCCA TGATTGCTAA CATGGTATTA
AACGGAGTCC TTATCTTTCC CCTGGCTCAT GCGGGGCTCG CTCTCGCTAC TTCCCTTTCC
GCCTGGCTTA ACGCAAGCCT GCTCTTTTTC ACCTTAAAAC GGCAAGGAAT CTATCAACCT
CAACCAGGCT GGTTGTGGTT TGGCTTACGG ATACTTATTG CTGGTAGTTT CATGGCCGTC
ACTCTGCTTT GGCTCATGCC ATCGCTAACC AATTGGCTAA ACTGGGAAGC AGCCGTCCGT
ACCGCGCACA TTATGCTGCT AATAGGAACT GCCGTGCTTG TTTATTTTGG CAGCTTACTC
CTCATGGGCC TTCGTCCGCG AATGCTAACG TCCGCCTGA
 
Protein sequence
MRSTPLLKST AVVGSATLLS RVLGFIRDVV IAQTFGAGAA ADSFFVAFKI PNFLRRLFAE 
GAFSQAFVPV LSAYQVRGDF NEIQQLVNRV AGTLGLVLLL VTLTGVIGAP FLVMVFAPGF
IEEQDKYALT VHLLRITFPY LLFISLTAFA AGILNTYKQF GVPAITPIFL NLALIAAALW
FAPQMEIPVT ALAWGVFFAG LIQLLFQFPF LARLNLLPKF RPRWKDPGVQ RIFKLMLPAI
VGSSVAQINL LIDTLLASFL VTGSVSWLYY SDRLVEFPLG VFGIALATVI LPSLSEKHAR
ASGESFARTL DWALRWVFLI GAPAAIGLAI LAEPILTTLF QYGEFESHDV IMASRSLIAY
SFGLLPFILI KILAPGFYAR QNTKTPVRIA IIAMIANMVL NGVLIFPLAH AGLALATSLS
AWLNASLLFF TLKRQGIYQP QPGWLWFGLR ILIAGSFMAV TLLWLMPSLT NWLNWEAAVR
TAHIMLLIGT AVLVYFGSLL LMGLRPRMLT SA