Gene Nmul_A2510 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2510 
Symbol 
ID3786635 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2869133 
End bp2871202 
Gene Length2070 bp 
Protein Length689 aa 
Translation table11 
GC content55% 
IMG OID637812601 
ProductTonB-dependent receptor 
Protein accessionYP_413191 
Protein GI82703625 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0311712 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTGTA CAAATTGTGA TTTTGCGTGC GGACCTGTTC TCCGTGCGCA TGAAGGGTCG 
AGTTCATACG CTGCATTGGC GATGGCAACG CTGCTCGCGC TCAATGCATC CTCCGTCCAG
GCTCAAAAAG CGGAAAAAAT TGAGCTACTC AATCCCGTAG TTGTCACCGC AACCCCTTTT
GAGGACCGCA CTGAGCTGGA TATAGCCCAG CCCGTCTCCG TATTGAAGGG AGATGATTTG
CGGAGGAAAC GCGAAGCCAG CCTGGGGGAT ACCCTTTCCC ATGAACTGGG CGTGACATCA
AGTTTTTTCG GACCGGGCGC CGGTCGTCCC ATTATTCGCG GTCTCGATGG GCCACGCGTC
AGGGTGTTGG AGAACGGCAT CGGTACGCTC GATATCTCTT CAATCAGTCC AGACCATGCC
GTTACTGCCG AATCTCTCAA TGCGTCGCAA ATCGAAATCC TCCGGGGTCC ATCAACTCTT
CTCTACGGCA GCGGCGCGTC GGGTGGCGTG GTCAACGTCG TGAACGGCCG TATTCCAAAA
CAACTGTTCA AGTCGATAAA AGGCAATATC GAAGCGCGTG GGAATACCGC GACCGAGGAG
CGTGCCGGCG CCTTCAATGC AAGCGGAAGT ATCGGGCAAG CATCCTGGAG CGTCGGGGGG
TTCAAGCGCA AAACCGACGA TTACCATATC CCGGGACGGG CGAATGAGAG TGATCCCGGC
AGCCGGAAAG GAATTGTCGG GAACAGCGCC ATCAATTCCG GCGGCTTGTC CGGGGGGGGC
TCGTATGTCG GGGAAAGAGG TTTTGTAGGC GGCTCGATCT CCCGAATGGA CAACGAATAT
GGCATTCCCG GACCGGAAGG TTCGAAGATC GATCTCAAGC AGACGCGCTA TGATCTGGCG
GGTGAACTGG ACAAACCCAT GGCCGGTCTC GAGAAGCTCA AGGTGCGCAT GGGTTACAAC
GATTACAAGC ACAACGAGAT CGAAAGCACG GGTGAAGTTG CGACCCGTTT CAAGAACCAG
GGACTGGAGA GCCGCGCCGA ACTCACGCAT GCTACTGTGG CAAACTGGAA CGGCGTATTC
GGTGTGCAGT TTCGTAACCG ACATTTTTCC GCCTTGGGCG AAGAGACCGT GGTACCCGTC
ACCAATTCCC ATTCGGTAGG TGTGTTCCTG GTGGAAGAAC GCAATTGGAA TCGATGGCGC
CTGGAACTGG GCGGTCGCGG CGAGTACGCC GCCCAGAATC CCAAAGACGG TCATCCCTCA
CGTTCCTTTG GTCTCTACAA TGTCTCCGCC GGGATTTTAT GGAAATTCCT GGATGGCTAC
GGATTGTCAT TGACTGGCAT TCGCGGTCAG CGCGCGCCCT CGACAGAGGA ACTGTATATC
CATGGCGCAC ACCGTGGAAC CGCCACCTTC CAGAGCGGAA ATAATGGATT GAGAAGCGAG
ACCACAAACA ATCTCGATCT GGCACTGCGC AAGACCAGTG GTATGGTGAC GTGGAAGATC
AATATATTCC ATAACTGGAT CGACAATTAT ATTTTCGTCC AAAGTGCGGA CACGAATGGA
GACGGTGTGG CCGATCGCGT GAATCAGGAA GGGATGCTGG AGCCCACGGG TGAGTTTCTG
GTACAGAACT TTGCACAAGG CGGAGCCAGA TTTTACGGTG CCGAAGCAGA AACCGTTCTC
ACGTTGAAGC CGGACGAAAT CGACCTGCGC CTGTTCGCGG ATTACGTAAG GGGAAAACTT
GATAACGGAG GGAATGTGCC GCGCATCACG CCGTTACGCT TCGGCCTGGA ATTCAATCAC
AGAACCGGCC CCTGGACGTC GAATATAAGT GCCACTCGGG TGATGCGCCA GAACGATCTG
GCGGAACTGG AGACAAGCAC GCCCGGGTAT ACGATGGTGA ACATGGAGGT AAGTTACCGC
ATCAAGAAGA CGCGCTCCAA CGGCATTCGA ATCTTCCTCC AGGGCAGAAA CCTGCTCGAT
GAAGAAATGC GTGTTCATAC CTCTTTCCTG AAGAATTTCG CACCGCTACC GGGCAGGGCG
CTCGTCGCTG GGCTGAGAGG GGAGTTTTAG
 
Protein sequence
MSCTNCDFAC GPVLRAHEGS SSYAALAMAT LLALNASSVQ AQKAEKIELL NPVVVTATPF 
EDRTELDIAQ PVSVLKGDDL RRKREASLGD TLSHELGVTS SFFGPGAGRP IIRGLDGPRV
RVLENGIGTL DISSISPDHA VTAESLNASQ IEILRGPSTL LYGSGASGGV VNVVNGRIPK
QLFKSIKGNI EARGNTATEE RAGAFNASGS IGQASWSVGG FKRKTDDYHI PGRANESDPG
SRKGIVGNSA INSGGLSGGG SYVGERGFVG GSISRMDNEY GIPGPEGSKI DLKQTRYDLA
GELDKPMAGL EKLKVRMGYN DYKHNEIEST GEVATRFKNQ GLESRAELTH ATVANWNGVF
GVQFRNRHFS ALGEETVVPV TNSHSVGVFL VEERNWNRWR LELGGRGEYA AQNPKDGHPS
RSFGLYNVSA GILWKFLDGY GLSLTGIRGQ RAPSTEELYI HGAHRGTATF QSGNNGLRSE
TTNNLDLALR KTSGMVTWKI NIFHNWIDNY IFVQSADTNG DGVADRVNQE GMLEPTGEFL
VQNFAQGGAR FYGAEAETVL TLKPDEIDLR LFADYVRGKL DNGGNVPRIT PLRFGLEFNH
RTGPWTSNIS ATRVMRQNDL AELETSTPGY TMVNMEVSYR IKKTRSNGIR IFLQGRNLLD
EEMRVHTSFL KNFAPLPGRA LVAGLRGEF