Gene Nmul_A2334 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2334 
Symbol 
ID3785324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2657292 
End bp2659559 
Gene Length2268 bp 
Protein Length755 aa 
Translation table11 
GC content54% 
IMG OID637812422 
Productextracellular solute-binding protein 
Protein accessionYP_413017 
Protein GI82703451 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTTTACCC AATTTCTGAA TAAGGCCCCA ACCTTGCCAG CCACGCACCT CCTTCCCGCA 
TTCCTTGCTG TTCTGGTGTG CGCCTGCAGC GGCAACCCGT GGAACAGTCC ATATCCTGCC
GCGGATGCGG AAAAAAATAT ACTTTATGCA GCTTTTGCCG AACGCCCCAA GCATCTGGAC
CCCGTGCAGT CCTACAGTTC CAATGAGATT CTTTTTACGG CGCAGATATA TGAACCGCCG
CTGCAATACC ATTACCTGAA ACGCCCCTAC GAATTGATTC CCGCTACCGC GACGAAAATG
CCGGAGGTGC ATTATTTCGA TGAAAAAGGC GCGCGCCTGG GCGACAATGT GGACGCCGCT
CAGATCGCCT TTACCGTTTA TGAAATTTCA ATCAAGTCCG GTATACGTTA TCAACCTCAT
CCGGCCTTTG CCAGGGACTC ACAGGGAAAG TTCCTTTATC ACCGGCTTGG CGAGGAGGAC
GTACGTGGCA TTTACAAGTT GAGCGATTTT CCACAAACCG GCTCCCGGGA ACTCACGGCT
GCGGATTACG TTTTCCAGGT CAAGCGGCTG GCTCATCCCC GGCTGCACTC TCCCATTTTT
GGCCTGATGT CCGATTACAT CGTGGGCCTC AAGGAATATG CGGCGATGCT CGAGATGGCG
GCCAAAGAGC AGTCCAGGGA GCAGGGGGAG CAGAGCGGAG GCGAGCAGGA CGGCGAGGAT
TACCTGGATC TTGCGCGTTA CCCCCTGGAG GGGGCCGAAG TGGTGGACCG GTATACCTAT
CGCATCAAGA TCAAGGGCAA ATATCCCCAG TTCAGGTACT GGCTCACCAT GCCTTTTTTC
GCTCCTATTC CTTGGGAGGC TGAGCGCTTC TACAGTCAGA AAGGATTGGC AGAAAAAAAT
ATCAACCTCG ACTGGTATCC CGTCGGGACC GGCCCTTATA TGCTGTCGGA AAATAACCCC
AATCGGACAA TGGTCCTGGA AAGGAATCCC AATTTTCATG GCGAAACCTA TCCTGCCGAT
GGAATGCCAG GAGATGTGGA AGCGGGGTTG CTGAAAGATG CGGGCAAGCC CCTGCCTTTT
ATAGAAAAGA TCGTATATAG CCGGGAGAAG GAAAGCATTC CGCGCTGGAA CAAGTTTCTG
CAGGGTTACT ACGATTCATC GAATATCGGT TCCGACAGTT TCGACCAGGC CGTGCAGTTG
ACAGGACAGG GAGAAGCCAC CGTAACCGAA GCCATGAAGG AGCAGGGCAT TCGGCTGGAA
ACGGCTGTTG CGGCTTCCAC AAACTATGTC GGTTTCAACA TGCTCGACCC GGTCGTCGGC
GGGGTGGGGA AGCATGGCCG CGAATCGGCC AGGAAGTTGC GCCAGGCGAT TTCCATTGCG
GTGGATTATG AGGAGTATGT CTCCATATTT GCCAACGGGC GTGGGATTCC CGCGCAAGGT
CCGATTGCTC CCGGCATTGC CGGTCACCGG GAGGGTAAGG AAGGCATCAA CCCGGTTGTC
TATGAGTGGG TAAACGGACG GTCCCGCCGC AAATCCGTCG AAGTGGCCAA GGCATTACTG
GCAGAGGCAG GCTATCCCAA CGGTATCAGC GCGAAAACCG GCGCGCCCCT CGTACTTTAT
TTCGATGTCA CGGCTCGCGG CGCGGAGGAT AAATCCAGTC TCGACTGGAT GCGCAAGCAA
TTTCAGAAAC TCAACATCCA GTTGGTGGTG CGCAGCACGG ACTACAACCG CTTTCAGGAC
AAGATCCGCA AGGGTAATGC CCAGATTTTC GAGTGGGGCT GGAACGCCGA CTATCCCGAT
CCGGAGAATT TCCTGTTCCT GTTACATGGC CCGCAGCAGA AGGTAGACCA TGAAGGCGAG
AACGCTTCCA ATTACTCCAA TTCCGAATAT AACCGGTTGT TCGAGCAAAT GAAGAATATG
GAGAACGGTC CCGCTCGCCA GAAAATCATC GATAGAATGG TCAATATCCT GCGTCATGAC
GCACCCTGGC TATGGGGCTA TCATCCCAAG GATTATGGCC TGTATCACTC ATGGTATGGC
AACGTAAAGC CGAACAGGAT GTCAAACAAC AATGCCAAAT ATCTGCGCAT TGATGGCGTT
TTGCGTGAGC AGAAGCGGCG TGAATGGAAT GAACCAGTAG TCTGGCCGAT GGTTGTAGGA
CTGGCCGTGC TTGCCGGAAG TCTGCTTCCC GCCGTGCTTG TTTACCGGCG TAGGGAACGA
GGGAGAGGAA TATCGGATGT TGCGCTGAAA ATGGAAGCGC AGGGGTAG
 
Protein sequence
MFTQFLNKAP TLPATHLLPA FLAVLVCACS GNPWNSPYPA ADAEKNILYA AFAERPKHLD 
PVQSYSSNEI LFTAQIYEPP LQYHYLKRPY ELIPATATKM PEVHYFDEKG ARLGDNVDAA
QIAFTVYEIS IKSGIRYQPH PAFARDSQGK FLYHRLGEED VRGIYKLSDF PQTGSRELTA
ADYVFQVKRL AHPRLHSPIF GLMSDYIVGL KEYAAMLEMA AKEQSREQGE QSGGEQDGED
YLDLARYPLE GAEVVDRYTY RIKIKGKYPQ FRYWLTMPFF APIPWEAERF YSQKGLAEKN
INLDWYPVGT GPYMLSENNP NRTMVLERNP NFHGETYPAD GMPGDVEAGL LKDAGKPLPF
IEKIVYSREK ESIPRWNKFL QGYYDSSNIG SDSFDQAVQL TGQGEATVTE AMKEQGIRLE
TAVAASTNYV GFNMLDPVVG GVGKHGRESA RKLRQAISIA VDYEEYVSIF ANGRGIPAQG
PIAPGIAGHR EGKEGINPVV YEWVNGRSRR KSVEVAKALL AEAGYPNGIS AKTGAPLVLY
FDVTARGAED KSSLDWMRKQ FQKLNIQLVV RSTDYNRFQD KIRKGNAQIF EWGWNADYPD
PENFLFLLHG PQQKVDHEGE NASNYSNSEY NRLFEQMKNM ENGPARQKII DRMVNILRHD
APWLWGYHPK DYGLYHSWYG NVKPNRMSNN NAKYLRIDGV LREQKRREWN EPVVWPMVVG
LAVLAGSLLP AVLVYRRRER GRGISDVALK MEAQG