Gene Nmul_A2331 details

Gene Information

Locus tag: Nmul_A2331
Symbol:
ID: 3785321
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2652685
End bp: 2654784
Gene Length: 2100 bp
Protein Length: 699 aa
Translation table: 11
GC content: 56%
IMG OID: 637812419
Product: oligopeptide/dipeptide ABC transporter, ATP-binding protein-like
Protein accession: YP_413014
Protein GI: 82703448
COG category: [E] Amino acid transport and metabolism
              [P] Inorganic ion transport and metabolism
COG ID: [COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component
        [COG4608] ABC-type oligopeptide transport system, ATPase component
TIGRFAM ID: [TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain
            [TIGR02323] phosphonate C-P lyase system protein PhnK
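
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked against one another. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the record above.
START_BP = 2652685
END_BP = 2654784


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2100, matching "Gene Length"
print(protein_length(length_bp))  # 699, matching "Protein Length"
```

Under these assumptions both derived values agree with the fields listed in the record.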


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.425368
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTTCTG TCCTGCAAAT CAAAAATCTA GAGACGGTTT TGCACACCGA GCCGCGACCG 
GTTTGGGCGG TCGATGGGCT TACGCTGCAG ATCATGAAGG GTGAAACATT CGCACTTCTT
GGAGAATCGG GGTGTGGCAA ATCCATCACG GCGCTTTCGA TAATGCGGCT GTTGCCGGAA
GCCGGCGAAA TCGTGGGCGG TTCTGTCTGT CTGGGGGGGG AGGATCTTCT GCTGCTTCCC
GAAGCAGCCA TGCGCAATGT GCGCGGGAAG CGCATTGGCA TGATTTTTCA GGAGCCCATG
CTAAGCCTCA ATCCGGTGAT GACCGTGGCG CAGCAGATTG GCGAGGTATT GCAGCGCCAT
TTCGATTTGC GGGGACCAGC GGCAGAAAAA CGCATCGTGG AAATTCTAGA TCAGGTAGGA
ATACCAGATG CAGCAAGACG CATGAACGAA TATCCTTTCC AGTTTTCCGG TGGAATGAAG
CAACGGATGA TGATTGCCAT GGCCCTGGCC GGGGAGCCAG AGCTTCTGAT TGCCGACGAG
CCCACCACCG CGCTCGATGT GACCATTCAG GCACAAGTGC TGGAACTCAT GCGTAGTTTA
CAGCAGCAGA AAGGTATGGC AATTTTGCTG ATTACGCACG ATCTCGGTGT GGCTTCGGAA
ATGGCGCATC GGGTGGGAGT AATGTATGCA GGGGAAATCG TGGAGACGGC GAGCCGTGAG
GCATTTTTTC GCGGCCCGGC GCATCCCTAT TCCCGTAAGC TCCTTGCTGC CTTGCCCGGC
AGGGACAGAC GCAAACAGGG TCATCGGCTG GATGCGATCA GGGGCAACGT TCCCTCCCTG
TCACGGGAAT TCGTGGGCTG CCGTTTTGCG GACCGGTGCG ATCAGGCTTG GGAATTGTGT
CATCAGGTTG CGCCGGGATG GACCGCGCTG ACCGGAGAGC AGCAGGTAAG ATGTCACCTC
TTTAATCAGA ATTTTGGATC GAAGATCGTC AAACCAGCGG TTTCGGCGCA AGCAGGCGCG
CCGGCAACGC GGGATGCGGA AAGCAGAGAA TTTGAATCAT TCCCTGGCGC TGCAGTGGAG
CACTCAACGC CTGATGGTTC CCTTTTATCC GTTACTGATC TGAAGGTTTA TTTCCCTATC
CACAAGGGCT TGATGAGGCG TGTAGGGGGG TACGTCAAGG CAGTCGACGG AATATCCCTC
AAGATTGATC GGGGCAGAAC GTTGGCGCTG GTGGGAGAAT CCGGCTGCGG AAAGACGACG
GCAGGAAAAG CCATGCTGCA ACTCATCCCT CCCACAGCGG GCAGCGTGCG CTACAACGGG
ATGGAACTGG TAGGCCTGGA GCGCACCCGG TTAAGGAGAC TGCGGGGCGA ATTCCAGCTT
ATATTCCAGG ATCCGTATTC CTCACTCAAT CCCAGAATGC GCATTGTCGA TATTATCGAA
GAGGGGATGA ACGCCCTCAG GGTCGAAGAT GACGGAAAAG AACGCAAGGA AAAAAGAGTG
GACGGGCTAC TCGAAAGTGT CGGCCTGCCA GCAGAGACCA AATGGCGTTA CCCGCATGAA
TTCTCCGGCG GGCAGCGCCA GCGGATAGCC ATTGCCCGCG CATTGGCCGT CAAGCCGAAG
CTGATCGTAT GTGATGAGCC CACCAGTGCC CTGGATGTGT CGGTACAGGC TCAGATTTTA
AATCTGTTGA GGGAATTGCA GCGCAATCTG GGCCTGGCCT ATCTGTTCAT CACCCATAAT
ATTTCAGTGG TCGAATACAT TGCGGATGAG ATCGCGGTCA TGTATCTGGG AAGGATAGTG
GAACGCGGGA CAGTGGACGA AGTTCTGGGC AGTCCCCGCC ATCCCTATAC GCAGGCATTG
TTATCCGCTG TTCCCGTTAT CGAACTGGAG TCGAAGCGGA AGGTTATTCG CCTGCACGGC
GACCTTCCGT CCCCCGCCAA TCCGCCGCAG GGCTGCCATT TTCATCCCCG TTGCTCCCAT
GTCATGCCGG TCTGCCATAA AAACTACCCG GCAGCAAGCA CATTCAGTTC AACGCATACG
GCACACTGCT ATCTTTATCC TCAAGACGAG GAGCAGTCAG ACCGTAAAAC ACAGGCATAA
 
Protein sequence
MSSVLQIKNL ETVLHTEPRP VWAVDGLTLQ IMKGETFALL GESGCGKSIT ALSIMRLLPE 
AGEIVGGSVC LGGEDLLLLP EAAMRNVRGK RIGMIFQEPM LSLNPVMTVA QQIGEVLQRH
FDLRGPAAEK RIVEILDQVG IPDAARRMNE YPFQFSGGMK QRMMIAMALA GEPELLIADE
PTTALDVTIQ AQVLELMRSL QQQKGMAILL ITHDLGVASE MAHRVGVMYA GEIVETASRE
AFFRGPAHPY SRKLLAALPG RDRRKQGHRL DAIRGNVPSL SREFVGCRFA DRCDQAWELC
HQVAPGWTAL TGEQQVRCHL FNQNFGSKIV KPAVSAQAGA PATRDAESRE FESFPGAAVE
HSTPDGSLLS VTDLKVYFPI HKGLMRRVGG YVKAVDGISL KIDRGRTLAL VGESGCGKTT
AGKAMLQLIP PTAGSVRYNG MELVGLERTR LRRLRGEFQL IFQDPYSSLN PRMRIVDIIE
EGMNALRVED DGKERKEKRV DGLLESVGLP AETKWRYPHE FSGGQRQRIA IARALAVKPK
LIVCDEPTSA LDVSVQAQIL NLLRELQRNL GLAYLFITHN ISVVEYIADE IAVMYLGRIV
ERGTVDEVLG SPRHPYTQAL LSAVPVIELE SKRKVIRLHG DLPSPANPPQ GCHFHPRCSH
VMPVCHKNYP AASTFSSTHT AHCYLYPQDE EQSDRKTQA
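
The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation sketch. It is a minimal example, not the method used by the source database: the helper names (clean, translate, gc_fraction) are hypothetical, and the codon table is the standard set of amino acid assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons, which this sketch does not model).

```python
from itertools import product

# Standard codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the listing."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 60 bases of the gene sequence, as grouped in the listing above.
first_line = "ATGAGTTCTG TCCTGCAAAT CAAAAATCTA GAGACGGTTT TGCACACCGA GCCGCGACCG"
print(translate(first_line))  # MSSVLQIKNLETVLHTEPRP
```

The printed residues match the first 20 residues of the protein sequence listed above. Passing the full gene sequence to gc_fraction would give a value to compare with the GC content field.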