Gene Nmul_A0979 details


Gene Information

Locus tag: Nmul_A0979
Symbol:
ID: 3786579
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1139170
End bp: 1140270
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 55%
IMG OID: 637811062
Product: extracellular solute-binding protein
Protein accession: YP_411674
Protein GI: 82702108
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0687] Spermidine/putrescine-binding periplasmic protein
TIGRFAM ID:
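
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1139170
end_bp = 1140270

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and contributes no amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1101 366
```

Under these assumptions the computed values agree with the reported Gene Length (1101 bp) and Protein Length (366 aa).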


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACCCTC CACGACTCCT TTTTTTTCTG CTGGCATCAC TGCTCACGTT CGCCGCGGGA 
TGTACTCCCC CGAATCCGGA TGCGGGCAAC AGCGGCAAAA ATGTCCTGCA CCTGTTCAAC
TGGAATAACT ACATCGCGCC GGAAACCGTT GCGCGCTTCG AGAAATCCTG TAAGTGCGAC
CTGTCGCAGG ATTATTATGC CGACAACGAG GAAATGCTGG CGAAGCTCGC GGCGGGAGCC
ACCGGTTATG ATGTTATCGT TCCCACGGGC AATGCGATCG ACACCCTCAT CCGCCAGGGA
GCGCTGCGGC CGCTGGATAA ATCGCTCCTG CCCAATTTCA GGAATATCAA TCCTGCCTAT
CTCGATACGG CCTTTGACCC CGGTAACATA TACTCGGTCC CCTACGCCTA CACGCTCTCT
CTGCTCGGTT TCAACAAGGA GAAGATCGAG CAGCTTGGTC TGCCGACTGA TACCTGGGCA
ATCATCTTCG AACCCAAATA TCTGGAAAAA ATCAGGGGAC GGGTGACCGT GCTCGACAGC
CAGCGCGAGC TGATGGCCGC CGCGCTCAAG TATCTGGGCT ATTCCGTGAA CGATACGGAT
GAGAGGCATT GGCAGGAGGC CGCCGCTCTG ATCGTGCGCG CCAAACCCTA TTGGGCGGCC
TTCAGCAATA CCAGCTACAT CAAGGAACTG GCAGTGGGTA ATCTGTGGGT GGCGCACGGT
TATTCCAATG ACATGTTCCA GGCGGCGCTC GATGCCCAGA AAACCGGGCG GAAATTCACG
ATCAGCTATT CGACGCCCAA AGAGGGAGCA GTGCTGGCAG TGGATAGCAT GGTTCTGCAC
AAAAGCGGGA AACGCCCCGA TCTTGCTCAC CAGTTCATCA ATTTCATGCT GGATGGAAAG
AATTCCGCCG AACTCACCAA TCTCATCGGC TCGGGCAATC CCAATCTCGA TGCTTTGCAA
TACATCCAGC CAGAAATTGC AAGCAACAAG GCCATTTTTC CCGATCCGGA ACTGATTGCC
CGGCTTGAAA TGCTGCGCGA TCTCGATCGC AAGCAGCGGC GACTGTTGAG CCGCTTGTGG
ACAGAAATTA AACTGCGATA A
 
Protein sequence
MNPPRLLFFL LASLLTFAAG CTPPNPDAGN SGKNVLHLFN WNNYIAPETV ARFEKSCKCD 
LSQDYYADNE EMLAKLAAGA TGYDVIVPTG NAIDTLIRQG ALRPLDKSLL PNFRNINPAY
LDTAFDPGNI YSVPYAYTLS LLGFNKEKIE QLGLPTDTWA IIFEPKYLEK IRGRVTVLDS
QRELMAAALK YLGYSVNDTD ERHWQEAAAL IVRAKPYWAA FSNTSYIKEL AVGNLWVAHG
YSNDMFQAAL DAQKTGRKFT ISYSTPKEGA VLAVDSMVLH KSGKRPDLAH QFINFMLDGK
NSAELTNLIG SGNPNLDALQ YIQPEIASNK AIFPDPELIA RLEMLRDLDR KQRRLLSRLW
TEIKLR
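
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few blocks by hand or in code. The following is a minimal sketch, not the database's own method. It relies on the fact that translation table 11 uses the same amino-acid assignments as the standard genetic code (the two differ in permitted start codons), and the helper name `translate` and the two excerpt variables are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
# Table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Excerpts copied from the gene sequence above: the first 30 bases and the
# final 21 bases (both start on a codon boundary).
first_block = "ATGAACCCTC CACGACTCCT TTTTTTTCTG"
last_block = "ACAGAAATTA AACTGCGATA A"

print(translate(first_block))  # MNPPRLLFFL
print(translate(last_block))   # TEIKLR
```

The two outputs match the first ten and last six residues of the protein sequence listed above.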