Gene Nmul_A2098 details


Gene Information

Locus tag: Nmul_A2098
Symbol:
ID: 3784669
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2390849
End bp: 2391892
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 52%
IMG OID: 637812186
Product: integrase catalytic subunit
Protein accession: YP_412783
Protein GI: 82703217
COG category:
COG ID:
TIGRFAM ID:
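
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates (the record does not state its convention) and a CDS that ends in a single stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming the final codon of the CDS is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(2390849, 2391892)
print(length, protein_length(length))  # 1044 347
```

Both values agree with the Gene Length and Protein Length fields of the record.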


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTAGTG TTCAACAAAA TGTTATTAAG CACAAGGTAG GGCTGCTTAA TCTGGCAGCC 
GAGCTTGGCA ATGTATCGCG TGCCTGCAAG GTGATGGGCT TTTCCCGGGA TACCTTCTAT
CGGTATCAGT CAGCGATGGA AAATGGTGGT ATCGATGCCC TGATTGATGC CAATCGAAGG
AAGCCCAATC CCAAGAATAG AGTTGAAGAA GCAGTCGAGA TGGCGGTAGC TGCATTTGCC
CTTGAGCAGC CTGCATTCGG GCAAGTGCGC GTTTCCAACG AGTTACGCAA ACGCGGCATC
TTCATTTCTC CTTCAGGCGT TCGCTCTGTC TGGCTGCGCA GGGACCTGGA ATCATTCAAG
AAGCGGTTGG CGGCACTGGA ATGTCATATC GCGCAGACTG GAGAAGTGCT GACCGAAGCG
CAGGTTAGTG CCCTGGAGAA GAAGCAGGAC GACGATGTCG CCCATGGGGA GGTGGAAACC
GCCCATTCAG GCTACCTTGG CAGTCAGGAT ACCTTCTATG TGGGCACGAT CAAGGGCGTG
GGTCGTATTT ATCAGCAGAC CTTCGTCGAC ACCTACTCCA AGTGGGCCAC AGCCAAGCTC
TATACGACCA AAACGCCGAT TACAGGCGCG GATTTGCTCA ACGATCGGGT ACTACCCTTC
TTTGCAGAAC AGGACATGGG CCTGATCCGA ATTCTGACCG ACCGAGGCAC TGAATATTGC
GGCAAGCCAG AGACTCACGA TTATCAGCTC TATCTGGCGC TCAATGATAT TGAACATACT
CGCACCAAAG CCAATCATCC GCAGACCAAC GGCATCTGTG AGCGCTTTCA CAAGACGATC
CTGCAAGAGT TCTACCAGGT CGCGTTCCGG CGCAAGATAT ACCGATCAAT CGAGGAGTTG
CAAATTGACC TGGATGACTG GCTGCACTAC TACAATCACG ACCGCACTCA TCAGGGCAAG
ATGTGCTGCG GGCGAACACC CATGCAAACA TTAATTGACG GAAAGGAGGT GTGGAACGAT
AAAATCACAT TACTGAACAG TTGA
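
The gene sequence above is printed in blocks of ten bases, so the whitespace has to be removed before any computation. As a minimal sketch, a GC percentage such as the one reported in the GC content field could be computed as follows. The helper names `clean` and `gc_percent` are hypothetical, and how the database rounds the value is not stated in the record.

```python
def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Only the first line of the gene sequence is used here for brevity;
# pass the full 1044 bp sequence to evaluate the whole gene.
first_line = "ATGAGTAGTG TTCAACAAAA TGTTATTAAG CACAAGGTAG GGCTGCTTAA TCTGGCAGCC"
print(round(gc_percent(first_line), 1))
```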
 
Protein sequence
MSSVQQNVIK HKVGLLNLAA ELGNVSRACK VMGFSRDTFY RYQSAMENGG IDALIDANRR 
KPNPKNRVEE AVEMAVAAFA LEQPAFGQVR VSNELRKRGI FISPSGVRSV WLRRDLESFK
KRLAALECHI AQTGEVLTEA QVSALEKKQD DDVAHGEVET AHSGYLGSQD TFYVGTIKGV
GRIYQQTFVD TYSKWATAKL YTTKTPITGA DLLNDRVLPF FAEQDMGLIR ILTDRGTEYC
GKPETHDYQL YLALNDIEHT RTKANHPQTN GICERFHKTI LQEFYQVAFR RKIYRSIEEL
QIDLDDWLHY YNHDRTHQGK MCCGRTPMQT LIDGKEVWND KITLLNS
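
The protein sequence corresponds to a translation of the gene sequence with translation table 11. The sketch below is a minimal translator under stated assumptions, not the database's own pipeline. It uses the standard codon assignments, which table 11 shares with the standard code; the two differ only in which codons may act as start codons. This gene starts with ATG, so the sketch does not handle alternative start codons. The `translate` function is illustrative.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the standard
# amino-acid assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence from its first base up to the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for unrecognised codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence
print(translate("ATGAGTAGTG TTCAACAAAA TGTTATTAAG"))  # MSSVQQNVIK
```

The output matches the first ten residues of the protein sequence, and the last 24 bases of the gene translate to its final seven residues, KITLLNS, followed by the TGA stop codon.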