Gene Nmul_A2198 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2198 
Symbol 
ID3786223 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2495984 
End bp2497012 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content56% 
IMG OID637812285 
Producthypothetical protein 
Protein accessionYP_412882 
Protein GI82703316 
COG category 
COG ID 
TIGRFAM ID[TIGR02595] PEP-CTERM putative exosortase interaction domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000179032 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAATC AAAAACAGAA AGCGATTGCA CAGGCAACGC GCGCGGTAGT TAACGGAGGA 
TTGTTTGCGG GTTGCCTGTC GTTCAGTCTC ATCGCAAACG CGGGCATCAA TTTCCAGTTC
GTATACAAGG ATGCGCCTGG TACGGGATTT CTCGATCCCG TAAACGGCGC CAGTCGGCAA
GCTGCGCTCA ACACTGCTGC CACGGAATTC TCCAGAATGT TTGGCACCCA CTTCGCCAAC
TCGGGGACCA TTGTGCTGGA AGCAACGGCC ACCAATGATC CCCAGAGCAG TACCCTGGCA
GGGGCGGGCA GTGAATATGT CGTTCCCCCG GTACCGGGAT TCAACCTCAA CGAGGTCGTG
CGCGAGAAAC TCCAGACAGG GATTGATTCC AACGGAAGCA GACCCGATGG CTCGCTCGAT
ATCAACTTTG GGAGCAAATG GGAGCTTGGC TTCAACACGC CCGTGTCGAG TGAGCGCTAC
GACTTCTATT CCACGATGTT TCATGAATTT ACGCATACGC TTGGTTTTTC TTCATCCATA
GGGCAATTCG GTGATCCGAT CGGGGGTACG AAGGATGCGG GAAGCTGGAG CAGCTTCGAC
AGCTACCTGG TAAACAAGAG TGGAACTCCG GTCATTGATC CTGCAACTTT CGCACTCGAT
CAGACTGTCT GGGATGCAGG CAGCGTGGGA GGCACCAGTC CCTCGGGAGG CCTGTTCTTC
GATGGAGCTC ACGCCATGGC GGCAAACGGA GGCAATCCGG TGGGTCTGTA CACACCGTTT
CCGTGGGAGG AGGGGAGCAG CGTTTCCCAC CTGGATGATA ATAATAGCGC TTATGCGGGA
ATGATGATGC TGGCCGCCTC TGAGACGGGA CCTTATGCCC GGGACTACAG TGCGGTCGAG
ATTGGCATGC TCCAGGATCT TGGATATACG GTAACGGCTG TGCCAGAGCC GGAGGTCTAC
GCGATGATGC TGGCCGGCTT GGGGTTGCTG GGCTGGGGAA CGCGGCGCAA AAAGCGCCAT
GACCAGTAA
 
Protein sequence
MKNQKQKAIA QATRAVVNGG LFAGCLSFSL IANAGINFQF VYKDAPGTGF LDPVNGASRQ 
AALNTAATEF SRMFGTHFAN SGTIVLEATA TNDPQSSTLA GAGSEYVVPP VPGFNLNEVV
REKLQTGIDS NGSRPDGSLD INFGSKWELG FNTPVSSERY DFYSTMFHEF THTLGFSSSI
GQFGDPIGGT KDAGSWSSFD SYLVNKSGTP VIDPATFALD QTVWDAGSVG GTSPSGGLFF
DGAHAMAANG GNPVGLYTPF PWEEGSSVSH LDDNNSAYAG MMMLAASETG PYARDYSAVE
IGMLQDLGYT VTAVPEPEVY AMMLAGLGLL GWGTRRKKRH DQ