Gene Nmul_A2292 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2292 
Symbol 
ID3785108 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2607500 
End bp2608543 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content58% 
IMG OID637812380 
Producthypothetical protein 
Protein accessionYP_412976 
Protein GI82703410 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3209] Rhs family protein 
TIGRFAM ID[TIGR01643] YD repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTGGTC TACCGCGGGC AAGTGGATCG AGATTTTCAA GGCGCTTGTG GTCGAACCTG 
ACCTGGAGTG GGCTTTTATC GATGGCAGTT ATGCCAAGGC ACACCAGCAT AGCTCGGCCG
CCTCACGGGA TGATGGATCC TTCCGGAACA ACGGTCTGGA GCTACGATAC CCAGGGCCGG
ATTATTGGAA AAACCCAGGT TGTGGATGGT GCCAGCCTGG TTACCAGTTA CGTTTACGAT
GGCGATGGCC GGCTCGCCAG CCTCACCTAC CCTTCGGGGG AAGTCATTAA TTACAGCTGG
TCCAATGGCC AGCTGACGGG CATTTTCCGG GGAACCTCCC TAATTGCTTC TGGAATCACC
TGTCACCCGT TCGGGCCGGT CAAATCCTGG ACGCTCGATA ACGGGCAGAA CATCGTCCGC
AACTTCGACC TCGATGGGCG CATCACAAGC TACAGCCTTG GATCGCTTGC CTACGATGCG
GCCTCCCGCA TCACCGGCAT ATCCCGGGGC GGCATCAGTA TCCTCGGCAA CAGCAAGACT
TACGGCTATG ATGCCACGGA TCGGCTGATT TCCTTTAGCG ATGGAACCTC GGCCGAGACC
TATGCTTATG ATGCGTCCGG CAATCGGACT GGCCAGACCA TCAACGGCAT GGCCTACACC
TACTCCGTGA GTTATGCCAG CAACCGCCTG GATGCGATGG CCGGTCCGGG AAGTTCCCTG
CACTACAGCT ATGATGCCAA CGGCAGCTTG GTCAATGACG GCCAGAGGAG CTTTGGCTAC
GATGCCAGTG GACGCCTGAG TGAGGCGGTC GGCCTCGCAA GCTACAGTTT CAACGGACTG
GGCCAGCGGG TCAAGAAAAA TGCCGGTACG GTTATCATGT TTGTCTTTGA CGAGGGCGGA
CAGTTGATCG GAGAATACGG CGCAGCAGGC AATCCCATCC AGGAAACCGT CTGGCTGGAC
GACGTTCCCC TGGCTGTCCT CAGGTACGGC ACCACCTACT ACATCCATAC CGATCACCTC
AACACCCCCC CGGCAGATTC ATGA
 
Protein sequence
MPGLPRASGS RFSRRLWSNL TWSGLLSMAV MPRHTSIARP PHGMMDPSGT TVWSYDTQGR 
IIGKTQVVDG ASLVTSYVYD GDGRLASLTY PSGEVINYSW SNGQLTGIFR GTSLIASGIT
CHPFGPVKSW TLDNGQNIVR NFDLDGRITS YSLGSLAYDA ASRITGISRG GISILGNSKT
YGYDATDRLI SFSDGTSAET YAYDASGNRT GQTINGMAYT YSVSYASNRL DAMAGPGSSL
HYSYDANGSL VNDGQRSFGY DASGRLSEAV GLASYSFNGL GQRVKKNAGT VIMFVFDEGG
QLIGEYGAAG NPIQETVWLD DVPLAVLRYG TTYYIHTDHL NTPPADS