Gene Nmul_A0292 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0292 
Symbol 
ID3785538 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp313774 
End bp315273 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content53% 
IMG OID637810368 
Productpolysaccharide biosynthesis protein 
Protein accessionYP_410992 
Protein GI82701426 
COG category[R] General function prediction only 
COG ID[COG2244] Membrane protein involved in the export of O-antigen and teichoic acid 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.53568 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATAAGC AGCAACATCA GTCCGATATT ACCGCCAAGG TGTTATACGG CGCCCGCTGG 
TCGGCCATGC TCCGGGTGAC CGGCCAAATG GTCAGCTGGC TCAGTACCAT CATCGTCGTG
CGCTTCATCC GCCCGGAGGA TTACGGACTC AATGCCATGT TGGAAGCGCC ACTCGAACTC
CTCATGCTAC TCAGCACGTT TGGCCTCGAC TTGGCCCTGG TCCGGTGGAA AACGATCGAG
CAGGAGGAAT TGCGCAGTGT CTTCGGCTCG CTGCTGATCA TCAACGGACT CCTTTTTCTC
GTCTATTTCT TTGGCGGCAG TCTGATGGCT GCTTATTTTG ATGAACCTCG TCTTGAATCG
CTGGCTCAGG TGCTGGCCTT CGTTTTTATC CTGGCACCGT TTCGGGTCAT TCCCAACGCA
CTGCTGGATC GTAATCTGAA ATTCAAGCTG CGCGCTTTGG CCGAGTTTAT TGCAAACATA
AGCGCCGCTG TAGCGACACT GGTGCTTGCA ATTCTCGGCT GGGGAGTCTG GGCCTTGGTA
TCGGGGGTGC TGATCAACCG GATTCTTCTC GCAATTATAT TGATGGTCCT GCAGCCCTGG
TTCATCATGC CCTCTTTGAA CTTCCCGGCT GTACGCGCCA TGATGGTCTT CGGCGGCGTT
TTGTCCTTGG GCGGGGCAGT CGTGCTCGTC ACTGACAAGC TTGCCACCCT GATTGCCGGT
CCTGTTCTGG GCGCGGAATT ACTCGGTATT TTTGCCGTTA CCTTTCAGTT TGCGCTGTTG
CCGCTCGCCA AGATAATGCC GGTAATTAAT CCCATTATCT TTCCCGCATT TTCCAAATTC
CAGGATCAGC CCGGGGTGGC AACGTACTAT TTGAGTAAAT CTCTTGGTAT TGTTTCGCTG
GCCTTGTTTC CTGTCATGAT AGGGCTGGCC TGCATCGCAC AGGAATTCGT GGCTACGGTG
CTGGGTAACA AATGGGCGGC AGTGGCTTTG CCGCTCGCAT TATTGTCCAC CGTGATGCCA
TTCAGGATGA CGACATCTTT TCTCCGGCCG GTACTGGCAA GCATGGGACG AGCCGATCTT
TCGTTGAAGT CTGCTGTTTT CGCCTTGATT ATCTTATTGC CGCTGATATT GGTGGGTGCT
CATTACGGAG TGATGGGGCT GGTGATGGCC ATGGTGGTGA CCGAACTGAT CGTCGTCTTT
CTGACTATCG GCATGAGCAA GGCAGTTCTG CACACGTCCT TTACAGGGAT TGCGCTGAGC
CTTCGCCCCG CCATAGCCGC TTCTACAGTG ATGGCGGCGT GCCTGATGGG CGCAAAAATA
GCCTTGGGCG ACGCATTTGG CAGTAGTGCC AACCTTATCA CATTACTTAT CGAAATCAGT
TTTGGCGCAC TTGTTTATTT CTTGACGTTG CGGATCTTCT ACGGAAAACT GCTGGATGAC
ACCATACGGT TGTTTCTGGG CCGTAATGGG GGACTTGCTC ATCTGCCAAA TGAGTCGTGA
 
Protein sequence
MDKQQHQSDI TAKVLYGARW SAMLRVTGQM VSWLSTIIVV RFIRPEDYGL NAMLEAPLEL 
LMLLSTFGLD LALVRWKTIE QEELRSVFGS LLIINGLLFL VYFFGGSLMA AYFDEPRLES
LAQVLAFVFI LAPFRVIPNA LLDRNLKFKL RALAEFIANI SAAVATLVLA ILGWGVWALV
SGVLINRILL AIILMVLQPW FIMPSLNFPA VRAMMVFGGV LSLGGAVVLV TDKLATLIAG
PVLGAELLGI FAVTFQFALL PLAKIMPVIN PIIFPAFSKF QDQPGVATYY LSKSLGIVSL
ALFPVMIGLA CIAQEFVATV LGNKWAAVAL PLALLSTVMP FRMTTSFLRP VLASMGRADL
SLKSAVFALI ILLPLILVGA HYGVMGLVMA MVVTELIVVF LTIGMSKAVL HTSFTGIALS
LRPAIAASTV MAACLMGAKI ALGDAFGSSA NLITLLIEIS FGALVYFLTL RIFYGKLLDD
TIRLFLGRNG GLAHLPNES