Gene Nmul_A2354 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2354 
Symbol 
ID3785291 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2680898 
End bp2682256 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content62% 
IMG OID637812445 
Producttype II secretion system protein E 
Protein accessionYP_413037 
Protein GI82703471 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4962] Flp pilus assembly protein, ATPase CpaF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGATAC GAGAACGTCT CCAGAGCGTG AAGGAAACCG GGTTGGCTGA GCATCATCCC 
GCGCCGGTCA CGGCCCTGCA CGATATCCCC GCCTATGAGG AATTGAAGGC CCGCGTCCAC
CAGAAGCTGC TCGATCGGGT GGATCTGGCA GTCATGGAAA GCCTGCCGGC TGAACGCCTG
CTGATGGAGA TCAGGAACCT GGTGGAGCGG TTGCTGGTCG AGGAATCCGT CCCCATCAAC
GAGGCGGAGC GGCAGGGCAT CGTCCGCGAC ATCCAGAACG AAGTGCTCGG CCTGGGTCCG
CTCGAGCCCT TGCTGGCAGA CCCGACCATT TCCGACATCC TGGTGAATAC CCATCGGCAG
GTCTATGTCG AGCGGCGGGG ACGTCTGGAA CTGACCGACA CCCACTTTGC CAACGAAAAG
CACTTGCGAA AGATCATTGA CAGGATCGTG TCGCGTGTCG GGCGGCGCGT GGATGAGTCC
AGTCCCATGG TCGATGCGCG TCTGCCCGAC GGTTCACGCG TCAATGCCAT CATTCCGCCC
CTGGCGATCG ATGGCTCGCT GCTTTCGATC CGGCGTTTTT CCGTCAAGCC GCTCAAGATG
AACGATCTCA TGGCGTACAA GTCGCTGACC CCGGAAATGG GCGAGATCAT CAGCGGACTG
GTCAAGGGCA AGTGCAGCAT ACTCATTTCC GGCGGCACCG GCAGCGGCAA GACCACGCTG
CTCAATATCA TGTCCGGTTT CATTCCGTCC TCCGAACGGA TCGTGACCAT CGAGGACGCG
GCCGAGCTGC AGCTGCAGCA GCCCCACGTG GTGCGGCTGG AGACGCGCCC GCCCAACGTC
GAGGGCAAGG GGGAAATCTC GCAACGGGCG CTGGTGAAGA ACAGCCTGCG CATGCGCCCC
GACCGGGTGA TCATAGGGGA AGTGCGCGGA GCCGAGGCGC TGGATATGCT GCAGGCCATG
AACACCGGTC ACGAAGGTTC CATGGCCACG ATCCATGCGA ATACGCCGAG GGATGCCCTG
GGCAGGGTTG AAAACATGGT GAACATGGCC GGGTTGAACC TGCCCATCAA GGCGGTCCGC
CACCAGATCA GTTCGGCCAT CTGGGTGGTG ATCCAGGTCT TGCGCCTGAC TGACGGCAAA
CGCAAGGTGA CGAGCATCCA GGAAATCACC GGCATGGAGG GGGACATTAT CACGATGCAG
GAAATCTATG CTTTCGAGCA GACGGGTATC GCGGCGGACG GAACCGTGCA GGGCCATTTC
CGCGCCACCG GCATCCGCCC CAAGTTCGCC GAGCGACTGC GTGTGCATGG GATACCGCTG
CGCGAGGAGC TGTTCGATCC CTCGCGCCGG TATACATAG
 
Protein sequence
MSIRERLQSV KETGLAEHHP APVTALHDIP AYEELKARVH QKLLDRVDLA VMESLPAERL 
LMEIRNLVER LLVEESVPIN EAERQGIVRD IQNEVLGLGP LEPLLADPTI SDILVNTHRQ
VYVERRGRLE LTDTHFANEK HLRKIIDRIV SRVGRRVDES SPMVDARLPD GSRVNAIIPP
LAIDGSLLSI RRFSVKPLKM NDLMAYKSLT PEMGEIISGL VKGKCSILIS GGTGSGKTTL
LNIMSGFIPS SERIVTIEDA AELQLQQPHV VRLETRPPNV EGKGEISQRA LVKNSLRMRP
DRVIIGEVRG AEALDMLQAM NTGHEGSMAT IHANTPRDAL GRVENMVNMA GLNLPIKAVR
HQISSAIWVV IQVLRLTDGK RKVTSIQEIT GMEGDIITMQ EIYAFEQTGI AADGTVQGHF
RATGIRPKFA ERLRVHGIPL REELFDPSRR YT