Gene Nmul_A1903 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1903 
Symbol 
ID3784275 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2192118 
End bp2193581 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content59% 
IMG OID637811989 
Producttype II secretion system protein E 
Protein accessionYP_412590 
Protein GI82703024 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID[TIGR02533] general secretory pathway protein E 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGTCTG ACAGGATTCC TTATGTCTTT GTCAAGACAA ATGGAGTGGC CGTGACGAGC 
GTCACGAGCG ATCACGCAGA GGTGGTGGTG CGCGGCGAAG TCCAGGCCGG GGCACTGGCG
GAGTTGCGTC GCGTTCTGGG CGTGCCTCTG CGGGCGCGAC GCCTCGCTAC CGACGAGTTC
AACGAGATCG TTGCGGTGCT GTATAACGGC GCAAACGAAG GCGCCGCCGC ACTCGCAGAT
GATCTCGCAC AGGATATAGA CCTCTCGAGA TTACTGCAGG AACTGCCGAA GGTCGAGGAC
CTGCTCGAAA GCCAGGATGA TGCTCCTGTC ATCCGGCTGA TCAACGCGCT CTTTACACAG
GCGTTACGCA CCGCGGCTTC CGATATCCAC ATTGAGCCAT ATGAAACACG CTCGGTTGTA
CGGTTGCGAG TGGACGGCAC ATTGCGCGAC CTGATCGAAC CGGCGCGCGC ATTACATGCC
GCCCTCATCT CACGCATCAA GATCATGGCG CAGCTCGACA TTGCGGAAAA ACGCCTTCCG
CAGGACGGCC GGATTACATT GAGGATGGCA GGCAGGCCAG TCGACGTGCG CGTATCCACC
ATCCCCACCG CACACGGCGA ACGCGCCGTA TTGCGTTTGC TGGACAAGCA GGCTGGCCGC
CTGGACCTCC CTCGGCTTGG CATGGATGAA ATCACCTTGA CTCGCATGGA CAGGCTTATT
CGCGAGCCCC ATGGCATTAT TCTCGTAACC GGCCCCACCG GATCGGGTAA AACAACCACG
CTTTACGCCG CCCTGTCGAG GCTGGATTCC GCGTCGCTCA ACATCATGAC GGTTGAGGAC
CCCATCGAGT ACGATCTTGA CGGCATCAGT CAGACTCAGG TCAATCCGCG AATCGAAATG
ACGTTTGCGC GCGCCTTGCG GACAATCCTG CGGCAAGACC CGGATGTCAT CATGATTGGA
GAGATTCGCG ACCTCGAAAC CGCACAGATC GCGGTGCAGG CCAGCCTTAC GGGCCATCTG
GTATTTGCGA CTCTGCATAC CAATGATGCG ATAAGCGCCG TGACCCGGCT TGTCGACATG
GGAGTCGAGC CGTTTCTGCT GGCATCGAGT CTCATCGGCG TAGGTGCGCA GCGTCTGGTG
CGGCGGCTCT GTCTGGAATG CCGCCAGCCC TGGGACGAGG CCATGGGAAA ATCCCCGAGC
TCTTTTTCGG CTTCCGGAAT TTTATACAAG GCGCAGGGCT GTGCGGCATG CAATCACTCC
GGCTATCAGG GACGCACCGG GATTTATGAG TTGCTCGCGG TTGACAACGA CCTGCGCCGG
AGAGTTCATG ATCGCGCTTC CGAACAAGAC CTGCGAGAAT ATGTGATTTC CGCCGGAATG
CGCTCGTTAC GTGACGACGG CATGCGCCTC GCTACCCAGG GCATCACCAG CCTGGAGGAA
GTCGTGCGTG TAACACGCGA ATAG
 
Protein sequence
MASDRIPYVF VKTNGVAVTS VTSDHAEVVV RGEVQAGALA ELRRVLGVPL RARRLATDEF 
NEIVAVLYNG ANEGAAALAD DLAQDIDLSR LLQELPKVED LLESQDDAPV IRLINALFTQ
ALRTAASDIH IEPYETRSVV RLRVDGTLRD LIEPARALHA ALISRIKIMA QLDIAEKRLP
QDGRITLRMA GRPVDVRVST IPTAHGERAV LRLLDKQAGR LDLPRLGMDE ITLTRMDRLI
REPHGIILVT GPTGSGKTTT LYAALSRLDS ASLNIMTVED PIEYDLDGIS QTQVNPRIEM
TFARALRTIL RQDPDVIMIG EIRDLETAQI AVQASLTGHL VFATLHTNDA ISAVTRLVDM
GVEPFLLASS LIGVGAQRLV RRLCLECRQP WDEAMGKSPS SFSASGILYK AQGCAACNHS
GYQGRTGIYE LLAVDNDLRR RVHDRASEQD LREYVISAGM RSLRDDGMRL ATQGITSLEE
VVRVTRE