Gene Nmul_A1904 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1904 
Symbol 
ID3784142 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2193591 
End bp2194796 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content59% 
IMG OID637811990 
Productgeneral secretion pathway protein F 
Protein accessionYP_412591 
Protein GI82703025 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1459] Type II secretory pathway, component PulF 
TIGRFAM ID[TIGR02120] general secretion pathway protein F 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGCGT ACCGCTACGA AGCGCTTGAT CCCGAAGGCC GCAAGGTAAC GGGTGTGCTG 
CAGGCTGATA CCGCGCGCCA GGCGCGGGCA CAGTTGCGCG CGCAAGGCTT GCTGCCTTCG
ACCGTCGATC AGGTCCGCGC TCGCGAGGGC GGTCAGGCAC CCTGGGTCCG AGGCCTGCGT
CCGGAGGAGC TGAATTTGCT GACACGGCAG ATGGCTACTT TGCTGACCGC CGGTCTGACA
GTCGAGCAGT CGCTTGCTGC ACTGATCGAA TCCGCCGAGG AGCCGATGAC ACGCGAAGTG
CTCGGCGGTG TCAAAACAGA GGTGATCGCG GGGCTTTCCC TCTCCGCCGC GCTGGGCAGC
TACAGCAGGA GTTTTCCCGA TTTCTATCGG GCGCTGGTGC ATGGTGGGGA AGAATCGGGC
ACATTGCCGC TGGTACTGCG TCATCTTGCC GAGTATCTCG ATGCACGTCA GACACTAAAA
CAGAAAACCA GTCTTGCGCT CCTTTACCCG GCACTGGTGA CCATCATTGC CATTATCATT
GTTGCCGGCC TGCTCATGTA TGTGGTTCCC CAGGTAGTGC AGGTATTCCA GCATTCTCGC
CAGAGCCTGC CCCTCCTGAC TCGCGCACTG ATCGGGTTGA GTGATTTCCT TCTCATGTCA
TGGCCTTATC TGATTATTGC CATTGTCGGC GGGGCACTTT CCGCACGCGT CGCGCTACGG
CATGAGAACA TCAGATACCG ATGGCACGCT CTGCTGCTGC GCACCGCATG GCTGGGATCG
TTGATTCGCA GCAGCAACAC ATCCCGTTTC GCCAGCACGC TTTCCATTCT GGTCGGAGGA
GGTGTGCCGC TTCTTAAAGC TCTCAGCTCC GGTGCCCGCG TGATGAGCAG CATGGTCATG
CGTAAAGCGA TCGAAAATAC CATCGAACAG GTCCGCGAAG GCGCGAGCCT CTCCAGAGCG
CTGCGGGAAA CCCGCGTGTT TCCGCCCCTG CTTGTGCATC TGGTGGCAAG CGGGGAAATG
AGCGGCAAGC TGAAAGAAAT GCTCGAACGC GCCGCCCAGC TCGAAGCCCA GGCGCTGGAA
CGGCGACTGG GCGTCTTTTT AACGCTGCTG GAACCAGTAA TGATCCTGGT AATGGGGGGC
GTGGTGCTGA TGATCGTGCT TGCCATACTG CTCCCCATCA TGGAAATCAA CCAGCTGGTG
CATTAG
 
Protein sequence
MEAYRYEALD PEGRKVTGVL QADTARQARA QLRAQGLLPS TVDQVRAREG GQAPWVRGLR 
PEELNLLTRQ MATLLTAGLT VEQSLAALIE SAEEPMTREV LGGVKTEVIA GLSLSAALGS
YSRSFPDFYR ALVHGGEESG TLPLVLRHLA EYLDARQTLK QKTSLALLYP ALVTIIAIII
VAGLLMYVVP QVVQVFQHSR QSLPLLTRAL IGLSDFLLMS WPYLIIAIVG GALSARVALR
HENIRYRWHA LLLRTAWLGS LIRSSNTSRF ASTLSILVGG GVPLLKALSS GARVMSSMVM
RKAIENTIEQ VREGASLSRA LRETRVFPPL LVHLVASGEM SGKLKEMLER AAQLEAQALE
RRLGVFLTLL EPVMILVMGG VVLMIVLAIL LPIMEINQLV H