Gene Nmul_A2588 details

Gene Information

Locus tag: Nmul_A2588
Symbol:
ID: 3785469
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2972856
End bp: 2974655
Gene Length: 1800 bp
Protein Length: 599 aa
Translation table: 11
GC content: 57%
IMG OID: 637812679
Product: type II secretion system protein E
Protein accession: YP_413269
Protein GI: 82703703
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB
TIGRFAM ID:
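
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the unspliced coding sequence ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(2972856, 2974655)
print(length)                  # 1800
print(protein_length(length))  # 599
```

Both results agree with the Gene Length (1800 bp) and Protein Length (599 aa) fields of this record.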


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATGCAC CTTCCGTACT CATCAGCGAG CGCAGGGAGA ATCCCGAGCA AACGGGGCAC 
CCATCTCTTG CGCTTCAGGA AGAATATTCC CGGAATGAGC TTCCGGTAGA CGCGCGGGCA
TCACAGCCGC CGTCTCAGAT ATTGCTGCTG GATGCGCAGC GGTTGGGCGT GGCGCGCAGC
GAGGCCGCCA ACCGGGGTGT TCCGGTGGTC AGCATCCTGG AGGAAACACT GGGATGCGCG
CCGGAGCAGT TGATCGCAGA ACTCGGGCGG CTGCTCAGGA TGCCGGTATT GACGATGGAG
AAGTTACGGG CATCAACTCC GGCGTTTGAA ATATTGCCTT TCAGTGAAGC CACTAAAAAG
GAGTGCGTAT TGCTTCGCCA GCAGGGGAAG CACATCCTTG CCGTCAGCAA TCCCTTTTCC
TCCAGCTTGA GGGCCTGGGC GGAAGAATAT ATCGATGTTC CCGCGATATG GCACCTGGTT
CATCCCGCCG ATCTGACAGC CTTTTTCAGC CAGCAGGAGC AGACCATGCG CGCCATGGAC
AGCGTGCTGC CTGCGGCAGA GCGGGGCGCC AGGCAGCCGG GCGAGGAAGA CCTCTCGCTG
AAGACAATCA ACGAAGGCAC CAGCCAGGTA GTCCGTTTGG TGCATTCGAC GCTGTATGAT
GCCCATAAGT CGCACGCGAG CGATATTCAC CTGGAAGTCG TTCCCGGAGC CCTGTCGATC
AAATATCGTA TTGACGGTGT ACTGACCATG ATTGGAGTGG TGCAGGGGGC GGATCTGGCG
GAACAGGTCA TTTCCCGTAT CAAGGTAATG TCCGATCTGG ATATAGCTGA GCGGCGAGTT
CCCCAGGATG GCCGTTTCAA GATTTCCATT CAGGGGCGGG AGATCGACTT TCGTGTTTCG
ATCATGCCCA GTGCTTTCGG AGAAGATGCG GTACTGCGCA TTCTCGACCG CCAGGCGCTG
GCTGATCATG TCAAGGGCTT GACCCTCGAT CATCTTGGAT TCGATCGGGT CGCCATATCC
ACCCTGCGAC GCTTGAGCTC GGAGCCCTAT GGAATGCTGC TGGTGACAGG CCCCACCGGC
AGCGGGAAGA CTACCACGCT TTATGCGGCG ATTTCGGAAG TCAACCAGGG TCACGATAAG
ATCATTACCA TCGAAGACCC GATCGAGTAT CAATTGCCTG GCGTATTGCA GATACCTGTC
AACGAGAAAA AAGGATTGAC TTTCGTGCGC GGCCTTCGCT CCATCCTGCG CCACGACCCC
GACAAGATCA TGGTAGGCGA AATCCGCGAT CCGGAAACGG CGCAGATTGC CATTCAGGCT
GCTTTGACCG GCCATCTCGT ATTCACCACC GTACATGCCA ATAGCGTTTT TGATGTCATT
GGCCGTTTTA CCCACATGGG AGTGGATCCC TACAGTTTCG TTTCTGCCTT GAATGGGGTT
GCCGCGCAAA GGCTGGTGCG CCTGCTTTGC GTGCATTGCG CAGTGGAAGA ACAGCCCGAC
GAGCAGCTGA TTGCGGAATC CGGAATCGAT CCTGAGCAGA TCGCCGCGTT CAGATTCCGC
AGCGGCAAAG GATGCGGCCA TTGCCGGGGA AGCGGCTATC GGGGGCGAAA CGCAATCGCG
GAAATCCTGG TGCTGAACGA TGAAATCCGC GAACTCATCG TAGCGAAAGA ACCCGTACGC
CGTATAAAGG AAGCCGCGCG GCGGGGAGGC ACCCTGTTCC TGCGGGATGC TGCTTTGGCC
ATGGTCAGGA GCGGGCAGAC GAGTCTACAG GAGGCAAACC GTGTCACTAT TGCGGCGTAA
 
Protein sequence
MNAPSVLISE RRENPEQTGH PSLALQEEYS RNELPVDARA SQPPSQILLL DAQRLGVARS 
EAANRGVPVV SILEETLGCA PEQLIAELGR LLRMPVLTME KLRASTPAFE ILPFSEATKK
ECVLLRQQGK HILAVSNPFS SSLRAWAEEY IDVPAIWHLV HPADLTAFFS QQEQTMRAMD
SVLPAAERGA RQPGEEDLSL KTINEGTSQV VRLVHSTLYD AHKSHASDIH LEVVPGALSI
KYRIDGVLTM IGVVQGADLA EQVISRIKVM SDLDIAERRV PQDGRFKISI QGREIDFRVS
IMPSAFGEDA VLRILDRQAL ADHVKGLTLD HLGFDRVAIS TLRRLSSEPY GMLLVTGPTG
SGKTTTLYAA ISEVNQGHDK IITIEDPIEY QLPGVLQIPV NEKKGLTFVR GLRSILRHDP
DKIMVGEIRD PETAQIAIQA ALTGHLVFTT VHANSVFDVI GRFTHMGVDP YSFVSALNGV
AAQRLVRLLC VHCAVEEQPD EQLIAESGID PEQIAAFRFR SGKGCGHCRG SGYRGRNAIA
EILVLNDEIR ELIVAKEPVR RIKEAARRGG TLFLRDAALA MVRSGQTSLQ EANRVTIAA
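
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is an illustrative sketch only, not the method the database used: it assumes the 10-base blocks are purely a display format, that the sequence contains only A, C, G and T, and that the codon assignments of translation table 11 match the standard genetic code (the two tables differ in permitted start codons, which does not matter here because this gene begins with ATG). The helper names `clean`, `translate` and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError on ambiguous bases such as N (not handled in this sketch).
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGAATGCAC CTTCCGTACT CATCAGCGAG"))  # MNAPSVLISE
```

The ten residues printed match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the Protein Length and GC content fields to be checked in the same way.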