Gene Nmul_A1902 details


Gene Information

Locus tag: Nmul_A1902
Symbol:
ID: 3784274
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2189733
End bp: 2192102
Gene Length: 2370 bp
Protein Length: 789 aa
Translation table: 11
GC content: 57%
IMG OID: 637811988
Product: type II and III secretion system protein
Protein accession: YP_412589
Protein GI: 82703023
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG1450] Type II secretory pathway, component PulD
TIGRFAM ID: [TIGR02517] general secretion pathway protein D
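
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(2189733, 2192102)
    print(length)                  # 2370
    print(protein_length(length))  # 789
```

Under these assumptions the computed values agree with the listed gene length (2370 bp) and protein length (789 aa).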


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0584402
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGCTACA GAGGTAAAAT CGTGCGGCAT CGCTCGCTAT GTGCAGTTTT TTTATGGATG 
GCAGTGGCAG GCTGGGTTAC CGAGGCTTGG GCCGTAAATC CCTCTGCCAC CCTGCCCGTC
GAGGAAATTG GCGTCCCGCC GAGCGTCCTG CCGGAAGCGG ATTCCCTGCG CGCCAGTAAA
TCCAGGTCTT CTGCCGGTCA GGATCTCGTT ACACTCAATT TCGTCAATGC CGATATTGAA
GGGGTAGTGA AAGCAGTCAG CGAAATTACC CGAAAAAATT TCATGCTCGA CCCCCGCGTC
AAGGGTACGA TCAACATCGT TTCAGCCAAG CCAGTGCCGA GGTCGTCGGT CTACGAAGTA
TTCCTGTCGG CACTGCGATT GCATGGATAT GCAGTCGTTG AGGATTACGG CATCATCAGG
ATCGTTCCGG AAAGTGATGC CAAGCTATAC CAGGGCCCGA CACTTGGTCC CACGAACAAG
CGGCAGCTCG CGGGCGACCG TATCCAGACA CAGGTATTCA CGCTGCAGTA CGAATCCGCG
GTGCAGATGG TGCCGATCCT GCGTCCGCTG ATCGCTCCAA ACAACAGTAT CACTGCAAAT
CCCAACAGTA ACACCCTGGT TATTACAGAC TACGCGAGCA ATCTCCAACG CCTGGCGAAA
ATAATTGATT CGGTGGATCA GCCAAGTGGA ACCGAGCCTG TCTCGATACC CCTTCAGCAC
GCCTCGGCGA TCGATGTCGC GCAAACCGTG AACCGGCTGT TTTCAGAATC GACGCAGTCC
CAGGCCGAGG GCGCCGCGGA CCCCACCCAG CAGCGTTTCA CAGTCGTCGC CGACGCCCGC
TCGAATACCC TCCTTGCACG CTCCGGAAAC CGGGCAGCGC TTGCGCGTCT GCGCCAGCTG
GTAACAGTGC TCGATTCTCC CACCAGCGCT GCCGGCAACA TGCACGTCGT CTTTCTCAAG
AATGCCGATG CAGTCAGGCT TGCCGAAACC CTCAGGGCGA TCTATCACAA CATGGCGTCC
CCGGTTTCCT CATCTTCGGG ACTGAGCCAG GGCACCGGCA CAGCTTTTGG AACATCTTCC
CTGGGTACAT CCACCGGCGG GGGGATGGGT GCCTCGTCAG GCACCTCAAC AGGGGGGTCG
ATGGGCACTT CCATGCCCGG TTCCAGCCTT GGTGCGGGGA CTGTTCCCGC TGCTTCCACC
GTCACCCCGG CTCCGATGCA AACTGGCGCA ACTTCCGCCA CCCCCGGCAT CATTCAGGCG
GATGCAGCCA CCAACTCGAT CATTATTACT GCCCCGGATG CTATTTATAA TAATTTGCGC
GCGGTGGTGG AGAAGCTCGA CGTGCGCCGC GTGCAGGTTT ATATTGAAGC GCTGATTGCC
GAAATCACTG CCGACAGAGC CGCGGAATTC GGCATCCAGT GGCAGAATCT GAGCAATGCC
GCGCAAGGTG GCACCCAGGT TTTCGGTGGC ACCAACTTCA ATGCCGGCAC TGCCGGAGGC
GGCAGTATCA TCTCCACCGC CCAGAATCCG ATAGCGAATG CAGCCTCCGG TCTGACTATC
GGCATCATGA ATGGCCTTGT CACGGCTATT CCCGGCATCG GCCCTGTTCT CAACATTCAT
ACGCTCATCC GCGCGCTGGA AACGGATGCC AATGCCAACA TTCTTTCCAC CCCCACCCTG
CTGACACTGA ATAACGAAGA AGCCAGGATC ATCATCGGGC AGAACGTTCC GATTCCCACC
GGCCAATTCA TTCCGCCAGT AGGAGGCGCC GTTACCTCCC CGTTTCAAAC CGTTTCACGC
CAGGACGTGG GACTATCATT GAAGATCAAG CCCCTTATCT CGGAAGGCAA TACTGTCCGT
GTGCAGATTT TTCAGGAAGT CTCGAGCGTC GTTCCTGGCA CGGTCAACGC CACCAACGGG
TTGATTACCA ACAAACGCTC GATAGAATCG ACAGTGCTGG TTGACGACGG GCAGATTCTC
GTGCTCGGCG GTCTGATGCA GGATTCGGTA AATGACTCGG TTGAAAGAAT TCCACTGGTC
GGGGCGATTC CGCTGTTCGG ACAATTGTTC AGTTACAACA AGCGCTCACG CAACAAAACC
AATCTGATGG TATTCCTGCG GCCGACGCTG ATGCGCGCGG GCGACGCCGC CGATCCGCTT
TCTGACGCAC AGTACGATCG GGTGCTGGGC GAACAGAAAA AAGTGAGACC CAAGTTTAAT
CTTGTGCTTC CGGATATGGA ATCGCCTACT TTGCCGCCGC GTCAACCACC TCCTGTCATC
CTTGATGACA GCATCACTCC CGATGATCCC GGAATTTCCA ATGTTCAAGG CAACTGGGAT
ACCGGGGGAG TGATGGATAA TACACCCTGA
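
The GC content reported for this gene can be recomputed from the gene sequence. The sketch below assumes the sequence is pasted as shown above, in whitespace-separated blocks of ten bases; `gc_content` is a hypothetical helper name, and the record does not say how the reported percentage was rounded.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces and line breaks between 10-base blocks) is ignored.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


if __name__ == "__main__":
    # First 10-base block of the gene sequence above, as a small demonstration.
    print(gc_content("ATGGGCTACA"))  # 50.0
```

To check the full gene, pass the complete gene sequence text to `gc_content` and compare the result with the GC content field.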
 
Protein sequence
MGYRGKIVRH RSLCAVFLWM AVAGWVTEAW AVNPSATLPV EEIGVPPSVL PEADSLRASK 
SRSSAGQDLV TLNFVNADIE GVVKAVSEIT RKNFMLDPRV KGTINIVSAK PVPRSSVYEV
FLSALRLHGY AVVEDYGIIR IVPESDAKLY QGPTLGPTNK RQLAGDRIQT QVFTLQYESA
VQMVPILRPL IAPNNSITAN PNSNTLVITD YASNLQRLAK IIDSVDQPSG TEPVSIPLQH
ASAIDVAQTV NRLFSESTQS QAEGAADPTQ QRFTVVADAR SNTLLARSGN RAALARLRQL
VTVLDSPTSA AGNMHVVFLK NADAVRLAET LRAIYHNMAS PVSSSSGLSQ GTGTAFGTSS
LGTSTGGGMG ASSGTSTGGS MGTSMPGSSL GAGTVPAAST VTPAPMQTGA TSATPGIIQA
DAATNSIIIT APDAIYNNLR AVVEKLDVRR VQVYIEALIA EITADRAAEF GIQWQNLSNA
AQGGTQVFGG TNFNAGTAGG GSIISTAQNP IANAASGLTI GIMNGLVTAI PGIGPVLNIH
TLIRALETDA NANILSTPTL LTLNNEEARI IIGQNVPIPT GQFIPPVGGA VTSPFQTVSR
QDVGLSLKIK PLISEGNTVR VQIFQEVSSV VPGTVNATNG LITNKRSIES TVLVDDGQIL
VLGGLMQDSV NDSVERIPLV GAIPLFGQLF SYNKRSRNKT NLMVFLRPTL MRAGDAADPL
SDAQYDRVLG EQKKVRPKFN LVLPDMESPT LPPRQPPPVI LDDSITPDDP GISNVQGNWD
TGGVMDNTP
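
The protein sequence corresponds to a translation of the gene sequence with translation table 11. The following is a minimal sketch of that step, not the database's own pipeline. It relies on the fact that table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons), and it assumes the input is already in the coding orientation. `translate` is a hypothetical helper name.

```python
# Codon table in the NCBI layout: bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored; any base other than A, C, G, T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence above.
    print(translate("ATGGGCTACA GAGGTAAAAT CGTGCGGCAT"))  # MGYRGKIVRH
```

The first ten residues produced this way match the start of the listed protein sequence, and the final codons of the gene (`... ACA CCC TGA`) end in a stop codon after `TP`, consistent with the listed C-terminus.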