Gene Nmul_A1373 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1373 
Symbol 
ID3784468 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1559121 
End bp1565051 
Gene Length5931 bp 
Protein Length1976 aa 
Translation table11 
GC content59% 
IMG OID637811461 
Productintegrins alpha chain 
Protein accessionYP_412068 
Protein GI82702502 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3209] Rhs family protein 
TIGRFAM ID[TIGR01643] YD repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCCAATA AATCTCCGGT CGAGCGCCAG GCTACCTCGA ATCCCGCCGG TGGAGGCGAC 
ATCCGGGGTC TGGGCGAGAC CTTCCAGCCT GATTTGAATT CGGGGGTCGG CAACTATCGG
GTTCCCCTCG ATCTGTTGCC CGGCATCCGT AATTTCCAGC CGAGCCTCGC CCTCGGCTAC
AGCACCGGCG CGGGCAACGG CCCCTGGGGC CTCGGCTGGT CCCTGCTTTT TGCCACTATA
AGCCGCCGCA CTTTTCGGGG CACTCCGTCC TATACGGAAG ACGACACATT CCTGTTCGCC
GGGCAGACAG AACTTGTGCC CATCGGTTCC GACACTTTCC GGGCTTCGGT CGAGAATGGG
TTCGAACAGT TCAGCCGGAC GGAAACCGGA TGGCAGGTTA CAGAGAAGTC CGGCATACGC
CATACCTTCG GCACGACCGC TGAAAGCCGC ATCGAGTTTG ACGATAACGG GACCCCGCGC
GTGTTCGCAT GGCTGGTCGA GCGTACTGAA GACACCACGG GCAATGCCAT TACCTACCGG
TATCGCAGTG AACAGGGGCA GCGGTATATC GAGGAGATCC GCTACGCCAT CTATGCCATC
CGATTCGAAT ACGAGGTCAG GCCCGACGCG TTCTCCGATT TCCGCAGCGG CTTCGAACAG
CGCACTACGT TGCGTTGCAG GCGGATCACG TTCCATGTCG AGCCCCTGGG GCCCGCCGCT
GTGCGAAGCT GGTCACTCGC ATATGACCTG GCGCCGTTAT CCGGGCTGTC CCTTCTGCGT
TCAGTCACCT TCTCCGGCAA TGATCCGGAA ACGGGAGAAT CGGCATCGCT TCCGCCACTT
CAGTTTTCCT ATACGGAGTT TACCCCCTCG AGCCGCCGCC TGAGGAAGTT CACGGCTGAG
ATCGGCGCCG CGCCTCCCGG TCTCGACCAG CCCAACCTGG AATTGATCGA CCTGGACGGA
CGGGGCCTGC CTGGCATTCT GTCGATCGAA AACGGCATCG CGCGATACTG GGCCAACACC
GGCCAACTCA GGTGGGCACC GCCCCGTTCC TTGCCTCAAT TCCCGGCTGC GTTCTCCACG
GCGGAAGACC GCGTGCAGTT TGCCGATATG GACGGAAACG GCGCAGCCGA CGTGCTTGTT
GGCAGCGGGA CGCCCAGCGG CTTCTATCGC AACGGGGGAA CCGGTGCTTT CGATGGATTC
AACCGATACG TCCGTTCACC GCAGCTTGCC TTTGAAACAG GCGGACTGCA GTTGATGGAC
GTCAATGGCG ATGGGCTCGT CGATGCAGTA CATGACGGGC GTACCGGTTT GCAGGTTTAC
AAAAATCGCG GTGCGGCAGG CTGGGAACCG CCGCGACTGG TGCCGAAGGA CGGCCTGCCG
ACACCTGTTC CCAATCTTTC CTCCGGTGAT CCTCGCGTGC GGCTGGCCGA CATGAACGGA
GACAAGATAC CGGATGTAGT GCTGTTCAGC AGCGGCTATT TCGAATACTG GCCTGCGCGC
GGCGAAGGCC ATTTCTCCGA ATCGCGCAAA ATGACTACCG CCCCGAGATT CCCTTATCCG
TTCGATCCGG CCCGGGTGTT CATCACCGAT ATCAACGGAG ATGGCCTGGC TGATGTCGTT
TTTGTCGGTT ACGACGAGAT AACCTACTGG ATCAACCAAT CCGGAAATGC GTTCAGTCCT
CCACAGACCA TTCGTTACAC CCCCCCCACT TCCCGGGCCG ATACGCTGCG GGTCGCGGAC
ATGAGCGGAA CCGGAACGGC TGGGATTTTA TGGACCGGTC CATATCCTGA CTACCGCTAC
CTCGACTTTA CCGGTGATGT GAAACCGGGC CTGCTTAGCC GCATCGACAA CGGCCTGGGC
CGCGTTACGA CCATTGAATA TGCGACCTCG ACCGCTGAGG CAAGGCGCGA TCAGGAGCAG
GGGCAATCGT GGATGTCGTT TCTTCCGTTT CCCGTACAGG TTGTATCCGC CGTTACCATC
GACGACGCGA TCAGCGGCCT GCACAACGTA ACCCGTTACC GGTATCACGA GGGGCACTAC
GACGGATACC GACGCGAATT CGACGGATTC GGCCGCTCGG AACAGATCGA GGAAGGGGAT
GCCTCGATTC CTTCCTGTCG AACCGTTTTT CACTATCATA CCGAAGCTTC CGGGAGGCCT
TCCGAGCCGG ATCCGGAGCT GCGCCGCAGC CTGAAAAGGA AGCTGTTCCG AGTGGAGGTT
TTCGGCGAAG ATGCAACTCC ATCGGCCTCC GCTCCCTACC GGGTCGAAGA AAGCCGGTGG
ATAGTGCGAA TCGAGCAGAC GCTGCCCGAT GACCGTCGCG TGCTGTTTCC TTTCGTCGAA
GAGACGGTCA CATCGACATT CGAGCGCGAG TTCGAAGCCC GCATCGAACG GAGGATGTTC
TCGTTCGACG CTTCCGGAAA CGTCACAGGC GAGGAGCGGA GGGGAGAAGG AGGAGAGGGA
GCTCCGCTGA TCGCGACGAC GCGGGCCGAG TACGCACAGG ACCCGACCGG CCGCGTCCGC
GATCATGTGT CGCGTATCGT CCAGCGCGAC GGGGACGGCA ATCTGCTGAA CGAGTTGCGG
CATTATTATG ACGGCCCCGC CTTCGAGGGT CTGCCCCTTG GCCAGGTTAC GAAAGGATTG
CTCACCCGGA CGGAGAAGGT TGCCCTGCAT AAAAGCACGG CCACCTCCGT GTATGGTGCT
CATGAACCGG ATTTTCCGGC ACTGGGATAT CATGATGGAG AGGACCAGGA TGGCGAACCT
GCATGGCTGT ATGACCATAA ACGGCTCAGT CTCGATGAGC GCGGCAATGT CCTGGTCAGC
CGGGATGCAC GCGGGAACGA CACGACCTAC ACCTTTGATG AATTCGGGTT GTTTGCGACT
TCGATCACCG ACGCGAAAGG TTTCATCGCA ACCGTCGAGC ATGATCTTCG CATTGCAAGA
CCCAGACGCA TGGTGAACCC CAACGGCGGA GTAACCGAGT CGCGCTATGA TCCTCTGGCA
CGCATTGCCA TAGTCGCCGC GCCCGGAGAC ACGCTTGCAA TCCCCAGCGT AACCTATTCG
TACGAAACTT CAGCACTGCC GGCACTGCGC AACACGCGTT ATCGCGTGAT TTCCGGAGAT
TCCCGCACCA TCGCGATCAA TGAGTATCTC GATGGAGGCG GCGCGTTATA TCAGCGCCGC
ACAGAGCATG ACGGCGCCAG GGTTTCGGTG AGCGGGCAGG CTGTGACCAA TGTACGGGGG
AAAATGGCGG AAAAACTAGA GTCGTTCTAT GCCGAAGGGT TGGAGTTTGA GCCATATCCC
GCTGACCCTT CCGCACCGCG ACGGCACTTT TTCTTCGATG CACTCGGTCG GGCAATCCGT
ACAGTGCACC CCGACGGGGG TGAATCGAGC GCGGAATACC GGCCATTCGA TACGCTTTTC
TCCGATGGCG GAGATAACGA TCCCGGGGCG GAAGTTTCGT TTGGAACGCC CCGGCGCGAG
CTCTACGATT CCTGGTACCG CTTGATCGGA GTGGAAGAGC GCGACCAAAG TACGATCCGC
ACGACCCGCT ATGAGCTCGA TGGTGCAGGA CGCTTGCGGC AGATCACCGA TCCCCGTGGC
ATTATCCTCA GCCGTATCAC CCGCGATCTC ACCGGGCAGG AGATCAGGAT CGAACATGTG
GACGCGGGCA ACCGCCTGCT GGCCTGTGAT GCCTCCGGAA ATCTCGCGCT TCAGGTGGAT
GCCGCAGACG AGTCGATCAA CCGGACTTAC GATGAATTGA ACCGCATCAC TGCCACCGCG
TATGGAGGAG GGGAGCCGGA GCAGTATTTC TACGATGCAG GGAGCGGCGC AAATCTGACA
GGCCGGTTGG CGCGAACCGT TGCTCCCGCC GGTGAAACCA CTTACAGCTA CGATGAGCGC
GGCAATGTCA TTCTGCGTTC CAGGCTGGCG CCCGGTGAGA CCGACGCTCT TGCGTTGAAT
TACACATTCG ACAGGATCGG CCGTATGACT CGCTTGAGCT ATCCCGACGG AGCCGTTGTC
GATTTCGAAT ACTATTCTGG ATTCCTGCTG CGGCGCATTC CCGGTGTAGT CGATACTATC
GAATATACGG CCGCAGGGGT GCGAACCGCA CTCCAGTATG AGAATGGTGT GCGGACGGAA
TACGACTACG ACCCCGTTTC CCTGAGGCTG CGGGAGCTTC GCACTCATCG GCTGGAGACG
GGCGCGGTCT ATTTTCATAC CCGGTATGAC ATCGACAAGG CGGGGAATGT AGCCGCGATC
GAGGATTTGC GGGCTGCAGC AGCCGGATTC GTGCGGACGC AAAGCTTCGG GTATGATGCA
TTCAACAATC TTCTGGAGGC TTCCAGTCCG GATCCGGGCG CCGGTTATTC GCACGTCTAT
GAATATGACC AGGCAGCGAA CTTCCTTCGC AATCCCCTGA TCAGTTCCAA TCCGCTCTTC
TACGAGAATG GCGGCAACAG TAACCGCCTG AGCGGCCACA ATGATGGCAT CAATGCCGTT
ACCCTGTTCG GGTACGATGC GAACGGCAAC GTCATTTCCA TGCCCGGACG CACCCTGACC
TTCGATGCCA AGCAGCAACT CTCGCGGGTG CAGGTTGCGG GGGGAACGGA CGTGGTTTTC
TTCTATGACC ACAAGGGCGT GCTGAGCAGG CGCGAAGCAA CCGCGGGGGG CTTGACCGAA
ACGACGCATT ACGTGGATAA CCTGTTCGAG TCGAAGGATG GCACCACGCA GCGCTGGATA
CTCGCAGGCG ACCTCCCGGT CGCCTGTGTT GCAGATGGAA TGATGGTTTT CATACATAAC
GATCACTTGG GCAGTGCGGT CATTTATACC GATGCGGGTG GCAATCTTCT GACGGAAACC
GCTTTTCATC CATTCGGGAG CGTGCTCGTG GCTCCCTCCG GCGCACTTCC GCCGGCTTTC
GCAACCAAAA AACTCGATGC GGACATAGGA CTGTACTACT TCAATGCGCG CTGGTACTCG
CCGGTAATGG GACGCTTCAT TTCGCCCGAT CCGCTCTATC TTTATCAGCC GGAACAGGGC
TTGCAGGAGC CAAAGCGCTT GAACCCATAC GCGTACGCGG GCAACAATCC GGTTCGCTAC
GTCGATCCGA GCGGATTGGG CTTCTGGGAC GTGCTGGGCG CCATTGTCAT CGCCATTGCC
GTCGTGGTGG CCGTGGTTGC CGTTTCCGTG CTGACTTTCG GAGTAGGCAC CGCAATCGGC
TTTGGCACCC TCCTGGCTTA CGCGGCGGTA GCAGGACTCG CAGGGGCGGC CATCGGGGCT
GTCGTAGGGG GCATCGCCTA TGGAAGCTGG GAAGGTGCAC TGCGCGGTGC GTTGATCGGT
TTTACAGCGG GCGCCAACGC AATGATCGGC GGCATGATCT TCGGTCCGAT CATCGGTGCG
GCACTGGGTA TCATCACCTT CCTGGCGGTG ATCCCACCGG TAGCGAAGAG CGATGTCTAC
CAGGGCATAC TTGGCTGGAC CAGCTACCTC ATGCCCATGA GCTGGCCGGG CCATGCCATC
GGCCTCGTCT TGTTCGCCCT GAATGTCGTC GGTTATCTCG TCACCTTCGG GCAGGTGGAC
AAGCTGCGAA TCCGTGACAT GCAGGTTGAC TGGAAAACAG GCAACATTTT CACGGTGGGC
GGATGGGTGG GTCAACTGGA TGGCAGGGCT TTCAATTTCG GCGGTTTCAG CTTCGTGAAT
ACTGCGCGCT ATGTGGGAGG AGAGATTATT CCGGCTACGT TCGAGCATGA ATCCGGGCAC
ATGTTGAGCA ACGCTGCGTT TGGATTCTTC CAGGCGACCC GCGTGTTCGA AGGAAACGGA
CTGGACAGTT TCTGGGAGCG CATTGCCGAA AGCAATGTGC CCCCTGGCCT GCGTGCCACT
GATCCTGTAA CGCCGGAACC GGATCGTCCT AAAATTCCGC AGTGGGGTTA G
 
Protein sequence
MANKSPVERQ ATSNPAGGGD IRGLGETFQP DLNSGVGNYR VPLDLLPGIR NFQPSLALGY 
STGAGNGPWG LGWSLLFATI SRRTFRGTPS YTEDDTFLFA GQTELVPIGS DTFRASVENG
FEQFSRTETG WQVTEKSGIR HTFGTTAESR IEFDDNGTPR VFAWLVERTE DTTGNAITYR
YRSEQGQRYI EEIRYAIYAI RFEYEVRPDA FSDFRSGFEQ RTTLRCRRIT FHVEPLGPAA
VRSWSLAYDL APLSGLSLLR SVTFSGNDPE TGESASLPPL QFSYTEFTPS SRRLRKFTAE
IGAAPPGLDQ PNLELIDLDG RGLPGILSIE NGIARYWANT GQLRWAPPRS LPQFPAAFST
AEDRVQFADM DGNGAADVLV GSGTPSGFYR NGGTGAFDGF NRYVRSPQLA FETGGLQLMD
VNGDGLVDAV HDGRTGLQVY KNRGAAGWEP PRLVPKDGLP TPVPNLSSGD PRVRLADMNG
DKIPDVVLFS SGYFEYWPAR GEGHFSESRK MTTAPRFPYP FDPARVFITD INGDGLADVV
FVGYDEITYW INQSGNAFSP PQTIRYTPPT SRADTLRVAD MSGTGTAGIL WTGPYPDYRY
LDFTGDVKPG LLSRIDNGLG RVTTIEYATS TAEARRDQEQ GQSWMSFLPF PVQVVSAVTI
DDAISGLHNV TRYRYHEGHY DGYRREFDGF GRSEQIEEGD ASIPSCRTVF HYHTEASGRP
SEPDPELRRS LKRKLFRVEV FGEDATPSAS APYRVEESRW IVRIEQTLPD DRRVLFPFVE
ETVTSTFERE FEARIERRMF SFDASGNVTG EERRGEGGEG APLIATTRAE YAQDPTGRVR
DHVSRIVQRD GDGNLLNELR HYYDGPAFEG LPLGQVTKGL LTRTEKVALH KSTATSVYGA
HEPDFPALGY HDGEDQDGEP AWLYDHKRLS LDERGNVLVS RDARGNDTTY TFDEFGLFAT
SITDAKGFIA TVEHDLRIAR PRRMVNPNGG VTESRYDPLA RIAIVAAPGD TLAIPSVTYS
YETSALPALR NTRYRVISGD SRTIAINEYL DGGGALYQRR TEHDGARVSV SGQAVTNVRG
KMAEKLESFY AEGLEFEPYP ADPSAPRRHF FFDALGRAIR TVHPDGGESS AEYRPFDTLF
SDGGDNDPGA EVSFGTPRRE LYDSWYRLIG VEERDQSTIR TTRYELDGAG RLRQITDPRG
IILSRITRDL TGQEIRIEHV DAGNRLLACD ASGNLALQVD AADESINRTY DELNRITATA
YGGGEPEQYF YDAGSGANLT GRLARTVAPA GETTYSYDER GNVILRSRLA PGETDALALN
YTFDRIGRMT RLSYPDGAVV DFEYYSGFLL RRIPGVVDTI EYTAAGVRTA LQYENGVRTE
YDYDPVSLRL RELRTHRLET GAVYFHTRYD IDKAGNVAAI EDLRAAAAGF VRTQSFGYDA
FNNLLEASSP DPGAGYSHVY EYDQAANFLR NPLISSNPLF YENGGNSNRL SGHNDGINAV
TLFGYDANGN VISMPGRTLT FDAKQQLSRV QVAGGTDVVF FYDHKGVLSR REATAGGLTE
TTHYVDNLFE SKDGTTQRWI LAGDLPVACV ADGMMVFIHN DHLGSAVIYT DAGGNLLTET
AFHPFGSVLV APSGALPPAF ATKKLDADIG LYYFNARWYS PVMGRFISPD PLYLYQPEQG
LQEPKRLNPY AYAGNNPVRY VDPSGLGFWD VLGAIVIAIA VVVAVVAVSV LTFGVGTAIG
FGTLLAYAAV AGLAGAAIGA VVGGIAYGSW EGALRGALIG FTAGANAMIG GMIFGPIIGA
ALGIITFLAV IPPVAKSDVY QGILGWTSYL MPMSWPGHAI GLVLFALNVV GYLVTFGQVD
KLRIRDMQVD WKTGNIFTVG GWVGQLDGRA FNFGGFSFVN TARYVGGEII PATFEHESGH
MLSNAAFGFF QATRVFEGNG LDSFWERIAE SNVPPGLRAT DPVTPEPDRP KIPQWG