Gene Nmul_A1566 details


Gene Information

Locus tag: Nmul_A1566
Symbol:
ID: 3785288
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1800933
End bp: 1804292
Gene Length: 3360 bp
Protein Length: 1119 aa
Translation table: 11
GC content: 57%
IMG OID: 637811654
Product: Outer membrane autotransporter barrel
Protein accession: YP_412261
Protein GI: 82702695
COG category: [S] Function unknown
COG ID: [COG4625] Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain
TIGRFAM ID: [TIGR01414] outer membrane autotransporter barrel domain
            [TIGR02601] autotransporter-associated beta strand repeat
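
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, not stated in the record) and that the CDS carries one terminal stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1800933
end_bp = 1804292
protein_length_aa = 1119

# Assumption: coordinates are 1-based and inclusive,
# so the span length is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A complete CDS has one codon per residue plus one stop codon.
expected_cds_bp = (protein_length_aa + 1) * 3

print(gene_length_bp)   # 3360
print(expected_cds_bp)  # 3360
```

Both values agree with the listed gene length of 3360 bp.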


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGTCCT TCTTTTCCTG GGGTATGGAC CAGCCGGATG TCCGCATTGC TTCAAGTACC 
GGCTGGAGCG TTTATTCGAT TCGCATCAGG CAGCGGAGCA GATTCAGGAG AATGAGCTTC
AGGGCAGGCA CATCATCTAT TAAGGGCATT ATGAAAAAAA AACCACGCTT CACATTATTC
GATTTGCAGC TCGTGATTCT AGGCTCTGCG TTATCAGCCG TTTCTCCAAA ACCATACGCG
CAGCTGATGC TGCCATGGGA TCCAGCGAGC AATCTCGTAT TTAATGGTTC CCTCGGCAGT
GGCGAGGTTG TCGAACTCAC CAGTGTTACC GGCGATTTTA GCGGCAGCCT CGGTACAGGT
CCGGGACAAG TGCAATGGAC CGGCAGTGGA GGGTTTCTTT CGAATGGCTC GAATCGTATT
GTCAATATCG GTGGCAGCGG CGGCACTCTG ACCTGGGGGA GCGGAAATTT TGTCCCCAGC
GGAGAGGCCC TCATTCTCGG CACATCCAGC GCCAACATGC TGGATTTCCA GAATGGAATC
GACCTCGGTG GGGTAATCCC TACCATTCAG GTCCAGGGGG GCTCCATTGA GGGACATGCG
CGGATTAACG GCACTTTGTC CGGCACGGGA GGGCTTCAGG TGATTGGTAC CGGCAACGAC
ATGCTGGAGC TGACAGGGGC CAACACATAC TTGGGTGAGA CCTTCGTCAA TTCCAGTACA
CTGGTTGTAA GTGCCGACAA CAACATGGGC GCTTCCATGG GGTGGCTTAG ACTCTCTAAT
GGCACGCACC GAAATACCGC AAGCTTTACC ATGACTCGTG GTGTTTTACT GGACTTTGGT
GGGGGTACAT TCCATACGGA TGCGAATCTG GAAGTGGCTG GCCCCATCCT CAGCATGTTC
GGTATGGGTA ATCTCACCAA AACCGGTTCC GCCCAGCTGA TCCTCAGCGG TAATAATTTC
TACTTTGGCG ACACCGTGAT CAGCGCCGGA ACACTCCAGG TTGGCAACGG TGGCATCACG
GGCAGCATCA CCGGGAACGT GATCAACAAT GGCACGCTCG CGTTCAATCG TTCTGATGAC
ACGAGCTATG GCGGCGTGGT TTCCGGGACT GGGGGATTAA TCAAGCTGGG TCCGGGAAGA
CTGACGTTGA CTGGTGAAAA CACGTATATC GGCGGTACGG CAATTGCCGC CGGTGCCCTG
CAAATCGGCA ACGGCGGCAC CACGGGCAGT ATCGCCGGGA ACGTGACCAA TAACGGCACG
CTCGCGTTCA ACCGCTCCAA TGACATGAGC TATGGCGGCG TGGTCTCCGG AACTGGCGCG
CTGAACAAGC TGGGCGCGGG AAGGCTGACA TTGACCGGGG AAAATACCTA TACCGGCAGC
ACGACAATTG ATGCCGGCGC TCTGCAAATC GGCAATGGCG GCATCACCGG CAGTATCGCC
GGGAACGTGA CCAATAACGG CACGCTCGCG TTCAACCGCT CCAATGACAT GAGCTATGGC
GGCGTGGTCT CCGGAACTGG CGCGCTGAAC AAGCTGGGCG CGGGAAGGCT GACATTGACC
GGGGAAAACA CTTATACCGG CGGCACGACA ATTGATGCCG GCGCTCTGCA AATCGGCAAC
GGCGGCACCA TCGGCAGTAT CGTCGGAGAT GTGAATAATT TCGGCGGCAT CCTCGAGTTC
AACCGTTCGA ATAACCTGGG CTATTCAGGG GCAATTTCGG GTTTTGGCAC GATCGTCAAA
GATGGGGCAG GGACACTGGA ACTGACAGGG AATTCAGGAG GGTTTGTGGG TTCGATTCTC
GTCAATTCCG GCACGCTCGC CGTCAATGGC ATCCTGGGTG GGGTGGTGAA TGTGGATGCG
GATGCCCGCC TTCAGGGCAA CGGGATGATC GGTACGGGAA TTGTTGCCGG CACTGTTGCT
CCAGGAAATT CCATTGGCAC GCTTGGGGTA AGCGGCAATT ACACGCAGCT TCCCGGCTCC
GTCTATGAGG TTGAAATCGA CCCTGCCGGA AACAGCGATC GGATCGCTGT CGCCGGCACG
GCGAATCTCC ATGGCGGCAC GGTTGCGGTC ACTCCAGGGG TGGGTACCTA TTCCGCCAGC
ACGCGCTACA CCATCCTGAC GGCGGCAGGT GGGCGGACAG GAGCATTCGA TGCGCTCACC
CTTACCCGGA CACTGCCCTT TCTGGATGTG GGATTGAGCT ACGACCCGAA CAACGTTTAC
CTCGATGCCA ACCCCCTCGC ATTTTGCGCC GTCACTGTTA CAGCCAATCA GTGTGCGGCG
GGCCAGAGTG TGGAAAGCCT GGGCTCCGGG CATTCACTGT ACGACACGAT TACCGGTCTT
CCTGATCGGG ATGCCGCGAG ACGGGCTTTC GATAGCCTTT CAGGAGAACT TCATGCCAGC
GCAAAGGGCG TCATGATCGA GGATAGCCGT TTCCTGCGCG AAGCGGTCAA TGATCGCTTG
CGTCAGTCCT TTAGCTGGCC GGGCGTGACT GTTTCCCGGG CATCGGGGCA AACCCTGCAA
GAAAACAGAG CAACTGGCCA CGCATTCTGG ACCCGTACAT TCGGTTCTTT CGGACACCGT
AGCGGAGATA TGAATGCGGC GCGCATCAAT CGGAACATCG GGGGTTTTTT CATGGGAGGC
GATACTCTTG TTGCCGACCT CCTGCGTCTG GGGATTGCCG GGGGCTACAG CAATTCGTCC
TTTAATGTGA ACGAACGCTT TTCAACAGGT TCAAGCGACA ACTATCACGT AACGGCTTAT
GGCGGCACGC GTTGGCATGC CCTCGGTCTG CGTTTTGGCG GCGGCTATAC CTGGCATGAC
TTCGAAACGA ACCGCAATAT CCTTTTCCCG GGTTTCAACA ACCAGGCGAA AGGGGATTAT
CGCGGCCGCA CCGGCCAAGT GTTTGGCGAG CTGGGCTACG AACTGCCGTT CAAGAACGTT
TCGCTGGAGC CCTTCGCGGG GGTGGCATAT GTCAATCTCG CCACCAAGGG TTTCCAGGAA
CGGGGAGGCA TTGCAGCATT GAGCAGTTCC AGGGATAACG AAGGCGTTAC CTACAGTACG
GTGGGCATGA ATGCGGCGAG CATGTTTTCC ACGGCAGGGG GGACGACCAC AAGGCTGCGG
GGCAGTCTGG GATGGCGTCA TGCCTTTGAC AGCGTTCCAA CCCGCTCGTC CGTGGCTTTC
AGTGGTGGTT CGGCATTTGG CATTGCAGGC ACTCCCATCG CGAAAGACGC AATGATCATA
GGGGCGGGCC TGGACGCAAG CGTTGGCAAG AACGCGATCC TGGGCATTGC CTACACCGGA
CAAGTATTTA GCAACGTCGT GGATAACGGT GTCCGGGCAA ATCTCGACTG GAGGTTTTAG
 
Protein sequence
MQSFFSWGMD QPDVRIASST GWSVYSIRIR QRSRFRRMSF RAGTSSIKGI MKKKPRFTLF 
DLQLVILGSA LSAVSPKPYA QLMLPWDPAS NLVFNGSLGS GEVVELTSVT GDFSGSLGTG
PGQVQWTGSG GFLSNGSNRI VNIGGSGGTL TWGSGNFVPS GEALILGTSS ANMLDFQNGI
DLGGVIPTIQ VQGGSIEGHA RINGTLSGTG GLQVIGTGND MLELTGANTY LGETFVNSST
LVVSADNNMG ASMGWLRLSN GTHRNTASFT MTRGVLLDFG GGTFHTDANL EVAGPILSMF
GMGNLTKTGS AQLILSGNNF YFGDTVISAG TLQVGNGGIT GSITGNVINN GTLAFNRSDD
TSYGGVVSGT GGLIKLGPGR LTLTGENTYI GGTAIAAGAL QIGNGGTTGS IAGNVTNNGT
LAFNRSNDMS YGGVVSGTGA LNKLGAGRLT LTGENTYTGS TTIDAGALQI GNGGITGSIA
GNVTNNGTLA FNRSNDMSYG GVVSGTGALN KLGAGRLTLT GENTYTGGTT IDAGALQIGN
GGTIGSIVGD VNNFGGILEF NRSNNLGYSG AISGFGTIVK DGAGTLELTG NSGGFVGSIL
VNSGTLAVNG ILGGVVNVDA DARLQGNGMI GTGIVAGTVA PGNSIGTLGV SGNYTQLPGS
VYEVEIDPAG NSDRIAVAGT ANLHGGTVAV TPGVGTYSAS TRYTILTAAG GRTGAFDALT
LTRTLPFLDV GLSYDPNNVY LDANPLAFCA VTVTANQCAA GQSVESLGSG HSLYDTITGL
PDRDAARRAF DSLSGELHAS AKGVMIEDSR FLREAVNDRL RQSFSWPGVT VSRASGQTLQ
ENRATGHAFW TRTFGSFGHR SGDMNAARIN RNIGGFFMGG DTLVADLLRL GIAGGYSNSS
FNVNERFSTG SSDNYHVTAY GGTRWHALGL RFGGGYTWHD FETNRNILFP GFNNQAKGDY
RGRTGQVFGE LGYELPFKNV SLEPFAGVAY VNLATKGFQE RGGIAALSSS RDNEGVTYST
VGMNAASMFS TAGGTTTRLR GSLGWRHAFD SVPTRSSVAF SGGSAFGIAG TPIAKDAMII
GAGLDASVGK NAILGIAYTG QVFSNVVDNG VRANLDWRF
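
The gene and protein sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how one might strip the block formatting, compute a GC fraction, and translate the DNA. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard one, whose amino acid assignments translation table 11 shares (the two differ only in permitted start codons). Only the first line of the gene sequence is used here as a check.

```python
from itertools import product

# Standard codon assignments in NCBI order (T, C, A, G).
# Translation table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as printed above.
first_line = "ATGCAGTCCT TCTTTTCCTG GGGTATGGAC CAGCCGGATG TCCGCATTGC TTCAAGTACC"
dna = clean_sequence(first_line)
print(translate(dna))  # MQSFFSWGMDQPDVRIASST
```

The translated fragment matches the first twenty residues of the listed protein sequence. Applying the same functions to the full gene sequence would be the natural way to compare against the listed GC content and protein length.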