Gene Nmul_A1181 details


Gene Information

Locus tag: Nmul_A1181
Symbol:
ID: 3784350
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1360702
End bp: 1362705
Gene Length: 2004 bp
Protein Length: 667 aa
Translation table: 11
GC content: 53%
IMG OID: 637811266
Product: AsmA
Protein accession: YP_411876
Protein GI: 82702310
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2982] Uncharacterized protein involved in outer membrane biogenesis
TIGRFAM ID:
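
The coordinate and length fields of this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon. Both are assumptions about the database's conventions, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
start_bp, end_bp = 1360702, 1362705
length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))
```

With the values from this record the sketch gives 2004 bp and 667 aa, which agrees with the Gene Length and Protein Length fields.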


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.256758
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAGTGC GTACCAAGCA AATATTATCG ATCTCCGGCG TTGCACTGCT GCTGCTTGTC 
ATCGCATTCG TCCTGTGGTT CGACTGGAAT ATGCTGAAAC CATATATCGA ACGGCAGGTT
ACCGAGAGGA CGGGCCGTGA ATTCACGATT CGGGGCGATC TCGATGTAAA CCTTTCGCTC
AATCCCCTCG TCAGCGTTGA AGGACTTTCG CTAGCGAATG CCGAGTGGGG GACCGAACAA
CCCATGGTTG CCGTGGATAA GGTTGCTGTG CGCATCAGTC TATGGGATCT CCTGTCCGGC
GATATCGTGT TGCCCGAGCT ATCCATCACG CGGCCCCGGG TACTCCTGGA AAAAAGCATG
GATGGAAAGC GAAACTGGGA TCTGAAAAAA GAAGAAAAAA AGAAGATGGA ACTGCCTCAA
ATTGGCCAAT TCACCCTGGA TCAGGGAAAA GTTCTCTTCC GCGATCCGAA GACCAAGACA
GATATAGCAG CCGACGTATT CACGGATCCG GCGGTGGACG CCGGAGAATT GCCCCTTCAT
GTGGCAGCGG AAGGTAAATT TACCGACCTG AAATTCACCG CGCAAGCGCA AGGCGGCAAG
ATAATGTCGC TTGCGGATAA GACTATCCCC TATCCGATCA AAGCGAGTGC CGAGATGGGA
ACAACGCGCG CAAGCGCGGA TGGAACCATC AAAGGGCTGG CCGAGATGGC GGAAGTGGAT
CTGAAGCTGG ACTTACACGG AGAGGATTTA TCTGCGCTCT ACCCCGTAAC CGGCATCGTT
ATTTTCCCCT CCCCCCCGTA CCACATTTCC GGAAGGATTT TGCACCACGA TACCGAGTGG
TCCATGAAGG GATTTTCCGG AAATGTAGGC AACAGCGATC TTGGCGGGGA TATCGTATTC
GACACGGGAG GCGAGCGGCC GCTCCTTCGA GGCGACGTGG TATCCAAAGT ACTCGACCTG
AGCGATTTGC AAGGTTTTAT TGGGGCGCGG CGAGGCCCCC AACCACAGGA TACGCCTGCG
GAAAAGAAAG AAAAAAAGGA ATCAATGAAG AAACAGCGGC ATCGTCTTCT ACCAGACCAG
GAATTCCGGA TAGACCGTTT GAAAGCCATG GACGCAGACG TAAGGTTCAC TGGAGAATCG
ATCCGCAACA AGGAACTGCC GGTGAAGCAT ATCGTGAGCC ACTTGAAAAT AGACAACGGG
CTTCTGACCC TCAACCCCGT CAACTTTGCG GTGGCGGGTG GGAATATCAT TTCGAATATC
ACAATCAACA CGCGTCCCGA AGTGCCCAAG GGAGAAATCA AAGTCGACGT CAAGCGCCTG
CAATTGCAAA AGCTCTTTCC TAAGCTCGAA ATCACGAAGA ATAGTGCAGG TGTAATCGGC
GGTGCAATAG ACATTAATAG CCACGGCAAG TCCGTCGGCG CGCTGCTGGC TTCGGCCGAT
GGAAACTTTG GCCTGATCAT GTCCGGCGGC CAGATCAGCA AGCTGTTGCT GGAAGTGATC
GGACTTGATG GGGGGCAGAT CATCAAGCTC CTGTTTGCCG GAGACAAAAA CGTGCCGGTG
CGGTGCGGGG TAATCGACTT TGATATCAAG AAAGGCATCA TGAGCAGCAA AGCTTTCGTC
ATCGACACGA CCGATACCAA AATTGTCGCT AAAGGACAGA TAAGCCTGGC TGAAGAAAAG
ATTGACATGA AGCTGTCTCC CAAAGCCAAG GATGTCAGCA TCCTGAGCCT TCGCACCCCT
ATTCACATAG AAGGCACTTT CAAGGATCCC ACAATCCTTC CCGACAAGAT ACTTGCCATA
CGGGCGGGAG CAGCGGTCGT ACTGGGAGTT CTTGCGACAC CTCTAGCAGC GCTCATCCCG
ACCATTGAAA CCGGACTGGC CAAAGATGCC AATTGCAGGG CATTGATTGC TTCAGTGGAA
ACGCCAGCGA AGCGCGCGGC TGGAGTCAAA GATAAGAAAG ATGAGGATCA TCCCCCGGCG
TCCCAGACGT CCCGGTCAAA GTAA
 
Protein sequence
MRVRTKQILS ISGVALLLLV IAFVLWFDWN MLKPYIERQV TERTGREFTI RGDLDVNLSL 
NPLVSVEGLS LANAEWGTEQ PMVAVDKVAV RISLWDLLSG DIVLPELSIT RPRVLLEKSM
DGKRNWDLKK EEKKKMELPQ IGQFTLDQGK VLFRDPKTKT DIAADVFTDP AVDAGELPLH
VAAEGKFTDL KFTAQAQGGK IMSLADKTIP YPIKASAEMG TTRASADGTI KGLAEMAEVD
LKLDLHGEDL SALYPVTGIV IFPSPPYHIS GRILHHDTEW SMKGFSGNVG NSDLGGDIVF
DTGGERPLLR GDVVSKVLDL SDLQGFIGAR RGPQPQDTPA EKKEKKESMK KQRHRLLPDQ
EFRIDRLKAM DADVRFTGES IRNKELPVKH IVSHLKIDNG LLTLNPVNFA VAGGNIISNI
TINTRPEVPK GEIKVDVKRL QLQKLFPKLE ITKNSAGVIG GAIDINSHGK SVGALLASAD
GNFGLIMSGG QISKLLLEVI GLDGGQIIKL LFAGDKNVPV RCGVIDFDIK KGIMSSKAFV
IDTTDTKIVA KGQISLAEEK IDMKLSPKAK DVSILSLRTP IHIEGTFKDP TILPDKILAI
RAGAAVVLGV LATPLAALIP TIETGLAKDA NCRALIASVE TPAKRAAGVK DKKDEDHPPA
SQTSRSK
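
The sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way to strip that spacing, compute GC content, and translate the gene sequence. It assumes the codon-to-residue assignments of translation table 11, which are the same as those of the standard code, and it does not handle the alternative start codons that table 11 permits (not needed here, since this gene begins with ATG). The helper names `clean`, `gc_percent`, and `translate` are hypothetical, and only the first 60 bases of the gene are used as input.

```python
from itertools import product

BASES = "TCAG"
# Residues for the 64 codons enumerated in TCAG order ("*" marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_block: str) -> str:
    """Remove the display spacing and line breaks, and upper-case the result."""
    return "".join(seq_block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First displayed line of the gene sequence in this record.
first_line = "ATGAGAGTGC GTACCAAGCA AATATTATCG ATCTCCGGCG TTGCACTGCT GCTGCTTGTC"
dna = clean(first_line)
print(translate(dna))
print(round(gc_percent(dna), 1))
```

The translation of these 60 bases is MRVRTKQILSISGVALLLLV, matching the first 20 residues of the protein sequence listed above. Running the same functions over the full gene sequence would be the natural way to compare against the GC content and Protein Length fields.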