Gene Nmul_A2679 details

Gene Information

Locus tag: Nmul_A2679
Symbol:
ID: 3785041
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 3077023
End bp: 3080220
Gene Length: 3198 bp
Protein Length: 1065 aa
Translation table: 11
GC content: 53%
IMG OID: 637812769
Product: hypothetical protein
Protein accession: YP_413358
Protein GI: 82703792
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2982] Uncharacterized protein involved in outer membrane biogenesis
TIGRFAM ID:
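
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (an assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(3077023, 3080220)
print(length)                     # 3198
print(protein_length_aa(length))  # 1065
```

Both results agree with the Gene Length and Protein Length fields of the record.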


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTTCCC TCACAACGTT CTTGCATCGC TTCATCCGCC ATCGCCGTTT CCTGATCGGT 
CTGAGCGTTT TCGCCGGGAT AATTATCCTG TTTGGTCTGC TCAGCTATTT CTGGTTGCCC
GGATATGCCA AGGCAAAGCT GGAAGAAGCT TTATCGGAGG TTCTGCACCG CCCCGTTACT
GTACAGTCGA TCGACATTCA GCCCTATACG CTGGAATTGA CAATTCGTGG TTTTCGAATA
GGCGAAAAAG AGACAAGCGT AGACGCAGAC AAGAATCTTT TTTCCTTTGA TGAACTTTAC
CTGGACCTGA GTATTGCCTC TATTGCCCAT CGTGCGCCCG TGATCAGTTC GGTTTCGCTC
AAAGGGCCGG CTATGCGTCT CGTTCGCGAA GCGCAGCAGC AGTTCAATAT CACCGATCTG
ATAGAAGACT TCATGAAACG GCCGAGAGAA GAGGAGGAAG GCGGCAAGAG CATGTTCTCG
GTACGGAATA TTGTTATTGA AGACGGCCGC TTCGAATTCG TCGACCATGT GAAAAAAAGT
CAGCAGGAGA TATCCGACAT TCAAATCGGT GTGCCTTTCA TCGCCAATTT CGAAAGTGCG
GAGGAAACAT GGGTAGAACC GCGCTTCAGC GCCAAGGTGA ACGGTGCCCC GTTCAATCTC
ACCGGGAAGC TGCGGCCTTT CACCACAAAC CGTGAAGCGA CCCTTGAGAT CAGGTTGAGC
GATGTCGATC TGACGCGCAT AGATGAGTAT TCCCCCATTC CAGTCGGTAT TCACCTTCTC
TCCGGGTATG TTGACAGCGA ACTCTTGTTG ACGTTTACCC AGGTCGACGG CCAATCGCCT
GCCCTGGTTC TCACAGGCAA TGCCGCCTTA AAGAAGCTGA AAGTCGAGAA TCATTCGGTG
GAGGCGCCCT ATGCTGCCAC ACTGGGCGAA TTCAACGTCC AATTGACTGA AATAAACCTG
AATGCAGTGA AGTCTTCGCG AGCTGCCATA GTCCTGACGG AAATTGCGGT GACGCCGGAG
GGGAGCGCTG AGCCCACACT CAGTCTGCCG AAGCTTGCCC TGGAGGAGAT CGTGGTCGAC
ATCCGCCAGA AGAATGTTAG CCTCCGCGCT GCAACTCTGG ATCGCTTCAA TGTCTCATTG
CGCCGCCATA AAGATGGCCG GCTCGATCTT GCCAAGCTTT TTACCCCGAT TACCCCGATA
AACAGGGCGG AAAAGGCATC GCAATCCTCT ACCCCTCCTT CTCCCAAAGC GGATTCAGGC
AAGCCATGGA CGGTAAAACT TGGAAGCTTC AAGCTGGCGG ATGCCGCGCT GCGCTTTGAA
GATGCCACTT TGCCGGACGT AGCTCCCATG GTGGTCAATA CCCTGGATCT GGCGGTCAAT
CAGATCGACT TCAGCGGCGC AACACCCTCC CAGCTCGAGC TCAAGGCAGA AGTAAACAAA
ACAGGCAGCC TGGAAACAAA AGGTAGCCTT GCCTGGGCAC CGGTCGCCGC GGACCTCGCA
GTCAATGCCA AGGATGTCGA TCTTGTTGCA TTGCAAGGCT GGGCTGGAGA CCGGTTGAAT
GTGCTGTTCT CGCGCGGCGC ATTATCCTTC CGCGGCAAGA TCAAAGCGGA TGGGGGCAAG
GGGCAGGAGC GACCTGAACG GGCACGGGGA GAAAAGGCAT CGCCGCTAAA AGTGGCGGTG
CGCGGCGATG GGAGGCTCAC TAATTTCAAC ATGCTGGACA AGGCGGACGC CACCAACCTC
ATGCGTTGGC GAAGTGTCGA TATCAAGGGT ATCGAGTTTG CCAATGAACC TCTCAGCATC
AATGCCGCTG CGATCACGCT TACTGACTTT TTTGCCCATG TCGTCATTAC TCCTCAAGGC
GAGTTGAATC TCAAATACAT CGTGCGGCAG GAGGAAACAG CAGTTTCTCC CTCCCAGGCT
GCTGCTGCGC CGGCAACCGA GCCGGAACCG GCGCGCGCAT CTCCTGAAAT ATCGCCAGCT
GCGCCCGTTC CTTCACAGCC CCCTCCGCGC AAGAATCTAC CCATCAGCAT AGGTCGTATC
GTGATGCAGG GAGGGCACGT CAATTTTCAG GACCAGTTCA TTCAGCCCAA CTATCGCGCC
TACTTGACCG ATCTGGCGGG ACGCATAGGA CCGCTCAATC CCCGGAAAAC CGGGGAAGTG
GATATTCGGG GCGCGGTAGA TAAAACGGCG CCATTGAAAA TCAGCGGGTC GGTCGATGCC
TTCGGCAGAG AACTCAACCT TGATATTACA GCCTCGGCCA AAGGTATCGA TATGCCTACG
TTCAGCCCCT ATTCCGGGAA ATATATCGGA TACGCAATCG AGAAAGGCAA ACTATCGGTC
GACGTCCACT ATCATGTTGA ACACGGCGAA TTAACAGCGG AAAACAGCAT CTTCCTGGAT
CAATTGACTT TCGGTGAGAA AATTGAAAGT CCGGATGCAG TCTCGATTCC GGTCAACCTC
GCATTGGCGC TGCTAAAAAA CCGACGCGGA GAAATAGATA TTCGTCTCCC TATAAGCGGT
TCCATCAACG ATCCGAAATT CAGCCTGGGA GGCATAATTG TCAAAGCAAT CCTCAATTTG
CTGACCAAAG CAGCCACGGC ACCTTTCACG GTGCTCGGAT CGCTTTTCGG GGGAGAAGAG
CTATCGGAGA TCAATTTTAC CGCGGGCGAG GTAAAGATTC CGCCAGAGGC AGAAGAAAGA
CTCCAGAAGT TAGCCCAGGC ACTGACAGAT CGGCCAGCGC TCAATCTCGA GGTCACCGGT
CATGCCGACG CTTCGATTGA TCCGGAGCCT TTGAAACGGC GTGTACTTGA ACGCAAAATC
AAGGCGGAGA AATTGTCAGC GGATATCAAA AAAGGAAAAT CTTATGATTC TCTGGAAGAT
GTAACAGTGA CACCGGAGGA ATATGAAAAA TACCTGGAGC AGGTTTACAA GGATGCCAAG
TTCGAGAAGC CGAAAAATTT CATCGGCCTG TCAAAAAGCC TGCCTGCGGA AGAGATGGAG
AAGCTGATGC TTGCCAATAT CGACGCGGGG GATGCAGAAC TACAGGAGCT TGCCGAAAGC
CGGGCGGTGA GTGCACGGGA CTGGCTGATT CAGCAGGGAA AAATTCCTGA TGCTCGAATT
TTCGTACTTT CACCGAAAGT TGAAGCAGGG ACGAACCGCG AGAAGGCGGG CAATCGCGTG
GAGTTCTCGC TCAGGTAA
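
The gene sequence above is laid out in space-separated blocks of ten bases, so it has to be joined before it can be analysed. The sketch below shows one possible way to do that and to compute a GC fraction comparable to the GC content field of the record. The helper names are hypothetical, and only the first row of the sequence is used to keep the example short.

```python
def clean_sequence(block: str) -> str:
    """Join a block-formatted sequence by removing all whitespace; upper-case it."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence listed above.
first_row = "ATGTCTTCCC TCACAACGTT CTTGCATCGC TTCATCCGCC ATCGCCGTTT CCTGATCGGT"
seq = clean_sequence(first_row)
print(len(seq))                              # 60
print(f"GC content: {gc_fraction(seq):.1%}")
```

Passing the complete sequence text to `clean_sequence` instead of a single row gives the value for the whole gene.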
 
Protein sequence
MSSLTTFLHR FIRHRRFLIG LSVFAGIIIL FGLLSYFWLP GYAKAKLEEA LSEVLHRPVT 
VQSIDIQPYT LELTIRGFRI GEKETSVDAD KNLFSFDELY LDLSIASIAH RAPVISSVSL
KGPAMRLVRE AQQQFNITDL IEDFMKRPRE EEEGGKSMFS VRNIVIEDGR FEFVDHVKKS
QQEISDIQIG VPFIANFESA EETWVEPRFS AKVNGAPFNL TGKLRPFTTN REATLEIRLS
DVDLTRIDEY SPIPVGIHLL SGYVDSELLL TFTQVDGQSP ALVLTGNAAL KKLKVENHSV
EAPYAATLGE FNVQLTEINL NAVKSSRAAI VLTEIAVTPE GSAEPTLSLP KLALEEIVVD
IRQKNVSLRA ATLDRFNVSL RRHKDGRLDL AKLFTPITPI NRAEKASQSS TPPSPKADSG
KPWTVKLGSF KLADAALRFE DATLPDVAPM VVNTLDLAVN QIDFSGATPS QLELKAEVNK
TGSLETKGSL AWAPVAADLA VNAKDVDLVA LQGWAGDRLN VLFSRGALSF RGKIKADGGK
GQERPERARG EKASPLKVAV RGDGRLTNFN MLDKADATNL MRWRSVDIKG IEFANEPLSI
NAAAITLTDF FAHVVITPQG ELNLKYIVRQ EETAVSPSQA AAAPATEPEP ARASPEISPA
APVPSQPPPR KNLPISIGRI VMQGGHVNFQ DQFIQPNYRA YLTDLAGRIG PLNPRKTGEV
DIRGAVDKTA PLKISGSVDA FGRELNLDIT ASAKGIDMPT FSPYSGKYIG YAIEKGKLSV
DVHYHVEHGE LTAENSIFLD QLTFGEKIES PDAVSIPVNL ALALLKNRRG EIDIRLPISG
SINDPKFSLG GIIVKAILNL LTKAATAPFT VLGSLFGGEE LSEINFTAGE VKIPPEAEER
LQKLAQALTD RPALNLEVTG HADASIDPEP LKRRVLERKI KAEKLSADIK KGKSYDSLED
VTVTPEEYEK YLEQVYKDAK FEKPKNFIGL SKSLPAEEME KLMLANIDAG DAELQELAES
RAVSARDWLI QQGKIPDARI FVLSPKVEAG TNREKAGNRV EFSLR
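
The protein sequence can be checked against the gene sequence by translating the latter codon by codon. The following is a minimal sketch, not the method used by the database. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, which does not matter here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments shared by the standard code and table 11,
# listed in TCAG order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(block: str) -> str:
    """Translate a (possibly block-formatted) DNA string up to the first stop codon."""
    seq = "".join(block.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 18 bases of the gene sequence listed above.
print(translate("ATGTCTTCCC TCACAACGTT CTTGCATCGC"))  # MSSLTTFLHR
print(translate("GAGTTCTCGC TCAGGTAA"))               # EFSLR
```

The two outputs match the first ten and the last five residues of the protein sequence above.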