Gene Nmul_A1731 details


Gene Information

Locus tag: Nmul_A1731
Symbol:
ID: 3786208
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1978141
End bp: 1979511
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 53%
IMG OID: 637811817
Product: hypothetical protein
Protein accession: YP_412420
Protein GI: 82702854
COG category: [S] Function unknown
COG ID: [COG4267] Predicted membrane protein
TIGRFAM ID:
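
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1978141, 1979511)
print(length)                     # 1371
print(protein_length_aa(length))  # 456
```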


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCGGTA TCGGTTTCGA AATCCGCAAA ATCCTGGAGC GCGACAATTA CTGGTCGGTG 
TTGCGAGCCT ATGGTTATGC GGGCCTGGTC AGCGGTGGTC CATGGGTGCT CTCGATTTTG
AGCATCATGA TGATAGGCGT CCTTGCCGTC ATGCTGGGTG TGGAACAGCG CGAGGTCAAT
GCCTTCCAGA TCTGCGTTAC CTATATGATG GCGAGTTCAC TGATCTGGAC CGGGGGTATG
CAGCTCATGT TCACTCGGTT CGTCGCGGAC CGGACTTATG CCGGTGACAA GGCAGAGGTG
CTGCCCAATC TGTTCGGCGT CCTGATGTTG ACCATGGCGG GTGGCTTTGC GTGGGCAGCG
CCCTTTATCT TCTCGCTTTT CACGGGACCA TTTTTCCAGC AACTGTTGCT GCTGTGCAAT
TTTGTTGTGC TGAGCGGGAT GTGGATTGTC CTCATTTTCC TGTCGGGGAT GAAGGCGTAT
GGGCGAATCA TCTGGACTTT GGCCAAGGGT TACTCCCTGG GAATTGCAGT CGGACTCCTT
GCATCTCCAT GGGAACTCAA CGGCCTGTTG CTCGGTGTGC TGACCGGTCA CGGCTATCTG
CTGTTTTCCT TCCTGCATCA TATCGTGCGG GAATATCCCG GAAATTCTCT GCTCAAGTTT
GACTTTCTCG ATCGAAGCCG GAGCTTTTAC AGCCTGTTTG CGGTCGGCAC GCTCTATTAC
CTGGCGGTGT GGATCGACAA ATTCATCTTC TGGTTCGTGC CCTATACTTC CGAGGTGGTC
ATCGGTCCCT TGCGCGCATC CATCATCTAC GATCTTCCCA TTTTTCTGGC ATATCTATTC
ATCCTGCCGG GGATGGCCGT ATTTCTGGTA AGCATAGAAG CGGATTTTGC CGAACAGCAC
GAACGTTTTT ACCGTGCCGT GCGTGAAGGC GACACGCTCA TGCATATCGA ATATAGGCGT
GACCGGATGG TGTACGCTGC CCGGCAAGGT ATCTACGAAA TCTTCAAAGT GCAGGGTCTG
ACGGTGGTGT TGTGCCTTCT GTGGGGCAGG GGCCTGCTGC AAACCATTGG CATATCCCCG
CTCTACATTC ATTTGTTTTA TATCGATGTG GTCGCAGTCA GTGTGCAGGT GCTGCTGATG
GCGATACTCA ACATACTTTT TTATCTCGAC GCCCGGCGGG AAGTATTGAT CGTTACGGCA
TGTTTTTTTA TCACCAACCT GCTGTTCACC GTTGCCACGC TGCAACTAGG AGCCGAATCC
TTCGGATACG GCTTTGCAGT ATCCGTTACG CTGTCCGCAT TTCTCGGTCT CTTCATCCTC
TCGCATAAGT TCAACCGGCT GGAGTATGAG ACATTCATGC TGCAGGGTTA G
 
Protein sequence
MAGIGFEIRK ILERDNYWSV LRAYGYAGLV SGGPWVLSIL SIMMIGVLAV MLGVEQREVN 
AFQICVTYMM ASSLIWTGGM QLMFTRFVAD RTYAGDKAEV LPNLFGVLML TMAGGFAWAA
PFIFSLFTGP FFQQLLLLCN FVVLSGMWIV LIFLSGMKAY GRIIWTLAKG YSLGIAVGLL
ASPWELNGLL LGVLTGHGYL LFSFLHHIVR EYPGNSLLKF DFLDRSRSFY SLFAVGTLYY
LAVWIDKFIF WFVPYTSEVV IGPLRASIIY DLPIFLAYLF ILPGMAVFLV SIEADFAEQH
ERFYRAVREG DTLMHIEYRR DRMVYAARQG IYEIFKVQGL TVVLCLLWGR GLLQTIGISP
LYIHLFYIDV VAVSVQVLLM AILNILFYLD ARREVLIVTA CFFITNLLFT VATLQLGAES
FGYGFAVSVT LSAFLGLFIL SHKFNRLEYE TFMLQG
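
The protein sequence can be checked against the gene sequence by translating it with translation table 11. The sketch below is a minimal, dependency-free illustration. It relies on table 11 sharing its codon-to-amino-acid assignments with the standard code, and it does not handle alternative start codons such as GTG or TTG (this gene starts with ATG, so that case does not arise here). The `translate` helper is a hypothetical name.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, listed in TCAG order (first base varies slowest).
# Translation table 11 uses these same assignments; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, spaced as in the record.
print(translate("ATGGCCGGTA TCGGTTTCGA AATCCGCAAA"))  # MAGIGFEIRK
# Last 12 bases, ending in the TAG stop codon.
print(translate("CTGCAGGGTTAG"))                      # LQG
```

Passing the full gene sequence to `translate` should give a string that can be compared with the protein sequence once the spaces and line breaks are removed from both.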