Gene Nmul_A1981 details

Gene Information

Locus tag: Nmul_A1981
Symbol:
ID: 3785005
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2279002
End bp: 2280138
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 54%
IMG OID: 637812070
Product: hypothetical protein
Protein accession: YP_412668
Protein GI: 82703102
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0767] ABC-type transport system involved in resistance to organic solvents, permease component
TIGRFAM ID: [TIGR00056] conserved hypothetical integral membrane protein
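
The start, end, and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2279002
end_bp = 2280138

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1137
print(protein_length)  # 378
```

Under these assumptions the results match the listed Gene Length (1137 bp) and Protein Length (378 aa).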


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGGAGA TTCAAGGCAG CAATTCACCC GTGGTATTGC GCCGCGTTAC CGGCGCTGAT 
GGAATTCCGC GCGTGATTGT TGAAGGGAAC TGCACTCTCA CTGCTTTAGG TGAGCGGATT
CGACCGATTG CTGCGGAGCT TTCCGAGTAC GCTACCGCGC CGGAAATAAG GTGGGATCTT
ACACAGATAG CCCGGATGGA TGACGCAGGA GCTGTTCTCC TGTGGCGCGG CTGGGGTAAT
CAGCGACCCC GGTATCTGCT TGTAAGACCC GAGCATGAAA GGCTTTTCGA TCGCCTGGAA
GAAACGTGGA CCCCTCCTTC TGAAGCAGCA CCGGATCGGC TGTGGCCGAT TATGGTACTG
GGAAAGGCAG TTTTCCTCTT GTGGGAGCAT CTGGTCGGCC TGGTGATTCT GATGGGCCAA
TTGGTGCTGG AAATAAAGTA TCTGGCCGGC CGTCCCTCGC GCATACCATG GCGAGAAATT
TCGGCGAATT TGTATCGGAC CGGGGCGCAG GCACTGGGTA TCACGGCCCT GGTGGGATTT
CTCATCGGTG TAGTATTGAG CTATCTGTCT TCCCGGCAAT TGCAAATATT TGGAGCGCAT
GTCTTCATCG TCAACATCCT GGGCATCAGC GTTATCCGGG AGCTTGGACC CATGCTTGCC
GCTATCCTGG TTGCCGGCCG TTCCGGTTCA TCGATGACCG CGCAACTGGG GGTGATGCGG
GTAACCGAGG AACTGGATGC CCTGACAGTC ATGGGTATTC CCCACAGTCT TAGGCTAATC
CTGCCGAAAG TGATCGCCCT GGGTCTTGCG ATGCCACTCG TTGTGCTGTG GACCAGCGCT
GTTGCACTGA TAGGAGGAAT GACCGTTGCC GAGCTACAGC TTGGGTTGGG CTACAAGTTT
TTTTTGAGCA GGCTTCCGGA TGCCGTGGCT GTTTCCAACT TATGGCTGGG ACTAGGCAAA
GGGATTGTAT GCGGAATGGC AATTGCCTTG ATTTCTTGCC ACTTCGGTTT GAGAATCAAA
TCAAATACGG AAAGCCTGGG CGAAGGTACA ACGAACTCGG TCGTCACATC CATTACCGTG
GTAATTATTA TCGATGCAAT TTTTGCCGTG ATCTTTTCGG ATGTGGGATT AAGATAA
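
The record lists a GC content of 54% for this gene. A minimal sketch of how such a figure could be computed from the space-separated sequence is given here; the function name `gc_percent` is hypothetical, and the rounding used by the source database is not known.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    # Remove the spaces and line breaks used to group bases in blocks of ten.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence as listed above (60 bases).
first_line = "ATGTCGGAGA TTCAAGGCAG CAATTCACCC GTGGTATTGC GCCGCGTTAC CGGCGCTGAT"
print(round(gc_percent(first_line), 1))
```

Passing the full 1137 bp sequence instead of the first line gives a value that can be compared against the reported 54%.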
 
Protein sequence
MSEIQGSNSP VVLRRVTGAD GIPRVIVEGN CTLTALGERI RPIAAELSEY ATAPEIRWDL 
TQIARMDDAG AVLLWRGWGN QRPRYLLVRP EHERLFDRLE ETWTPPSEAA PDRLWPIMVL
GKAVFLLWEH LVGLVILMGQ LVLEIKYLAG RPSRIPWREI SANLYRTGAQ ALGITALVGF
LIGVVLSYLS SRQLQIFGAH VFIVNILGIS VIRELGPMLA AILVAGRSGS SMTAQLGVMR
VTEELDALTV MGIPHSLRLI LPKVIALGLA MPLVVLWTSA VALIGGMTVA ELQLGLGYKF
FLSRLPDAVA VSNLWLGLGK GIVCGMAIAL ISCHFGLRIK SNTESLGEGT TNSVVTSITV
VIIIDAIFAV IFSDVGLR
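
The protein sequence corresponds to the gene sequence translated codon by codon under translation table 11. The sketch below illustrates this on the first 30 and last 18 bases of the gene. It builds the codon table from the NCBI amino-acid string in TCAG order; table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the permitted start codons. The names `CODON_TABLE` and `translate` are illustrative, not part of any database tool.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI convention).
# Table 11 shares these assignments with the standard genetic code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    # Strip the spaces used to group bases in blocks of ten.
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGTCGGAGA TTCAAGGCAG CAATTCACCC"))  # MSEIQGSNSP
# Last 18 bases of the gene sequence, ending in the TAA stop codon.
print(translate("GATGTGGGATTAAGATAA"))  # DVGLR
```

Both fragments agree with the start and end of the listed protein sequence.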