Gene Nmul_A0374 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0374 
Symbol 
ID3784069 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp407378 
End bp409432 
Gene Length2055 bp 
Protein Length684 aa 
Translation table11 
GC content53% 
IMG OID637810450 
ProductUBA/THIF-type NAD/FAD binding fold 
Protein accessionYP_411074 
Protein GI82701508 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1179] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTCAGG ATTTCTCATA TCGTGAGGCC ACCTCCCGCA ATATTGGGTG GGTAACACCT 
GTCGAACAGG AAATTCTACG GTACAAACGC ATTGCAATTG CCGGTCTGGG CGGCGTAGGG
GGTATTCATC TGCTGACCTT GGCTCGGCTG GGGATTGGCG CATTTCATAT TGCGGATTTT
GATGTTTTTG ATCTGGTCAA TTTCAACCGC CAGGTAGGCG CCACGGTCTC GAGCCTCAAT
CGGCCAAAGA GCGAGGTTTT AGCCGCAATG GCACGAGATA TCAATCCGGA ACTGAATATC
AAAATCTTTC CTCGGGGCGT CAATCCGGAT AATTTGCCGG AATTTTTCGA GGACGTTGAT
CTGTATATCG ATGGTCTCGA CTTTTTCGCC TTTTCAGCTC GCGCAGTAAC GTTTTCGGCC
TGTGAGGAGC GAGGTATTCC CGCCATCACG GCCGCTCCTT TAGGCATGGG ATCGGCATTA
TTGAATTTCC TGCCTGGCAA AATGACCTTC GAAGAGTATT TTCAGTGGGG TGATCTACCC
GAGGTGGAAA AGGCCCTCCG TTTCTTGATA GGACTCGCGC CCACTGGTCT CCATGCCCGT
TACCTGCTGG ATCCCTCCAG CATCAATCTC AAGGAACGCC GCGGTCCCTC AACGATAATG
GGTTGCCAAC TCTGCGCGGG TATTGCAGCT ACTGAAGCTC TGAAGATCCT GCTGAATCGC
GGTACGGTTT TGGCCGCTCC ACATGGAGTT CATTTCGACG CCTACCGCAA TAAACTGATC
CGTACCTGGC GTCCGGGCGG GAACAGCAAT CCTCTCCAGC AGTTCAGTCT CGCAATTGCA
AGGCGTCGCC TGAGCAAAAA AAATACTAAC AAGCTCTTCG GAGATCCGGA GCCCTATTCT
TCCACGGGTA CTATCATTCC CTCTGAATCG ACAGGCTGTG AGGACGAAAG TAAAAGGCCA
CCCGAGCTCA AGTCATATAC ATTAATCGAG CAAATCCTCG ATCTTGCCAG GTGGGCGCCG
AGCGGAGACA ATACCCAGCC TTGGCGGTTC CAGATCGTCG GGGACAATTC CCTGACAATA
CACGGTTTCG ACACCCGTGA TCACTGCGTC TATGACCTGG ATGGACGTCC TAGTCAAATT
TCAATCGGCG CATTGCTGGA AACCATATCT ATTGCCGCTA CGGGCCATGG CCTAAAAACA
AGTATCCAGC GGCGTCCCGA TTGTTCCGAT ACGAAGCCAA CCTTCGATAT CCATTTCGGG
AGTGACCCGC ATCTTAAGCC CGATCCGCTT ATTCCTTATA TACGCTATCG CAGCGTTCAA
CGCCGCCCCA TGAGTACGCG GCCTTTAACC GGGAGAGAAA AGAGTGCGCT AGAGGCTGCA
GTGAAACCCC AGTACGACAT CTTGTGGCTG GAGGGTTTTT CCAGGCGTCT GGAAATGGCC
CGGTTCCTGT TCGACAATGC AAAATTGCGT CTGATTATGC CTGAAGCCTA TCAGGTGCAC
CGCGCCATCA TACAATGGAA CTCCCGCTAT AGCGCAGATA AAGTACCTGA CCAGGCACTC
GGAGTTGATC CATTTACAGC CCGGTTAATG CATTGGATCA TGGGAAGCTG GGATCGTGTC
GAATTTTTTA ACACTTTTTT GGCGGGAACC TGGGCACCGC GTCTCCAGAT GGATTTGATC
CCCGCCATGG CCTGTGGCGG GCATTTCGTT ATCCTGGCGC CTCACGCGCC TCGATCCATC
GATGACTACG TGAGCGCAGG ACGCGTTATG CAGCGCTTCT GGCTTACAGC AACTAAACTT
GGCCTGCAGC TGCAGCCGGA AATAACTCCT CTGGTTTTTG CACGTTATAG TCGTGAAAGT
ATTCCCTTTT CGAAGACAAA ACAATGCATT ACACTCGCCA CCTCACTGAC CCGACGGCTT
GGTCATATTC TGGGAGAAGA AGTCGCGACC AGAGCCGTTT TCATGGGGCG CATCGGGGCT
GGCAAAAGAC CCGCTTCGCG GTCTATCCGG CTCGAGCTGG AGCAGATGCT TTGGCGCCTG
AGGGCAGGAG CTTAA
 
Protein sequence
MVQDFSYREA TSRNIGWVTP VEQEILRYKR IAIAGLGGVG GIHLLTLARL GIGAFHIADF 
DVFDLVNFNR QVGATVSSLN RPKSEVLAAM ARDINPELNI KIFPRGVNPD NLPEFFEDVD
LYIDGLDFFA FSARAVTFSA CEERGIPAIT AAPLGMGSAL LNFLPGKMTF EEYFQWGDLP
EVEKALRFLI GLAPTGLHAR YLLDPSSINL KERRGPSTIM GCQLCAGIAA TEALKILLNR
GTVLAAPHGV HFDAYRNKLI RTWRPGGNSN PLQQFSLAIA RRRLSKKNTN KLFGDPEPYS
STGTIIPSES TGCEDESKRP PELKSYTLIE QILDLARWAP SGDNTQPWRF QIVGDNSLTI
HGFDTRDHCV YDLDGRPSQI SIGALLETIS IAATGHGLKT SIQRRPDCSD TKPTFDIHFG
SDPHLKPDPL IPYIRYRSVQ RRPMSTRPLT GREKSALEAA VKPQYDILWL EGFSRRLEMA
RFLFDNAKLR LIMPEAYQVH RAIIQWNSRY SADKVPDQAL GVDPFTARLM HWIMGSWDRV
EFFNTFLAGT WAPRLQMDLI PAMACGGHFV ILAPHAPRSI DDYVSAGRVM QRFWLTATKL
GLQLQPEITP LVFARYSRES IPFSKTKQCI TLATSLTRRL GHILGEEVAT RAVFMGRIGA
GKRPASRSIR LELEQMLWRL RAGA