Gene Nmul_A1368 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1368 
Symbol 
ID3786511 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1555152 
End bp1556345 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content55% 
IMG OID637811456 
Productflavin-containing monooxygenase FMO 
Protein accessionYP_412063 
Protein GI82702497 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2072] Predicted flavoprotein involved in K+ transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGATTTG CCACGAAGTT CGATCTCCTG TCTCACATCA GGCTCAATGC AGCACTGACG 
AGCGCACGCT TTGATGCTGA ACAGGGGTTA TGGCTGCTAC GGCTTGAAAA CGGGGAGCAA
CTGAAGGCAA GCTTCTTCAT CTGCAGCGCG GGACCGCTGA GCGAGCCGCG CTATCCCGAT
ATCCCGGGTA TGGATACCTA TGAAGGGCGT CTTTTTCATT CTGCGCGGTG GGATCACGAT
TACCCCCTCA ATGGAAAACG GGTGGCGGTC ATCGGCACCG CGGCGAGCGC GGTACAGATC
ATTCCCCATA TTGCTCCCCT TGCTGCAAGG CTTTACGTCT GCCAGCGTTC CCCCAACTGG
ATCATACCCA GGCTGAATCA TGTTTATGCG CCATGGGAGA AAGCCCTGTT TCGGTTAAAG
CCCATTGCCA AAACCAACCG TTTTCTGCTC TACTGGCTTC ATGAAATGAA CCGTCTTTCC
TTTAATCCCG GCGGTTTCAT GGCAAGGATC GGGCGTGAAC TTGCAGAATG GCATCTCAGG
CGGCAGGTGA TCGACCCGCG CCTGCGGGGA GCGTTGCGTC CCGGCTATCC CCTCGGCTGC
AAGCGTGTCC TGCTTTCCAA CGATTACTAC CCTACCCTCA TGCGCCCCAA TGTGGAGCTG
ATCGATACGC CTATCGACCG AATCGATCCC GGCGGTATCG TCACCCGGGA CGGGCAGAAA
CGCGAAGTGG ATGTGATTAT TTGTGCAACC GGCTTCAATG TAAAACGCAT GCTCTCGGTA
GAGATACGGG GTTTGCAGGG CTATCGCCTG AATGAGGCGT GGGCGCGCGA GCCCAAGGCT
TATCAGGGAG TCACCGTGGC CAGCTTCCCC AACCTTTTCA TACTCCTAGG GCCGAACACG
GGGCAAGGAC ACACCTCCGC GATCCTTTTT ATAGAAGCCC AGGTGAACTA TGCGTTGAAA
TGCATTCAGG AGATTGCCAG GCAGCAGAAG CGCTTTCTTT CAGTGAAACC GGAAGCCATG
AACCGGTATA ACGAGGAATT GCAAAAGACG TTATCGACAT CGGTATGGGC TGCAGGATGC
CGGAGCTGGT ACAAGACAGA ATCCGGGAAA ATCATCGGAA TTTATCCCGG ATTCTCTTTT
CAGTACGCAA AGCAGCTTCG GGAACCGCGA TTCGAGGATT ATGTCATGTG TTAG
 
Protein sequence
MRFATKFDLL SHIRLNAALT SARFDAEQGL WLLRLENGEQ LKASFFICSA GPLSEPRYPD 
IPGMDTYEGR LFHSARWDHD YPLNGKRVAV IGTAASAVQI IPHIAPLAAR LYVCQRSPNW
IIPRLNHVYA PWEKALFRLK PIAKTNRFLL YWLHEMNRLS FNPGGFMARI GRELAEWHLR
RQVIDPRLRG ALRPGYPLGC KRVLLSNDYY PTLMRPNVEL IDTPIDRIDP GGIVTRDGQK
REVDVIICAT GFNVKRMLSV EIRGLQGYRL NEAWAREPKA YQGVTVASFP NLFILLGPNT
GQGHTSAILF IEAQVNYALK CIQEIARQQK RFLSVKPEAM NRYNEELQKT LSTSVWAAGC
RSWYKTESGK IIGIYPGFSF QYAKQLREPR FEDYVMC