Gene Nmul_A1705 details


Gene Information

Locus tag: Nmul_A1705
Symbol:
ID: 3784804
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1945346
End bp: 1946446
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 55%
IMG OID: 637811792
Product: FAD dependent oxidoreductase
Protein accession: YP_412395
Protein GI: 82702829
COG category: [C] Energy production and conversion
COG ID: [COG0644] Dehydrogenases (flavoproteins)
TIGRFAM ID:
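
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon; neither convention is stated in the record, but both agree with the listed values. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1945346
end_bp = 1946446

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1101
print(protein_length_aa)  # 366
```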


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTCCCG ACGGAGAGAT ATCGGATGTT GTTGTCATAG GCGGCGGACC GGCGGGCGCC 
ATCTGCGCGC TTGCGCTCGC ACGTGCGGGT GTCGATGTAA AGCTCATGTA CTGGGGAGGT
TACGCTCCCG GCGGCATCGA ACTCGTATCG GGTCGAGGGC GACATTTTAT CGAGCAGTAC
TGTCCCGATT TTTTTTCAGA AGTAGTGCAT GGAATAGAGA TTCATGAAAC CGCCTCGCTG
TGGGATACGG CCGAGCCGGT AATATTCAAT GCAATGTTTA ATCCATGGGG CGCTGGCGTA
GCGATTGAGC GTTCGCTTCT GGATGGGGCT TTGCGAAATC TTGCTTCCGG CGCGGGCGGC
ACTATAATTC CCGACGCCAA GGTAGTGGAC GTAGAGCGCC AGCATGACAG GTGGCGGCTG
ATCGCGCGTT CTTCCGAGGA CGCATCTTCC AATGAAACCA CTTCCCGGGC AGGCGAATTC
GCTATTTATG CGCGTTTTAT AGTGCTTGCG ACAGGACGCG TGCCGCTGCC GTTTTTCGAC
CATGCACCCG TTGCAGAGTC CTCGCAAATT GCCCTGATGA CTTCCCTCCA GGCTCGGATA
GCCCCTCGCC ACACTCTCTA TGTCGAAGGT ACCCGAAACG GGTGGTGGTA CGCTTTGCCT
GCCGAAAAAG GTTATTTCGC CAGTTTTTGT ATCGGGCGGA ATGAACTCAA GCAGCGGCAG
TCGCGCTTGA AGGATTTTTT CTTTCAGGAA TTGCAGTGTA CCCGCCTTCT CGCGCCATTG
TCGGCGGGAG CTTTCGATCA GCGGCCAATA GCCGGACGAA TGGCTGGCGC GACGATGTTC
CCAGCAATGG GCGGAGACGC CTGGATTGCA GTCGGAGATG CAACGGCAGC GCCGGATCCT
CTCAGTGGAA CGGGGATCGA GTGGGCAATC GAATCCGCGC AACTCGGCGC AGACATGTTA
CTGGAAGCAT TGCATGGATC TAAAGGCAAT GTTCTTTTCG ATCTTCCGCG TTATGAAAAT
ACGATACGCC GACGCATCGC CGCTCAGGAA AAGACAGCCG CTTACCATTA CCACAGGTTA
AAAGAGATAA GGGAAACATA G
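
The record reports a GC content of 55% for this gene. As a rough sketch of how such a figure is usually derived, the function below counts G and C bases as a percentage of all bases after removing the spacing used in the listing. The name `gc_content` is hypothetical, and how the database rounds the value or treats ambiguous bases is not stated, so this is an assumption rather than its actual method. The example call uses only the first ten bases of the gene.

```python
def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    # Drop the spaces and line breaks used to group bases in blocks of ten.
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First ten bases of the gene sequence above.
print(gc_content("ATGTTTCCCG"))  # 50.0
```

Passing the full 1101 bp sequence to the same function gives the gene-wide percentage.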
 
Protein sequence
MFPDGEISDV VVIGGGPAGA ICALALARAG VDVKLMYWGG YAPGGIELVS GRGRHFIEQY 
CPDFFSEVVH GIEIHETASL WDTAEPVIFN AMFNPWGAGV AIERSLLDGA LRNLASGAGG
TIIPDAKVVD VERQHDRWRL IARSSEDASS NETTSRAGEF AIYARFIVLA TGRVPLPFFD
HAPVAESSQI ALMTSLQARI APRHTLYVEG TRNGWWYALP AEKGYFASFC IGRNELKQRQ
SRLKDFFFQE LQCTRLLAPL SAGAFDQRPI AGRMAGATMF PAMGGDAWIA VGDATAAPDP
LSGTGIEWAI ESAQLGADML LEALHGSKGN VLFDLPRYEN TIRRRIAAQE KTAAYHYHRL
KEIRET
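
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below builds a codon table by hand instead of relying on a library; table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative, not part of any database tool.

```python
from itertools import product

# Codons in TCAG order paired with their amino acids ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip(("".join(c) for c in product(BASES, repeat=3)), AMINO_ACIDS)
)


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove the spacing of the listing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 21 bases of the gene sequence above.
print(translate("ATGTTTCCCG ACGGAGAGAT ATCGGATGTT"))  # MFPDGEISDV
print(translate("AAAGAGATAA GGGAAACATA G"))           # KEIRET
```

Both fragments reproduce the start and end of the listed protein sequence.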