Gene Nmul_A2001 details


Gene Information

Locus tag: Nmul_A2001
Symbol:
ID: 3784492
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2299331
End bp: 2300719
Gene Length: 1389 bp
Protein Length: 462 aa
Translation table: 11
GC content: 48%
IMG OID: 637812090
Product: FAD-dependent pyridine nucleotide-disulphide oxidoreductase
Protein accession: YP_412688
Protein GI: 82703122
COG category: [C] Energy production and conversion
COG ID: [COG1252] NADH dehydrogenase, FAD-containing subunit
TIGRFAM ID:
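
The gene and protein lengths listed above follow from the start and end coordinates. The short sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the last codon is a stop codon that is not counted in the protein; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2299331
end_bp = 2300719

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1389
print(protein_length)  # 462
```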


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
GTGGATGAAG ATCGAAAAAT TACGCATCAT GCTCGAACTG CTCCACATTG CGGCATTGAG 
AGAACTGCAG TTGCCTTCCT ATTCATCTTT GTCAATTACC TCATGATGAA AACCAAAATC
GCCATTATTG GTGGGGGCTT TGCTGGACTG AACTTGGTCA AGCATTTGGC TGGAAAAGAG
GAATTTGATG TGACGCTGGT TGACATGAAT AACTATCATC TTTTTCCTCC GCTTCTGTAC
CAGGCTGCGA CCGGCTTCCT GGACGTGTCC AATATTGCGT ACCCCTTCCG AAAATTTTTC
CATGACAAGG TGAACGTCCA TTTCCGGCTG GGCAAGCTGC AAAAGGTGAT GCCTGAAGGA
AACAAGGTAC TCCTGTCCAC GGGCGAGTTG GCCTACGATT GTCTTGTTCT CGCCACGGGC
ACCGAGCCCA ATTATTTTGG CATCGAAAAT ATTCGCCGGG CAGCACTGCC GATGAAGACG
GCCGATGATG CCATAGAAAT ACGAAATTAC ATGCTTCAAA AAATGGAAGA AGTAACTATT
GAGGTTGACG AAATGAGGAG GAAAAAACTG TTTTCGGTCG TGATTGCCGG TGGTGGTCCC
ACGGGAGTGG AAATTGCGGG CATGTTGGCA GAGATGCGCA AGAGAATCCT CCACAGGGAT
TACCCTGAAT TGACCGGTCT CCAGCCACGG ATTCACCTTG TTGACAGTGC GTCCGCTCTA
TTGGGCGCGA TGAGCGTTCA TTCGCAAAAA TACACCTACG AAGTGCTGCT GAAGATGGGC
GTTGAAATAC ATCTAAATAC CCAGGTGAAG GATTACATAA ACGATACAGT GATTTTCTCC
GACGGAAAAA CCCTTGAGAC GCAAATCCTG CTGTGGACTG CCGGCGTGAC AGGGAAGATA
TTCGAAGGCC TGCCCCACGA ATGTTATGGA CGAGGTAACC GCCTGCTGGT AAATGAATAT
AACAAAGTCA GCGGAACAAG GGACATCTAC GCCATAGGCG ATACTTGTCT GCTGACCTCG
GATAGAAATT TTCCGCAGGG ACATCCCCAG CTTGCACAAG TGGCGCTTCA GCAAGGCAGA
AACCTGGCCG CCAATCTTGT TGCCGTCATC CGTAACCAAC CTTTGACACC GTTTGCATAC
AATGACAAAG GATCTCTGGC CATTATCGGT AGAAACAAGG CCGTGGCCGA CTTTCCCAAA
CCCGCCTTGC ATCTCGAAGG ATTCATGGCA TGGGGAATCT GGCTATTTGT ACACCTTTTT
TCTCTTGTCA CTTATCGTAA CCGGTTTATG ACATTAGCAA ACTGGGCAGT TGCTTTTTTC
ACCAAGGATC AATCCTTGAG AATGATTATC AGGCCTCGAT CGGATCAACA GAGCAGCAGG
AATGAATGA
 
Protein sequence
MDEDRKITHH ARTAPHCGIE RTAVAFLFIF VNYLMMKTKI AIIGGGFAGL NLVKHLAGKE 
EFDVTLVDMN NYHLFPPLLY QAATGFLDVS NIAYPFRKFF HDKVNVHFRL GKLQKVMPEG
NKVLLSTGEL AYDCLVLATG TEPNYFGIEN IRRAALPMKT ADDAIEIRNY MLQKMEEVTI
EVDEMRRKKL FSVVIAGGGP TGVEIAGMLA EMRKRILHRD YPELTGLQPR IHLVDSASAL
LGAMSVHSQK YTYEVLLKMG VEIHLNTQVK DYINDTVIFS DGKTLETQIL LWTAGVTGKI
FEGLPHECYG RGNRLLVNEY NKVSGTRDIY AIGDTCLLTS DRNFPQGHPQ LAQVALQQGR
NLAANLVAVI RNQPLTPFAY NDKGSLAIIG RNKAVADFPK PALHLEGFMA WGIWLFVHLF
SLVTYRNRFM TLANWAVAFF TKDQSLRMII RPRSDQQSSR NE
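
The sequences above are printed in blocks of 10 characters, 60 per line, so they need to be joined before any further processing. The following is a minimal sketch of that clean-up step, with a simple GC-fraction helper. The function names `clean_sequence` and `gc_fraction` are hypothetical and are not part of any tool referenced by this record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty string)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "GTGGATGAAG ATCGAAAAAT TACGCATCAT GCTCGAACTG CTCCACATTG CGGCATTGAG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
```

Applying `clean_sequence` to the full gene sequence block and passing the result to `gc_fraction` would give a value that can be compared with the GC content field.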