Gene Nmul_A2141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2141 
Symbol 
ID3784767 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2433113 
End bp2434282 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content54% 
IMG OID637812229 
ProductFatty acid desaturase 
Protein accessionYP_412826 
Protein GI82703260 
COG category[I] Lipid transport and metabolism 
COG ID[COG1398] Fatty-acid desaturase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.623834 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTCTCG GTCTTATCGA CTTGCCATGG TGGGGTTATA TCGCCGTTAC CCTCGGCCTT 
ACCCACATAA CCATAGCCAG TATCACAATC TTTCTGCATC GTCATCAAGC TCATCGCTCG
CTGGATCTTC ATCCCTTGCC GAGTCATTTT TTCCGCTTCT GGCTGTGGCT CACGACCGGG
ATGGTCACCA AGGAGTGGAC CGCCATACAC CGGAAGCATC ATGCCAAGTG TGAAACGGTC
GATGATCCCC ATAGTCCGCA AATAGTGGGG ATCGCCAAAG TGCTGCGGGA GGGATCGGAG
CTCTACCGCG CAGAAGCCAA AAACATGGAA ACCATGGAAA GATACGGCCA TGGTACGCCG
GATGACTGGC TGGAGCGGAA TGTCTATGAC AAGCACAGCC GCAAGGGGGT GGCCCTCATG
CTGATCATCG ACGTCATTCT GTTCGGACCC ATCGGCCTGA CCATCTGGGC CATCCAGATG
GCATGGGCGC CCATCATGGC TGCGGGCGTG ATCAATGGGA TAGGACATTA CTGGGGCTAC
CGTAATTTCC AGGCCGAAGA CGCCAGCACC AATATCGTTC CCTGGGGAAT TCTCATTGGC
GGAGAGGAGT TGCACAACAA CCACCACGCT TATGCCACTT CCGCGCGCTT ATCCAACAAG
TGGTATGAAT TCGACATCGG CTGGATGTAT ATTTGCATTC TCCAGTGGAT GGGGTTGGCG
CAAGTGAAGA AGGTGGCGCC GAAACTGCGC CTCGATGCCG CAAAGACAGA ATGCGACGCA
GATACGCTGC AAGCTGTCAT TTCCCATCGC TACGAAGTAC TGGCAAAGTA TGCCCAGTCT
CTCAAGCAGA CGCTGGCAAA GGAAGTTGAT CATCTGAAAG AAGCGGCGAC AAATCTTGGC
GTCGATCGTT CCACACTCAA GCGTTGGGTA CTTGCGGACT CCAAGACCCT GCAGGAAGAC
GAGCGGGCAA AGCTCAATCT AGTGCTGAGC AAGACGAGTA CGCTGGATAA AGTTTACAAA
ATGCGCGAGG AATTGATAAC GGTATGGCAA CGCTCCACTT CATCCAAGGA TGAGCTGGTC
AAGCAGCTGG AAGACTGGTG TCACCGCGCC GAGGAAAGCG GCATCGAGGT ATTGCAGAAT
TTCTCCCGCA GGCTGCGCTG CTACGCTTAG
 
Protein sequence
MTLGLIDLPW WGYIAVTLGL THITIASITI FLHRHQAHRS LDLHPLPSHF FRFWLWLTTG 
MVTKEWTAIH RKHHAKCETV DDPHSPQIVG IAKVLREGSE LYRAEAKNME TMERYGHGTP
DDWLERNVYD KHSRKGVALM LIIDVILFGP IGLTIWAIQM AWAPIMAAGV INGIGHYWGY
RNFQAEDAST NIVPWGILIG GEELHNNHHA YATSARLSNK WYEFDIGWMY ICILQWMGLA
QVKKVAPKLR LDAAKTECDA DTLQAVISHR YEVLAKYAQS LKQTLAKEVD HLKEAATNLG
VDRSTLKRWV LADSKTLQED ERAKLNLVLS KTSTLDKVYK MREELITVWQ RSTSSKDELV
KQLEDWCHRA EESGIEVLQN FSRRLRCYA