Gene Nmul_A0095 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0095 
Symbol 
ID3785820 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp99069 
End bp101060 
Gene Length1992 bp 
Protein Length663 aa 
Translation table11 
GC content57% 
IMG OID637810165 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_410796 
Protein GI82701230 
COG category[I] Lipid transport and metabolism 
COG ID[COG1022] Long-chain acyl-CoA synthetases (AMP-forming) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAATG TCTCCGCCAG CCGGCAGCAG GTTCGATCGG CATTATCTCC GGCAGCGACG 
CCCAGCGACG CGGAAATCCC TACAACATCT GGAAGACAGG ACATGGCGGG CATCATTCCG
GTGGAAACTG CGGGAACTCT GGATGGATTG TTCCGCGAGC GGGTGCGCCG CACGCCCGAA
GCGATAGCCT ATCGGGATTA TGATCGATCG AGCGGCAAGT GGCGTGATCT TTCCTGGGCG
CGAATGGATG AGCGCATTGC ACACTGGGTG ATTGCGCTAT CGAAGGAAAA GCTTAAACCG
GGTGACCGCG TCGGGATAAT GTTAAGAAAC TGCCCGGAAT GGGTAATGTT CGAACAGGCT
GCGCTGAGGC TTGGGCTGGT GGTAGTGCCG CTCTATCCCA CTGATCGCCC GGATAATGCG
GCTTATATCC TTCAGGATGC AGGCGTCAAG GTACTTCTGC TGGAGGAACT GGGGCAGTGG
CTTGCTTTCT CGGAAGTGCG TGACCAGATG ACAGGGCTTG TACGCGTCAT CATTATCCAG
GGCAGCGTCA GGCAGGGTCA TAGAGGAAAC GAGCTCGTGC TTGCGCTGCA TGACTGGCTG
CCGGAGTGGT TCCCCGGCGA AAGACCGACC GAGTCGTTGC CAAGCGTGGA ACCCCTCTAT
CAGCTTCCTG CGGAGCCGGA GCATCAGGCT CCGGAAGAAG TCCCGCCCAG CCGTGTCCTC
GCACGTGACC CGCACCAGCT GGCTACCATC ATTTATACAT CGGGTACATC CGGGCATCCC
AAGGGCGTGA TGCTCAGCCA TCATAATATA CTGACCAATG CCCATAGCTG CCTGCAGGTC
GTACCGATCG AAGAAAGCGA CGTGCTGCTC TCCTTTTTAC CGCTGTCCCA CACATTTGAA
CGGACTGCCG GTTATTACGT GCCGATGATG CGGGGCTCGA CAGTGGCGTA TGCGCGCTCG
ATTCCCCAGT TGCAGGAGGA TCTGCTGATC ATCCGGCCGA CCATCCTCGT TTCGGTGCCC
CGAATATATG AGCGCGTATA TGCCGGCATT CGCGCCAAGC TGGCGGAGGG GCCCCTGCTG
AGCCGCAGGC TGTTCGACCT TGCAGTCGAG ATCGGGTATA ACCGGTTCGA GTATCAGCAG
GGGCGCGCCG AGAAGCATTT TTCCCATGCG CTGTGGCCGT TGCTGGAAAT ACTGGTGGCA
AAGAAGGTGA TGAGCAAACT GGGAGGGCGG CTGCGCGCAG CAATGAGCGG CGGAGCCGCA
TTATCGAGCG AGGTTTCCCG CATATTTATC GGCCTGGGCC TGCCCATCCT CCAGGGCTAC
GGTATGACGG AAAGCAGCCC GGTGGTTTGC TGCAATACGA TCGAGGACAA TGTTCCGGCG
AGTGTCGGAC GGCCTATTCC AGGGGTGGAA GTGAAGCTCG GCGAGCAGAA TGCCTTGCTT
ATTCGCGGAC CCAACGTCAT GCTCGGCTAC TGGAACAATG AGGAGGCCAC GCGAGCGGTA
ATGACCCCGG ACGGGTGGCT CAATTCGGGC GATATCGCGG AAATCGATGA GGCCGGGCAT
ATCGCCATTA CCGGCCGGGT AAAGGAAATC ATTGTCATGT CGACCGGGGA AAAGATTCCA
CCTGCCAACA TGGAAGCGGC GATTCTACGC GATCCGTTAT TTGAACAGGT CATGGTGGTT
GGCGAGGGAC GTCCTCATTT AGCGGTGCTG GCAGTTCTCA ACTCCGGAAA TTGGGAAAGC
ATGGCTGACG AATACCATCT CGACCGGAAC TGGCGCCGTC TAGGGGGCGA TCCGAAGTTC
GAAGAAATCC TGCTGGAACG GATCGCTTAT CAGATCAAAG GATTTCCGGG TTATGCCAAA
ATATATCGCA TTGCCGTGGT GGCCGAGCCG TGGACAGTCG AGAATCAGAT GTTGACCCCG
ACGTTGAAAC TGCGGCGCAC TTATGTGCAG AACCACTACA AAAGCGAGGT GGACAGGCTC
TACACGAAGT AA
 
Protein sequence
MNNVSASRQQ VRSALSPAAT PSDAEIPTTS GRQDMAGIIP VETAGTLDGL FRERVRRTPE 
AIAYRDYDRS SGKWRDLSWA RMDERIAHWV IALSKEKLKP GDRVGIMLRN CPEWVMFEQA
ALRLGLVVVP LYPTDRPDNA AYILQDAGVK VLLLEELGQW LAFSEVRDQM TGLVRVIIIQ
GSVRQGHRGN ELVLALHDWL PEWFPGERPT ESLPSVEPLY QLPAEPEHQA PEEVPPSRVL
ARDPHQLATI IYTSGTSGHP KGVMLSHHNI LTNAHSCLQV VPIEESDVLL SFLPLSHTFE
RTAGYYVPMM RGSTVAYARS IPQLQEDLLI IRPTILVSVP RIYERVYAGI RAKLAEGPLL
SRRLFDLAVE IGYNRFEYQQ GRAEKHFSHA LWPLLEILVA KKVMSKLGGR LRAAMSGGAA
LSSEVSRIFI GLGLPILQGY GMTESSPVVC CNTIEDNVPA SVGRPIPGVE VKLGEQNALL
IRGPNVMLGY WNNEEATRAV MTPDGWLNSG DIAEIDEAGH IAITGRVKEI IVMSTGEKIP
PANMEAAILR DPLFEQVMVV GEGRPHLAVL AVLNSGNWES MADEYHLDRN WRRLGGDPKF
EEILLERIAY QIKGFPGYAK IYRIAVVAEP WTVENQMLTP TLKLRRTYVQ NHYKSEVDRL
YTK