Gene Nmul_A0968 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0968 
Symbol 
ID3785759 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1124942 
End bp1126603 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content57% 
IMG OID637811051 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_411663 
Protein GI82702097 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.809053 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCATT CCGGCATCCT GGATATGGTC CCCGCCGAGG TCCGCGCGCA ATGGGCCCGG 
CAGGGAATCT ATCCGAATAA ATCCCTGTAC GAGTTGTTTT GCGAGCGAGT GGAGCAACAG
CCGGATAATC CGGCAGTAAT ATCGCTCGAC CATACCACCA GTTACGCAGC GTTGCTGGAC
AAGGTCCATC GCCTGGCAAC CAGTTTCCAG GAATTGGGCA TTGTTGCGGG CGATGTAATC
TCATACCAGC TTCATAACGA CTGGCGGAGC TGCGCGATCG ACCTGGCGGC GGCCGCGCTT
GGAGCCATCG TGGCACCTTT TCCGCCAGGC CGCGGCCGCC TCGATATCCA GTCCCTGCTC
AGACGCTGTG ACGCCCGTGC AATTATTGTC GAGCGCGAAT ATGGAAAAAC CGACCTTTGC
GAATTGATCG AATCCATACG CCCCACTTTG CTTTCACTGC GCATCCTCGT GGTCGATGGC
GCAGCCGGAG ACGGTTGGCA CGCACTGGAT GAATTGTTCC GGCCCGCCTC CATTGAACCG
GACCTGCCGA CAGTCTGCCC CGATTCACCT GCCCGTTTCC TGATCTCATC CGGCACGGAA
TCCGAGCCCA AATGGGTGGC TTATTCTCAC AACGCGCTGG CGGGCGGACG AGGGCGATTT
CTGCAACGAA TTCATCCCGA AGGAAAGACT TTCAGGGGTC TTTACCTGAT GCCGCTCGGC
ACCGCATTCG GCTCCACGGC GACATTCGGG GTTTTATCCT GGCTGGGCGG CTCCCTCATT
GTCCTGCGGC AATTTGACGT TGCAGCTGCT ATTCAAGCCC TCGCGGAACT GAAGCCCACG
CATATATTGG GCGTTCCCAC GATGTTTCAA CGCATTGCCG CAGACCCTGC GTTGACGCAG
GCGGATACGT CCAGCCTGGT TGCCATCATC AGCGGCGGCG CGAAAATCGA TGAAACCTCC
ATTCGCCGAT GTACGAAGGC GTTCCGATGT GGATTCATCA GCTTGTATGG TTCTGCCGAT
GGCGTCAATT GTCATACGAC CCTGGATGAT GACCTGGAAA CCATTATCAG GACGGCGGGA
AGGCCCAATC CGGAAATCTG TTCCATTCGT ATAATCGATG ACCAGAAGCA GGAAGTACCG
CAAGGTTGCA TAGGAGAAAT AGCGGCCAGG GGTCCGATAA GTCCCATGCA GTACGTCAAT
GACCCGGATC TCGACGCCCT GTACCGTGAC CAGGAGGGAT GGGTGTATAC CGGTGACCTC
GGCCTTATTG ATGAAGAGGG CCATCTGGTG CTATCCGGCC GCAAGAAAGA CATCATCATT
CGGGGCGGCG TCAATATCAG TCCCGCTCAA ATTGAAAACA TTGCTGTTTC CCATCCGGCG
GTTGTCAGCG CAGCCTGTGT TCCGGTGCCC GACGCGGACC TGGGACACAG GGTTTGCCTC
TGTCTCGTCA CGAGAGAGGG AGCGGAACGT CCGTCACTTT CCCAGTTCAC CCGTTTTCTC
CATGAAAAGG GCCTGGAGAC AAGCAAGCTT CCCGAATACC TGCGCTATTA CCGCCAGCTG
CCCCTCAGCC CTGCGGGAAA AATCGATAAG AAGCGGCTGA CTACCGAAAT CGAATTCACG
GAACATCCCG CCCACCGGAG TCATCCCGAA TGGGCACATT GA
 
Protein sequence
MNHSGILDMV PAEVRAQWAR QGIYPNKSLY ELFCERVEQQ PDNPAVISLD HTTSYAALLD 
KVHRLATSFQ ELGIVAGDVI SYQLHNDWRS CAIDLAAAAL GAIVAPFPPG RGRLDIQSLL
RRCDARAIIV EREYGKTDLC ELIESIRPTL LSLRILVVDG AAGDGWHALD ELFRPASIEP
DLPTVCPDSP ARFLISSGTE SEPKWVAYSH NALAGGRGRF LQRIHPEGKT FRGLYLMPLG
TAFGSTATFG VLSWLGGSLI VLRQFDVAAA IQALAELKPT HILGVPTMFQ RIAADPALTQ
ADTSSLVAII SGGAKIDETS IRRCTKAFRC GFISLYGSAD GVNCHTTLDD DLETIIRTAG
RPNPEICSIR IIDDQKQEVP QGCIGEIAAR GPISPMQYVN DPDLDALYRD QEGWVYTGDL
GLIDEEGHLV LSGRKKDIII RGGVNISPAQ IENIAVSHPA VVSAACVPVP DADLGHRVCL
CLVTREGAER PSLSQFTRFL HEKGLETSKL PEYLRYYRQL PLSPAGKIDK KRLTTEIEFT
EHPAHRSHPE WAH