Gene Nmul_A1548 details


Gene Information

Locus tag: Nmul_A1548
Symbol:
ID: 3785270
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1773522
End bp: 1774637
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 54%
IMG OID: 637811636
Product: carboxylate-amine ligase
Protein accession: YP_412243
Protein GI: 82702677
COG category: [S] Function unknown
COG ID: [COG2170] Uncharacterized conserved protein
TIGRFAM ID: [TIGR02050] uncharacterized enzyme
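
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The constant and function names are illustrative and are not part of the record.

```python
# Values copied from the record above.
START_BP = 1773522
END_BP = 1774637
GENE_LENGTH_BP = 1116
PROTEIN_LENGTH_AA = 371


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

The subtraction of one codon reflects that the stop codon is counted in the gene length but contributes no residue to the protein.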


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCTGA TGCCCTTCGC GCCCTCCCGT CCCCTGAGCA TAGGAGTGGA GCTTGAACTG 
CAATTGCTCG GATGCAACGA TTACAACCTC GCCCCTTCTG CACCCGAGAT TTTGCGTCGC
GTAGCAAAGC GCACTCATCC CGGGGAAATC AAACCGGAGA TGACGCGCAG TATGATAGAG
ATCAACACCT CGGTGCAGCA GGAGTATGCC GGACTGGTGA CGGAACTCCG TGCCCTGCGG
GACGTTGTTT CCGAGGCCGG ATGTTTTCTC AACGTAGCTG TTGCAGGCGG AGGCACCCAC
CCGTTCCAGC ATTGGAGCGA GCAGAAGATT TTCGATGCGC CGCGCTTTCA TTACCTCTCG
GAGCTGTACG GCTACCTCGC AAAACAATTC ACGATATTCG GACAGCACGT GCACATCGGC
TGTCCCGGGC CGGACGAAGC GCTCCATCTG ACGCACATGC TCTCGCGCTA CATTCCGCAT
TTTATTGCTC TGTCGGCCAG TTCGCCGTTT GTGCAGGGGC ATGATACCGG CTTTGCCTCT
GCCCGATTGA ACTCCGTTTT TTCCTTCCCA TTAAGTGGGC GAGCTCCCTT CGTATTGCGC
TGGAACGATT TCGAAAAATT TTTCGCCAAG ATGACGGGAA CTGGAGTGGT CGAATCCATG
AAAGATTTTT ACTGGGATAT AAGGCCAAAG CCCGAGTTCG GTACCATCGA GGTGAGGGTA
TGCGACACGC CCCTTACAGT CGAGATTGCA GCCTCCATTG CCTGCTATAT TCAGGCAATG
TCCAGATACA TCATGGTGGA GCAGCGTATG GCGCCCGAAG AGGATGACTA TCTGGTGTAT
ACGTTCAACC GCTTCCAGGC ATGCCGCTTT GGCCTGGAAG GTGTCTTTAT CGATCCTCGT
ACCCATCAGC AACGTAGTAT CCGGGAAGAT ATAATGGAGA TGCTTGAACA CATTTCCGAC
CATGCAAGAG AACTGCACGC GGTGGAAGCG ATGGAACGAA TCCGCGAGAT ACTCATCGTC
GGTAACGGCA CAAGTTGGCA GCGCAGGGCT TATGCAAGCG AGCACAACCT TGCCGACGTC
ATGCAGTTGC AAGCCGAATT ATGGATGGGA AACTGA
 
Protein sequence
MSLMPFAPSR PLSIGVELEL QLLGCNDYNL APSAPEILRR VAKRTHPGEI KPEMTRSMIE 
INTSVQQEYA GLVTELRALR DVVSEAGCFL NVAVAGGGTH PFQHWSEQKI FDAPRFHYLS
ELYGYLAKQF TIFGQHVHIG CPGPDEALHL THMLSRYIPH FIALSASSPF VQGHDTGFAS
ARLNSVFSFP LSGRAPFVLR WNDFEKFFAK MTGTGVVESM KDFYWDIRPK PEFGTIEVRV
CDTPLTVEIA ASIACYIQAM SRYIMVEQRM APEEDDYLVY TFNRFQACRF GLEGVFIDPR
THQQRSIRED IMEMLEHISD HARELHAVEA MERIREILIV GNGTSWQRRA YASEHNLADV
MQLQAELWMG N
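
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and by counting G and C bases. The sketch below is one possible way to do that in plain Python. It builds the codon table from the usual compact NCBI layout; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the start codons it permits. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
# Codon table in the compact NCBI layout (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGAGCCTGA TGCCCTTCGC GCCCTCCCGT CCCCTGAGCA TAGGAGTGGA GCTTGAACTG"
print(translate(first_line))  # MSLMPFAPSRPLSIGVELEL
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison with the listed protein sequence and the reported GC content.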