Gene Nmul_A2303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2303 
Symbol 
ID3786708 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2619512 
End bp2621359 
Gene Length1848 bp 
Protein Length615 aa 
Translation table11 
GC content53% 
IMG OID637812390 
Productpeptidase S9, prolyl oligopeptidase active site region 
Protein accessionYP_412986 
Protein GI82703420 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACGCT ACACCATTGA ACAATTCATG GCGACGACCT CCATAATGGG GGCGTCGTTC 
AGCGCGGATG AGGAGCGCAT CCTGTTCAGC AGTAATGAAT CGGGTATTTT CAATGCCTAC
ACGCTGCCAG TGACAGGTGG TACGGCAGAG GCGTTGACCT GCTCCGCCAG CGATACCACG
TTTTCGGTCA GCTTCTTTCC GCATGATGAT CGGGTACTGA TTACCCGGGA TTATCATGGC
GATGAAAATT ATCATTTGTT CCTGCTCGCC CCCGACGGCG AGGAGGAAGA TCTGACACCG
GGAGAAAAGC TCAAGGCGCA ATTCATGGGC TGGAGCCCCG ATGGACAGGC TTTTTACGTC
GCTACCAATG AGCTTGATGC CAGATTCTTC GATGTATACC GTTATGACGC CGAGACTTTT
GCGCGAACCT TGCTGTACCG GAATCACCAG GGGTTCGATC TTGGGCCCAT CAGTCGCGAT
GAGCGCTGGA TTGCACTGAA CCGCTCGCAT ACCGCATCGG ACAGTGACAT TTACCTGTTC
GATGTCGAAA AACGGGAAGT AAGGCACCTC ACGCCGCATG AGGGCTCCAT CAGTTTCCAT
GCTGAGACCT TCGATTCCAC TTCGCGCAAG CTCTTCTTTC TTACCACGGA GGGAAGCGAA
TTCAAGCATT TGTGCACCTA TGATCTCACT TCAGGGGTCG TCTGCGACCA CGAGAGTGCC
GAGTGGGACG TGATGTACAC CTATTTCTCA TACGATGGCC GATTCCGCGT AACTGGCATC
AACGCAGATG GAAGCATTGT CGTTCGTGTT GTCGAAATTG AAGATGGAGA GAGGGAGAAA
CCTGTAAAAC TGCCAGCACT ACCCCAGGGA GAAATCCGGG GTGTGGTCTT TTCGCAAAAC
AGCACGCGCA TGGCCTTCTA CGTAAATGGC GACCGCTCTC CCGATAATCT GTTCGTGCAC
GATTTCAGCA CCGGGCAGTT TCGTCAACTC ACGCAAAGCC TGAACAAGGA GATCGATCCA
TCAGATCTGG TGGAGGCGGA AGTGGTGCGG TTCCGATCCT TCGATGGAAT GATGATTCCC
TCCATCTATT ACAAGCCGCA TGAAGCATCA GGCACCAACA AGGTGCCTGC CATCGTGTAC
GTTCATGGAG GACCCGGCGG GCAGACCATG CGGGGTTATA ACGCCCAGAT TCAGTACCTC
GTGAATCATG GATATGCCGT GCTGGGCATC AATAACCGGG GAAGCTCCGG CTATGGTAAA
ACCTTCTTTA CCGCGGCCAA CCGCAAGCAC GGACGAGAGC CCTTGTGGGA TTGTGTGGAA
GCGAAGACCT TTCTGGCCAG TCTCGGTTAC ATCGACCATG AGCGCATCGG CATCATGGGC
GCGAGTTATG GCGGTTACAT GACACTTGCA GCCCTCGCGT TCCGGCCTGA AGCTTTCAAG
GTAGGGGTGG ACATTTTCGG CGTCAGCAAC TGGCTGCGTA CCCTGGAGAG CATTCCTGTT
TACTGGGAGT CCGTCCGCAA AGCCATTTAT GATGAAATTG GCGATCCCGT GGCGGACATC
GATTTTCTTG TTGCGACTTC CCCCCTGTTC CATGCCAGGG AAATACGAAA GCCTTTATTG
GTCATCCAGG GAGTCAATGA CCCTCGTGTG GTCAAGGCCG AGAGCGATGA AATGGTGGAG
GCGGTCAGGA AGCATGGCAT TCCGGTGGAG TACATTGTTT TCCCTGATGA AGGCCATAGT
TTTACCAAGA AAAAAAACCA GATCGAGGCG AATCGGCGAA TACTGGAGTT TCTGGACAAG
TATCTGAAAG GTGATGTGAA CAAGACTGCA AGCCAAGTGG AAAAATAG
 
Protein sequence
MKRYTIEQFM ATTSIMGASF SADEERILFS SNESGIFNAY TLPVTGGTAE ALTCSASDTT 
FSVSFFPHDD RVLITRDYHG DENYHLFLLA PDGEEEDLTP GEKLKAQFMG WSPDGQAFYV
ATNELDARFF DVYRYDAETF ARTLLYRNHQ GFDLGPISRD ERWIALNRSH TASDSDIYLF
DVEKREVRHL TPHEGSISFH AETFDSTSRK LFFLTTEGSE FKHLCTYDLT SGVVCDHESA
EWDVMYTYFS YDGRFRVTGI NADGSIVVRV VEIEDGEREK PVKLPALPQG EIRGVVFSQN
STRMAFYVNG DRSPDNLFVH DFSTGQFRQL TQSLNKEIDP SDLVEAEVVR FRSFDGMMIP
SIYYKPHEAS GTNKVPAIVY VHGGPGGQTM RGYNAQIQYL VNHGYAVLGI NNRGSSGYGK
TFFTAANRKH GREPLWDCVE AKTFLASLGY IDHERIGIMG ASYGGYMTLA ALAFRPEAFK
VGVDIFGVSN WLRTLESIPV YWESVRKAIY DEIGDPVADI DFLVATSPLF HAREIRKPLL
VIQGVNDPRV VKAESDEMVE AVRKHGIPVE YIVFPDEGHS FTKKKNQIEA NRRILEFLDK
YLKGDVNKTA SQVEK