Gene Nmul_A1736 details


Gene Information

Locus tag: Nmul_A1736
Symbol:
ID: 3786038
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1986197
End bp: 1989205
Gene Length: 3009 bp
Protein Length: 1002 aa
Translation table: 11
GC content: 56%
IMG OID: 637811822
Product: hypothetical protein
Protein accession: YP_412425
Protein GI: 82702859
COG category: [S] Function unknown
COG ID: [COG3868] Uncharacterized conserved protein
TIGRFAM ID: [TIGR01370] possible cysteinyl-tRNA synthetase, Methanococcus type
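
The coordinates and lengths listed above are mutually consistent, which a little arithmetic can confirm. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes a terminal stop codon that is not counted in the protein length. Neither assumption is stated in the record, but both agree with the listed values.

```python
# Values copied from the gene record.
start_bp = 1986197
end_bp = 1989205

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 3009 0 1002
```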


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.525409
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCGAAGCA CCTTAATACA AAATGCGTTC CGGTGCGCCC CACTGGCTTC CGGAGCAGAA 
AGGATATCTC GACAAGTTTT GCAAACTGCC ACACCCAATC CCATTATTCC CGCCACTGAC
ATCGGAGACA GGGGGGGGCT GCGTATCTGT AAAGGTCGCA AAGATCATGG CGGCATCCGC
GACAACCTGC CTCAACCATC AGGATTCCGC TTCCTGTTCC ATTTTTCCGG TGCGATCCGG
AGCGAAGCAT CGTCGCTCTG TTCAGGCATC TGGCAACTGC TTCTGTTACT GATGCTGATG
CTGATGGCAT CAGCGGCCCA CTCAGACGGC CTGGAGCTGG AGAAACGCGA TATTGTGTTT
TACTATGGCA GCAGGCCCCC TGTCGAAGAT CTGCGCCATT TCGACCAGAT CGTCATCCAG
CCAAGCCAGA TTCTGCCGCA TGAGAAAACA GCGCTGTTGA ATCTGGACTC TTCACTCGTC
TTCGCCTATG TCTCGTTTGG TGAAATCGCC CGCAACAGCG AAGATATGGC GCGTATCGAT
ACGGACTGGT CGATAGGCGT CAACCCTGCC TGGAACAGCC TGGTAATGGA CATGACCGAT
CCTGCCTGGC GCGAATACCT GCTGAAGCAG CACTTCGAAC GCTTGTGGCG TGACGGTTAC
CGCGCCTTTT TTCTCGATAC AGTGGACAGT TACCTGATCG TGACGCCGGA AGGCAAACAA
CGCGAAGAGC AGGAAAAAGG ACTGGTTTCA TTGCTCGCGG AAGTCAAGCA GCGTTTTCCA
GGATGCAAGC TCATTCTCAA CCGGGGCTTC GAAGTACTGG ACCGGGCCAG CCAGTATGCC
GATGGTATGG TGGCGGAATC TCTTTTTCAT GGTTTTGACC CAGTTAGCGG TAAACACGCC
CCAACCAAGG AAGAAAACAG AAAGTGGCTG CTCGATCAGC TGGTGCGGGC GCAAAACGAA
TTCAAGGTGC CCATTACGGT GCTCGACTAT GTCGATCCGG GGAACTGGGT AGAGGCGGAA
AAAACGGCGC GTGACATTCT CAAGCTGGGA TTCATGCCCT GGGTGGCCAA TGGAGACCTG
ACGTGGCTGG GGCAGGGACG AATACGCCTG GCTCCCCGCA AACTGCTGGC TATCGTTAAT
GGAACGCCCG CGCAGCAGAT GGAGCATGAT CTGTTTCGGC ATGCTGCCAT GCCGCTCGAA
TATCTTGGCT TGGCGCTCGA CTATTGGTAC ATCGATCAGC TTCCCTTGCC GATTGAACCG
CTCGTCGGGC GGTATGCCGG GATTATCGCG TGGCTGGGGG AGGACAGTGC GAACGGTACC
GAACGGTATG AGAGTGTCTG CGCACGGCTG CAGTCAGAAG CGAATGCGAA CCTGCCCATA
GTGTTCATGG GTTATCTGCC GGCAGGTGTT GCGTGCCGGA ATCTGCTGGA TTATCAGGGT
GAATTGTACC CGACGACCGG CATATTGAAA CTCGATGCCA TGAATGATCG CCTGGGACGC
CCTGAGTCCG CTCCGGTTAT CGGCAGCGGC ACGCCCGATG TACGGGTTCG AGACCGCAAT
AACGCATGGC TCACCTTGAG CAGTGCTGAC AAAATATTTC ATCCGATTGC GGTGACGAAC
TGGGGCGGGT ACGCGTTACA TCCCCATATC CTGAGTGAGA GCGTCTCCGG GCGGCACGAA
TGGCTGCTCG ATCCTTTTGT TTTTTTCCAG GCGGCGTTGC GTCTGCCGGT GCAGCCGGTA
TTTGACATGA CAACGGAAAA CGGGCGGCGC CTGGAGATTA TCGAGGTGAG GGGAGATCGC
CTGTTTGCAA AGGATGAGCA GGGTGTGGAG GCAATCGACC GGCTCAGGGG CTGGATGGAA
AAAAATCCGC TGCCGATAAC ATTGGGTGTC ATTGAAGCGG AAGTAGCCAC CGAAGAGCAG
CAGAGCAAGC TGCGCCAGGT GGCCAGTTTG CCCCAAGTTC GTCTGGCAAG CCATACCTAC
AGTCACCCTT TTTATTGGGG GGTGTTCGAA GGTAAGACAG ACGCGGATCA ACAATCCTAC
CGATACAGCG TATTCATGAA GGATTACGCG GCGGAAATGA CGCGCGAAAC AGGTGGCACT
CTTCCATTCC TGCGTACGAT GGCACCGGAT TCCCCTCCGT TATTGATCTG GTCTGGGGAT
GGCAAGCCAG GGCCCGCAGT GCTGGCCGCC GCGCAGAAAG CCGGGCTGCC GCATTATGGC
GGAGGGGGGC TGCATTGGCA GAGTGGACAG ATATCGCTTG CAGACCTTGA TCCCGCACTT
CGTCCGACAG AATGGGGTAT TCAGGTGATG ACGCCGCTGA TTGCCGAACC TCTGTTCGCG
CAGCTCTGGT ACGGCGAAGC CCTGAATTTT GGTAAGGTTG GCGAATGGAA CCGGGAGCTC
GACCTTGCAC GCCGGTTGCG TGTTTCGTCC ATATCGATTC ATGCCGATGC GTTCCTGAAC
GAACGGGGGA GGGAGCTTCT GGAGCAGATG GCGGGGGCGC AGCGGAAGGA AAATGTGCTC
GGCATCTGGG TCGATGAATA TGTCACTCGC GTGCGTGCGT TCCAGACGGC AAGCATCGCA
CGCGATCTCG ACGGCAATTG GTCTCTTTTC GGGGATACCC TCAGAACAGT GCGGTTGCCC
CCGACCGAAA TGACGCCGGA AATCTCCAGC GATGTAGTTG GATATAATGA TCGCAACGAC
AACCGCTACA TCCACCTCGC TCGGAATCAC GCAATCCTGA AAGCCGCTCA AGGTAAACGG
AACGGAAATC CGGGACTGAG ATTGATCGAG GCGAGTGCCC CCCTTAAATC CTGGCATATC
AACAATGATG GTTCCGCCAC GCTTTTATTT GAATCGGGCG GCCGTTTGGC GCCCCTGACC
GTCACGGCTC CCGCTTCCTG CGCCTTGAGC ATAAATGACA CAAAGTTGAT ACCGCAGGTG
AAGGGTGCAC ATTCCGTTTA TACAGTTCCA GGAAACCTGA CAGCGGGAAA ATTCCGGCTT
GAGTGTTGA
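
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch of that cleanup together with a GC-fraction calculation. The function names `clean_sequence` and `gc_fraction` are illustrative and not part of any database tool. Running it over the full sequence would allow a comparison with the 56% GC content listed in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, used here as a small sample only.
sample = "TTGCGAAGCA CCTTAATACA AAATGCGTTC CGGTGCGCCC CACTGGCTTC CGGAGCAGAA"
seq = clean_sequence(sample)
print(len(seq), round(gc_fraction(seq), 2))
```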
 
Protein sequence
MRSTLIQNAF RCAPLASGAE RISRQVLQTA TPNPIIPATD IGDRGGLRIC KGRKDHGGIR 
DNLPQPSGFR FLFHFSGAIR SEASSLCSGI WQLLLLLMLM LMASAAHSDG LELEKRDIVF
YYGSRPPVED LRHFDQIVIQ PSQILPHEKT ALLNLDSSLV FAYVSFGEIA RNSEDMARID
TDWSIGVNPA WNSLVMDMTD PAWREYLLKQ HFERLWRDGY RAFFLDTVDS YLIVTPEGKQ
REEQEKGLVS LLAEVKQRFP GCKLILNRGF EVLDRASQYA DGMVAESLFH GFDPVSGKHA
PTKEENRKWL LDQLVRAQNE FKVPITVLDY VDPGNWVEAE KTARDILKLG FMPWVANGDL
TWLGQGRIRL APRKLLAIVN GTPAQQMEHD LFRHAAMPLE YLGLALDYWY IDQLPLPIEP
LVGRYAGIIA WLGEDSANGT ERYESVCARL QSEANANLPI VFMGYLPAGV ACRNLLDYQG
ELYPTTGILK LDAMNDRLGR PESAPVIGSG TPDVRVRDRN NAWLTLSSAD KIFHPIAVTN
WGGYALHPHI LSESVSGRHE WLLDPFVFFQ AALRLPVQPV FDMTTENGRR LEIIEVRGDR
LFAKDEQGVE AIDRLRGWME KNPLPITLGV IEAEVATEEQ QSKLRQVASL PQVRLASHTY
SHPFYWGVFE GKTDADQQSY RYSVFMKDYA AEMTRETGGT LPFLRTMAPD SPPLLIWSGD
GKPGPAVLAA AQKAGLPHYG GGGLHWQSGQ ISLADLDPAL RPTEWGIQVM TPLIAEPLFA
QLWYGEALNF GKVGEWNREL DLARRLRVSS ISIHADAFLN ERGRELLEQM AGAQRKENVL
GIWVDEYVTR VRAFQTASIA RDLDGNWSLF GDTLRTVRLP PTEMTPEISS DVVGYNDRND
NRYIHLARNH AILKAAQGKR NGNPGLRLIE ASAPLKSWHI NNDGSATLLF ESGGRLAPLT
VTAPASCALS INDTKLIPQV KGAHSVYTVP GNLTAGKFRL EC
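
The gene sequence begins with TTG rather than ATG, yet the protein begins with M. This is expected under translation table 11, where TTG is one of several permitted initiation codons and an initiator codon is read as methionine. The sketch below illustrates this with a hand-built codon table; `translate_cds` is a hypothetical helper written for this example, not an existing library function, and it assumes the input is a complete coding sequence in frame.

```python
from itertools import product

# Standard amino acid assignments in TCAG codon order (TTT, TTC, TTA, ...).
# NCBI translation table 11 uses these same assignments.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for NCBI translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, reading the initiator codon as M."""
    seq = "".join(seq.split()).upper()
    if not seq or len(seq) % 3 != 0:
        raise ValueError("sequence length must be a positive multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = "M" + "".join(CODON_TABLE[c] for c in codons[1:])
    # Drop a single terminal stop codon, if present.
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases of the gene sequence.
print(translate_cds("TTGCGAAGCA CCTTAATACA AAATGCGTTC"))  # prints MRSTLIQNAF
```

The output matches the first ten residues of the protein sequence listed above.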