Gene Nmul_A2654 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2654 
SymbolileS 
ID3785266 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp3044724 
End bp3047555 
Gene Length2832 bp 
Protein Length943 aa 
Translation table11 
GC content56% 
IMG OID637812744 
Productisoleucyl-tRNA synthetase 
Protein accessionYP_413333 
Protein GI82703767 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0060] Isoleucyl-tRNA synthetase 
TIGRFAM ID[TIGR00392] isoleucyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGACT ATAAGAAAAC GCTGAACCTG CCCGATACGC CTTTTCCGAT GCGCGGCGAT 
CTCGCGAAGC GAGAGCCTGT GATGCTCAAG GCATGGGAAG AAAAGGATCT TTATCAGAGA
ATCCGGGAAG CCTGCAAAGG CCGCCCCAAG TTCGTGCTGC ATGATGGACC GCCGTATGCC
AACGGGGATA TACACATCGG CCACGCGGTC AACAAGATAC TGAAAGACAT CATCATCAAG
TCGAAGACCT TGGCTGGCTT CGATGCGCCC TATGTGCCGG GCTGGGATTG CCATGGCCTG
CCGATCGAGC ACCAGATCGA AAAGAAGTAC GGCAAGAATT TGCCGGGGGA TAAGGTGCGC
GAGCTATGCC GCGCTTTTGC AAAGGAGCAG GTTGCGCGGC AGAAAGCCGA TTTCATCCGC
CTCGGTGTCC TGGGCGATTG GGATAACCCT TACCTCACCA TGAATTACCG CACGGAGGCG
GGCATTATCC GTGCGTTGGG CAAAATCCAT GAAAACGGCT ATCTCTACCA GGGGCAAAAG
CCGGTCAACT GGTGTATCGA TTGCGGCTCG GCATTGGCTG AAGCGGAAGT CGAATATGAA
GACAAGACCT CACCTGCGAT AGACGTGAAA TTCAGGGTGG CTGATCCGGC CGCATTCTGG
CAAAAGGTCT TGAAACCATC GGATGATTTG CTGCCTCCGG TTACCCATGA TGACAAACCC
GTTTATGTCG TTATCTGGAC GACCACCCCC TGGACCTTGC CGGCCAACCA GGCGGTGAGT
TTGAACCCTG ATGAAGAATA CGTTGCGATC GACACCGGAA GTGAGTTCTT ACTTCTGGCC
GATGCCCTTG CTGACAGCGC CTTGTCCCGC TATGGAATCT TAAAAGCCGA TGTAAGCATT
GCGGGCAGAT GTACGGGTGC CGCCTTGGAA CACATGTTGC TGCAGCATCC GTTTTACTCC
CGGCAGGTGC CCATCATTCT GGGAGAACAT GTCACCATGG ATGCGGGAAC AGGTGCGGTG
CATACCGCTC CCGCGCATGG GGTGGACGAT TATGTGGTGG GCCAGAAGTA CGGCTTGCCG
GTGGATAATC CGGTGGGGAA TGACGGACGC TTCTTCGATT CTGTGCCGCT CGTCGGCGGG
CTTTCCATAT GGAAGGCGAA CGAGGTGGTA ATACAGGCAT TGGAGAAGAA CGGCCTTTTG
TTGAAGAACG AAAAGCTTCA ACACAGCTAC CCGCACTGCT GGCGGCATCG GACACCGATC
ATCTTCCGCG CGACCCCCCA GTGGTTTATA TCCATGGACA AGGAAGTCTC GCGTAAGAAT
CATGGTGAAG TTGCGCAATC GCTGCGCGAA TCGGCGACAG CCGCCGTGGA GGCAACCGCC
TTTTACCCGT CCTGGGGAAA AGCCCGGCTG GAAGCGATGG TCAACAACCG GCCGGACTGG
TGCATTTCGC GCCAACGCAA CTGGGGTGTG CCGATGGCGC TTTTCGTCAA CAAGGAAACC
CATGAACTGC ACCCGCGCAC GAGCGAACTG CTGGAGCAGG TTGCTCAACG GGTGGAGCAG
GAAGGGATTG AAGCATGGTT TCGTTTGGAT GCGAAGGAAC TCCTGGGAGA GGAAGCGAAA
GATTACAAGA AGCTTACGGA TACGCTGGAT GTGTGGTTCG ATTCGGGTAC GACGCACGAT
ACGGTTCTGA AGCCTGATCC GCAATTGAAG TATCCGGCCG ACCTTTACCT GGAGGGGTCG
GACCAGCACC GCGGCTGGTT CCAGTCCTCC CTGCTCACCG GTTGCGCGAT TGACGGGCGC
GCTCCTTACG ATGCCCTGCT CACGCATGGC TTCGTGGTGG ACGGGCACGG CCATAAGATG
AGCAAATCCA GGGGGAATGT GATTGCGCCG CAGAAGGTGA TGGATACGTC GGGAGCGGAT
ATCCTGCGAT TATGGGTGGG GTCCACGGAT TATTCGGGAG AGCTTTCGAT ATCGGATGAA
ATCCTGAAGC GGGTGGTGGA GAGCTACCGG CGGATACGCA ATACGCTGCG TTTTCTGCTT
GCCAATCTTG CAGATTTTGA TTCATCAACC GATGCCGTGC CCATCGAGCA GTGGCTGGAA
ATAGACCGCT ACATTCTGGC CTTTACGAGA AGGCTCCAGG ATAAGGTGGT GGAAGATTAC
TGCAACTTTG ACTTTCATTT GATCGTGCAT CGCTTGCATA ACTTCTGTTC CGAGGATCTT
GGCGGTTTCT ACCTCGATGT CCTCAAGGAC CGGCTATATA CAGCCAGCGC CAATGGTCTT
CCGCGCCGTT CCGCCCAAAG CGCGCTGCAT CACATTGCCC ATAGCCTCGT GCGCTTGTTT
GCGCCCATCC TCAGCTTTAC TTCCGAAGAG GTATGGCAAT ATCTCCACGG CGATTCGGAG
AACAGCATTT TCCTGCATAC ATGGCACAGG CTACCGCCGC AGCCGGATGA GGAAGCGCTC
GTGACGCGCT GGAGCAGGAT ACGCGAATTG CGCTCGCAGG TTCAAAAAGC CCTGGAGGAG
TCGCGCGCCT CCGGGAAGAT CGGTTCTTCG CTTGCAGCCG AGGTGCAGAT TCGAGTTTCT
GGAGATGATT TTTCTCTGCT GGAGTCGCTA GGCGATGATC TCCGGCTTGT CATGATCACC
TCGGCAGCGC AGGCCGTTCA CATTGAGAAA CTGGAGGGCG AAAGCATTGC CGTAACGCCC
AGCGTCCACC CGAAATGCGA ACGGTGCTGG CATTATCGGG AGGATGTAGG TGTGGACAGG
GAGCACCCGA CCATTTGCGG GCGATGCGTA TCCAACCTGT ATGGGGCGGG CGAGGCTAGA
CGATATGCTT GA
 
Protein sequence
MTDYKKTLNL PDTPFPMRGD LAKREPVMLK AWEEKDLYQR IREACKGRPK FVLHDGPPYA 
NGDIHIGHAV NKILKDIIIK SKTLAGFDAP YVPGWDCHGL PIEHQIEKKY GKNLPGDKVR
ELCRAFAKEQ VARQKADFIR LGVLGDWDNP YLTMNYRTEA GIIRALGKIH ENGYLYQGQK
PVNWCIDCGS ALAEAEVEYE DKTSPAIDVK FRVADPAAFW QKVLKPSDDL LPPVTHDDKP
VYVVIWTTTP WTLPANQAVS LNPDEEYVAI DTGSEFLLLA DALADSALSR YGILKADVSI
AGRCTGAALE HMLLQHPFYS RQVPIILGEH VTMDAGTGAV HTAPAHGVDD YVVGQKYGLP
VDNPVGNDGR FFDSVPLVGG LSIWKANEVV IQALEKNGLL LKNEKLQHSY PHCWRHRTPI
IFRATPQWFI SMDKEVSRKN HGEVAQSLRE SATAAVEATA FYPSWGKARL EAMVNNRPDW
CISRQRNWGV PMALFVNKET HELHPRTSEL LEQVAQRVEQ EGIEAWFRLD AKELLGEEAK
DYKKLTDTLD VWFDSGTTHD TVLKPDPQLK YPADLYLEGS DQHRGWFQSS LLTGCAIDGR
APYDALLTHG FVVDGHGHKM SKSRGNVIAP QKVMDTSGAD ILRLWVGSTD YSGELSISDE
ILKRVVESYR RIRNTLRFLL ANLADFDSST DAVPIEQWLE IDRYILAFTR RLQDKVVEDY
CNFDFHLIVH RLHNFCSEDL GGFYLDVLKD RLYTASANGL PRRSAQSALH HIAHSLVRLF
APILSFTSEE VWQYLHGDSE NSIFLHTWHR LPPQPDEEAL VTRWSRIREL RSQVQKALEE
SRASGKIGSS LAAEVQIRVS GDDFSLLESL GDDLRLVMIT SAAQAVHIEK LEGESIAVTP
SVHPKCERCW HYREDVGVDR EHPTICGRCV SNLYGAGEAR RYA