Gene Information |
Locus tag | Noc_0926 |
Symbol | |
ID | 3707316 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 1021251 |
End bp | 1023872 |
Gene Length | 2622 bp |
Protein Length | 873 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637737434 |
Product | alanine--tRNA ligase |
Protein accession | YP_342968 |
Protein GI | 77164443 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0013] Alanyl-tRNA synthetase |
TIGRFAM ID | [TIGR00344] alanine--tRNA ligase |
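
The coordinate and length fields above agree with each other, and the check can be sketched in a few lines. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon. Both assumptions match the numbers in this record, but neither is stated on the page. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above (Noc_0926).
length_bp = gene_length(1021251, 1023872)
print(length_bp)                  # 2622
print(protein_length(length_bp))  # 873
```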
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.961534 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGCA GCGCCGAACT CCGAAAAATA TTTCTCGATT ATTTTCGCCA CCAGAGCCAT GAGATTGTTC CCAGTGGCCC CTTAGTTCCT GCTAACGATC CGACCTTGCT GTTCACTAAT GCAGGCATGG TCCAATTCAA AGAGGTTTTT CTGGGCAGGG AGGTCCGAGC TTATCATCGT GCGGCAAGCG CCCAGCGTTG CGTCCGCGCC GGGGGCAAGC ATAACGATTT AGAAAATGTA GGTTATACCG CCCGCCACCA TACCTTTTTT GAAATGCTGG GTAATTTCAG CTTTGGGGAT TATTTTAAGC GCGAAGCTAT CGGTTATGCT TGGGAATTGT TGACTGAAGT GCTTAAGCTA CCGCCTGAGC GGCTATGGGT GACAGTATTC AAGGAGGATG ATGAAGCTGC CAATATTTGG TTGCAGGAAA TTGGGGTAAG TCCGGAACGT TTTTCCCGCT GTGGTGCCGA AGACAATTTC TGGTCTATGG GGGAGACGGG GCCCTGCGGC CCGTGTTCTG AAATTTTTTA TGATCATGGA CCGGAAATCG AGGGTGGTCC CCCTGGCAGC CCGGAGCAGG AGGGGGATAG GTACACTGAG ATTTGGAATC TGGTTTTTAT GCAGTATGAC CGGGGTAAGG AAGGCCGGCT TTCTCCCTTG CCTCGTCCTT CGGTAGACAC AGGGATGGGC TTGGAGCGGT TAGCGGCAGT AATGCAGGGA GTGCATGATA ACTACAATAT CGATTTATTC CGCAACCTTA TCGCCGCCAT TACCGCCCTT TCTGGTATTC AAAATCAAGA ACAAACTTCT TTGCGGGTAA TTGCCGATCA TATTCGATCT TGTGTTTTTT TAATCGTAGA TGGTATTCAG CCTTCTAATG AAGGGCGGGG GTATGTTCTG CGCCGCATTA TCCGGCGGGC TATTCGGCAT GGGCATAAAT TGGGCCTGCG TGATCCTTTT TTCTACCGCC TAGTGGAGCC GCTGGTCCAG GAAATGGGAG AGGCTTATCC TGAATTGTTT CGGCTTCAGA GCCAAGTAGA GCGGGTGCTG AAATTGGAAG AGGAGCGCTT TAATGAAACC CTGGAGCAAG GACTCAAAAT TCTAGAGCAG GATATTATTG ATTTATCAGA TGCCGTAATT CCCGGAGAAA CAATATTTCG GCTCTACGAT ACTTTTGGTT TCCCGGTGGA TTTGACTGCC GACATTGCCC GGGAACGGAA ACTCACCCTG GATATGAAAG GCTTTGAGCA AGCCATGGCC AAGCAGCGTA AACGGGCTCG GGCTGCCAGT CGCTTCAAAA TCGAATATGG ATCTGAGCTT CAGATGGATT TGGAGACCGA ATTTACCGGC TATGAACAGC TCCGTGGCGA GGGCCAGATT GCCGCTCTAT TCCGTCTGGC GGAGACTATG GAGACCGTGG AACAACTTAG CGCTGGCGAA AGTGGCATGG TGGTGCTGGA CCGGACTCCA TTTTACGCGG AGGCGGGCGG GCAAGTAGGA GATCGGGGAA CGCTACGTGG CTCCAATGGG TTATTCAACG TGACGGATAC CCACAAGCAG GGCGCCGCCC ATGTTCATCT GGGCGAGGTT CGCTTAGGTC AGCTTCGAGT TGGTGATTCA ATTCAATCCG AGGTGGATCG AAAATACCGA ACTCCTACCC GACTCAACCA TTCGGCAACT CATCTTCTCC ATGCGGCCCT GCGGGAAGTG CTAGGGGAGG GGGTGATTCA GAAAGGCTCC CTGGTGGCTT CCGACCGCCT GCGTTTTGAC TTCTCCCATC TTGAAGCCGT GCAGTCTGGA 
CAATTGCGCC AAATCGAGCA TCTGGTAAAT GCTAAGATTA GGGCTAATTT GCCCGTAGAA ACACAGATTA TGCCTTTGCA GCAAGCCCTA GATGCGGGTG TTATGGCATT GTTTGGCGAG AAATACGGCG AGCAGGTTCG GGTTTTGCGC ATGGGAGACT TCTCGATGGA GCTGTGTGGT GGTACCCATG TGGATCGGAC CGGCGATATT GGTCTTTTCA AGATCATTAA CGAAATGGGT GTTGCCGCTG GAATCCGCCG TATTGAGGCA GTCACGGGAG AGGCAGCGCT ATCCTGGGTA GAGGAGGGCG AGGTATGCTT GGAAGCCCTT ATGGGTCGGC TTAAGGCCTC CCGGAATTCA GCGGTTGACA AGCTGGAGCA ACTGCAGCAA CAGACCCGCC AGCAGGAGAA GGAACTACAG AGACTGAAAG CGAAATTAGC CACTACGGGA GGAGCGGATT TAAGCATTCA GGCGCAAGAA ATTCGGGGCA TCAAAGTTTT GGCGGCCCGA ATTGATGGGG TTGATAGTAA AACCTTGCGC GCCACGGTTG ATCAGCTCAA AGGCAAACTT ATTACTGCTG CCGTTGTATT GGGCACTGTA GTGGAAGATA AAGTCGTTTT AATTGCTGGG GTGACTAACA ATGCCACTAG CCGGATTAAG GCGGGAGATT TGGTCAACTT TGTTGCTGAG CAGGTAGGTG GCCGGGGTGG AGGTCGTCCG GATATGGCCC AGGCGGGAGG CAGGAATCCG GATAAATTGG ATGCTGCCCT TGATTTAGTG CCGAAGTGGG TAGAGGGACA GTTAACTTCC GGAGCGCAAT AA
|
Protein sequence | MKSSAELRKI FLDYFRHQSH EIVPSGPLVP ANDPTLLFTN AGMVQFKEVF LGREVRAYHR AASAQRCVRA GGKHNDLENV GYTARHHTFF EMLGNFSFGD YFKREAIGYA WELLTEVLKL PPERLWVTVF KEDDEAANIW LQEIGVSPER FSRCGAEDNF WSMGETGPCG PCSEIFYDHG PEIEGGPPGS PEQEGDRYTE IWNLVFMQYD RGKEGRLSPL PRPSVDTGMG LERLAAVMQG VHDNYNIDLF RNLIAAITAL SGIQNQEQTS LRVIADHIRS CVFLIVDGIQ PSNEGRGYVL RRIIRRAIRH GHKLGLRDPF FYRLVEPLVQ EMGEAYPELF RLQSQVERVL KLEEERFNET LEQGLKILEQ DIIDLSDAVI PGETIFRLYD TFGFPVDLTA DIARERKLTL DMKGFEQAMA KQRKRARAAS RFKIEYGSEL QMDLETEFTG YEQLRGEGQI AALFRLAETM ETVEQLSAGE SGMVVLDRTP FYAEAGGQVG DRGTLRGSNG LFNVTDTHKQ GAAHVHLGEV RLGQLRVGDS IQSEVDRKYR TPTRLNHSAT HLLHAALREV LGEGVIQKGS LVASDRLRFD FSHLEAVQSG QLRQIEHLVN AKIRANLPVE TQIMPLQQAL DAGVMALFGE KYGEQVRVLR MGDFSMELCG GTHVDRTGDI GLFKIINEMG VAAGIRRIEA VTGEAALSWV EEGEVCLEAL MGRLKASRNS AVDKLEQLQQ QTRQQEKELQ RLKAKLATTG GADLSIQAQE IRGIKVLAAR IDGVDSKTLR ATVDQLKGKL ITAAVVLGTV VEDKVVLIAG VTNNATSRIK AGDLVNFVAE QVGGRGGGRP DMAQAGGRNP DKLDAALDLV PKWVEGQLTS GAQ
|
| |
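
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is illustrative only: it builds the codon table from the compact NCBI-style amino-acid string (translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may act as starts), and it does not handle alternative start codons, which is sufficient here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()  # the page groups bases in blocks of 10
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAAAAGCA GCGCCGAACT CCGAAAAATA"))  # MKSSAELRKI
```

Passing the full gene sequence to `translate` should give the protein sequence listed above once the spaces between the 10-residue blocks are removed.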