Gene Information |
Locus tag | Noc_2530 |
Symbol | |
ID | 3704686 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 2882118 |
End bp | 2883986 |
Gene Length | 1869 bp |
Protein Length | 622 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637739009 |
Product | hypothetical protein |
Protein accession | YP_344513 |
Protein GI | 77165988 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000044535 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAT GGTTTCTCTA CAACAGTATC GCATTAGTAC TATTCGGGCT ATCCATGGGG AGCTATGGAC GCGAAATCCA AGTGCCTATA CAACTCAATA ATGAGCTGCT TCGCCATATT CTCATTAGGG AAGTTTATGT CGGTCCCCAT CATACGGCCC AGGTCTGGAA TGATGATAGC GGCTGCAATT CTTTAGTGCT TTCCAATCCC CGGGTCGGTA ATGCCGGCCA GCAGCTTCGC ATTCTCAGTG ACGGAATAGC CAAGCTCGGA ACGCCCATTG GCAATCGCTG TATTCCACTC CTAGACTGGA CAGGAACGAT TGAAGTATTC CAAAAGCCCA TGCTTGGACC CGAACTGACT ACCCTCTACT TCCAAACTGT ACAATCCAAT ATTTATAACG CCGAAGGGCA TAAGGAAGCT GCCACGGGTC AGCTCTGGGA TAGGATCAAA GAATATGTAC ACCCCAGGCT CTCGCAGGTC CGGATTGACC TCCAGCCCCA GTTAGCGGAA CTGCGAAATC TACTGCCCTT GGTATTATCT CCCCGGGACC GCTCACGGAT TCAGACCGCC ATTGATTCCC TAACCCTGAC CGAAGCACAA ACGACCCCGG ACGGGATTAA GGTCGCCTTG CGTTTTACAC TCCCGGATCT CAACACTCCC CCTCCTTCTC CGGAACCGCC CCTCTCACCT GAGGAAATGC AGCGCTGGGA AGCAGCGTGG CAGCAAGGAG ATGCGTTTCT AACCTTTATT ATCAAGCAGG CAGCAGCTGA AAATGAGTTA GCAGAATTAC GCCCGCTTTT GTTGGAAATT CTTTTAGACG CCCGTCACGA TATAGACAAG GCCCTCACTG CTTCAACGCC TGGAACGGCC GATCCCATCC GCACGCTGTT TCTAAAAACC TGGGAACGCC TGGCACCGGT ATTGCGCCAT TTAAGCCTGA GCATGCCCCA TGAAACGGCC CTGCACTACC TGAGCTTTAT TGCCGCCAGT GATGTCCTAA AAACCATTGA CCAGCTTGGC CCGGCGTCTG GCCTCGATAT CTCCACCGAT GGGTTACGCC GGCTAGCCCG GATCATTGCT CCCCAAGCGG GGCACCATCC CTTGTTTTAC AACTTTAAGG TCGATCCTGA GTTGCGGCGC TTATTGGGAT TTAGCGTGGC TCCATCCCCT TCTCGGAAAA ACTCGCAACT CAACCTGAAC GAGGGGTTAT GGCGAAATGC CTGGGCTGCT GATAGCGTTG ATCGACCCCT GATTTCACGG CTCAACCAGT GGGTTCCCAC GACAAAGGAT ATGGGAACCT ACTTACCAAT GGTCCACCAG CTCCTGGATC AAACCGTCAG CCATTTACTC CAAACTCATC CCTTGGAAAG CCAATACCAC TCACTGTACC GCTGGCTCCT GCTGGCTACC GCTTGGCAAG AAAGCTGTTG GCGTCAATTC ACCAAGAAGG GAGACAAGAT CCGACCGTTC CACTCTGGTG GAGGCTCGGT AGGTCTCATG CAGATCAATC AAAACGTCTG GCGAGGTTTT TACGATGTGC ATGACTTGAA CTGGGACATT GCCTACAATG CGCAAGCGGG GGGAGAAATC TTGTTGCGCT ATCTGGTAGA TTACGCCATC AAAAAAGGCG AGCATAAAAA AACAGGTGAT CTCGATAATC TAGCTCGCGC CACCTATGCC GCTTATAATG GAGGACCAGG ACACTTGAGG CGTTACCGTA AGGCAGGTAC GCCAGAGTCC TTGCGTAAGA TAGATGCTTC TTTTTGGGAC AAATACCGAA CCATCAAACA GGGTAACGAA 
CTGGCTGTCG CCCAATGCTT TGGTATAGAA GCCTCGTCTT TATCGCTTCC GCCAGGAAAC AGAAGATAA
|
Protein sequence | MKKWFLYNSI ALVLFGLSMG SYGREIQVPI QLNNELLRHI LIREVYVGPH HTAQVWNDDS GCNSLVLSNP RVGNAGQQLR ILSDGIAKLG TPIGNRCIPL LDWTGTIEVF QKPMLGPELT TLYFQTVQSN IYNAEGHKEA ATGQLWDRIK EYVHPRLSQV RIDLQPQLAE LRNLLPLVLS PRDRSRIQTA IDSLTLTEAQ TTPDGIKVAL RFTLPDLNTP PPSPEPPLSP EEMQRWEAAW QQGDAFLTFI IKQAAAENEL AELRPLLLEI LLDARHDIDK ALTASTPGTA DPIRTLFLKT WERLAPVLRH LSLSMPHETA LHYLSFIAAS DVLKTIDQLG PASGLDISTD GLRRLARIIA PQAGHHPLFY NFKVDPELRR LLGFSVAPSP SRKNSQLNLN EGLWRNAWAA DSVDRPLISR LNQWVPTTKD MGTYLPMVHQ LLDQTVSHLL QTHPLESQYH SLYRWLLLAT AWQESCWRQF TKKGDKIRPF HSGGGSVGLM QINQNVWRGF YDVHDLNWDI AYNAQAGGEI LLRYLVDYAI KKGEHKKTGD LDNLARATYA AYNGGPGHLR RYRKAGTPES LRKIDASFWD KYRTIKQGNE LAVAQCFGIE ASSLSLPPGN RR
|
| |
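The length fields in this record can be cross-checked against the coordinates. The sketch below is a minimal consistency check, not part of the original record. It assumes that Start bp and End bp are 1-based and inclusive, and that the reported gene length includes the terminal stop codon (the gene sequence ends in TAA). The helper names `span_length` and `expected_protein_length` are hypothetical and introduced only for illustration.

```python
# Values copied from the record above.
start_bp = 2882118
end_bp = 2883986
gene_length_bp = 1869
protein_length_aa = 622


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Report whether the derived values agree with the fields in the record.
print("gene length matches:", computed_length == gene_length_bp)
print("protein length matches:", computed_protein == protein_length_aa)
```

Under these assumptions, the coordinates give 2883986 - 2882118 + 1 = 1869 bp, and 1869 / 3 = 623 codons, one of which is the stop codon, leaving 622 amino acids.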