Gene Information |
Locus tag | Noc_2770 |
Symbol | |
ID | 3705500 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 3143180 |
End bp | 3145036 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637739246 |
Product | extracellular solute-binding protein |
Protein accession | YP_344747 |
Protein GI | 77166222 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
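The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive, that the CDS is unspliced (as the record states), and that the gene length includes the terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
# Sketch: recompute the length fields from the record's coordinates.
# Assumptions (not stated by the record itself): coordinates are 1-based
# and inclusive, and the CDS ends with a single stop codon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive start and end positions."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of an unspliced CDS: total codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3143180, 3145036)
print(length)                  # 1857
print(protein_length(length))  # 618
```

Both values agree with the Gene Length and Protein Length rows of this record.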
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000172881 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCGAC GATTTACGGT TAAAGATTTC TTTCTGATTC TATTTCTGAG TCTGCTGGCC CTTCTCATTC TGCTCACCAT GTATATGGTT GATCGGCAAT GGCTCAAGCT CGCTGAAATG GAACAAACTT TGACTGAGCA GGCCCGGGAT TTACGCGGGC TGCGCGATCT GCTACGCGGC ACAGGCGGCG GGCTCACCCT AGCGCCCATG GGAAATCAGG ATACGGCCAT TATCCCACCC GCATTCCAAC GCGCCTATGC CGCTACCCAA CAACCCGATT ACCATCCCGG CGACTGGTTC GTGCAAGCTT TTGGCGCGGG ACTGACGACC TTGACTCCCC TCGTCTCCAC CGATGCCTAT GCCGCCGAAG TCCAAGGCTA CATACTGGAA TCCTTGCTGA CACGCCACCC GGAAACTTTG GATTGGCAAG GTCTGCTAGC CCGGGATTGG GAAGTCAGCG AGGATGGACT GGTTTTTCGT TTCCACTTAC GGGAAGGACT CACTTTCTCC GATGGCGAAC CTCTTACTGC CGAGGACGTG GCCTTCTCTT TCGCCTTTAT CATGAATCCC GCTATTGCCG CGCCCCGGGA GCGGGCTTAC TATGAAAAAA TAGAGCGAGT CACCGCCCTG GGCCCCTACC AGGTGGAGTT CAAATTCAAG GAACCCTACT TTAATAGTCT GGCTCTGGCC GGTGGCTTGG CCATCTTGGC CGAACATTTC TACGAACCTT ATTTGGAGAA GCCGGAAAAA TTTAACCAAT CCAAAGGCTT ACTACTTGGC TCCGGTCCTT ACCGCCTCCC AGATCCTAAA GGCTGGACTC CGGATCAAGG TCTGGTGGAG CTGGAACGCA ACCCCCGCTA CTGGGGGCCG GTGCAGCCCT CTTTCGACAA ACTGCTATGG AAGGTGATTG CCAATGACAG CGCCCGCCTG ACCACCTTCC GCAACGGGGA TATCGATATC TACAGTGCCC GCCCGCTGGA GTATCAAAAA CTGCGGGAAG ATCAGGCATT GGCGGCGCGC ACCCAGCATT TTGAATACAG GAGTCCCGCA GCGGGCTATA GCTACATTGC CTGGAACCAG GAGCGAAACG GCAAACCTAC CCGCTTTGCT GATCGACGGG TCCGCCAGGC CATGACCTTT CTTACTGATC GCGCCAGCAT GATTGAACAA ATCATGCTGG GCCATGGAGA AGCGGCGGTC AGTCCATTCA ACCCCGACAG CCCTCAGCAT GATTCCACCC TCACCCCCCG GCCCTTCGAT CTGGCTAAGG CCAAAACATT ATTGGCACAA GCGGGCTACG CCGACCGGGA CGGGGACGGA GTTCTGGAAG ATGCTAATCA TCAGCCTTTC CGCTTTGAAC TGGTGTTCTT CCAGGATGCC GATGACACCC GCCGTCAGGT GCTATTCCTT AAGGATCTGT ACGCCCGAGC CGGAATTTTG CTTGAGCCCA AACCCACCGA GTGGTCGGTC ATGCTGGATC AAATTAGCAA AAAGAATTTT GATGCTATCA CCCTGGGCTG GACCAGTGGA GTGGAAATCG ATATCTATCA AATGTTCCAC AGCAGCCAAA CCGTAGCCGG CGGCGATAAT TTCATTGCTT ACCAAAATCC AGAACTGGAC GAACTTATTG AAAAAGCGCG GGCGACAGTG GATGAAACCG AGCGTATGCC CCTGTGGCGA GCCTGCGAGC GTATCTTGTA TGAAGACCAG CCCTATACTT TCTTGATGCG GCGTAAATCA TTGGTATTTG TGGACCAGCG TATGCACAAT CTGGGAATCA CCCGTTTGGG CCTAAATCTG 
GGCATGACCC CGGTAGAGGC GTACGTCCCG GAATCCAAGC AGAAGTATAC TTACTAA
|
Protein sequence | MARRFTVKDF FLILFLSLLA LLILLTMYMV DRQWLKLAEM EQTLTEQARD LRGLRDLLRG TGGGLTLAPM GNQDTAIIPP AFQRAYAATQ QPDYHPGDWF VQAFGAGLTT LTPLVSTDAY AAEVQGYILE SLLTRHPETL DWQGLLARDW EVSEDGLVFR FHLREGLTFS DGEPLTAEDV AFSFAFIMNP AIAAPRERAY YEKIERVTAL GPYQVEFKFK EPYFNSLALA GGLAILAEHF YEPYLEKPEK FNQSKGLLLG SGPYRLPDPK GWTPDQGLVE LERNPRYWGP VQPSFDKLLW KVIANDSARL TTFRNGDIDI YSARPLEYQK LREDQALAAR TQHFEYRSPA AGYSYIAWNQ ERNGKPTRFA DRRVRQAMTF LTDRASMIEQ IMLGHGEAAV SPFNPDSPQH DSTLTPRPFD LAKAKTLLAQ AGYADRDGDG VLEDANHQPF RFELVFFQDA DDTRRQVLFL KDLYARAGIL LEPKPTEWSV MLDQISKKNF DAITLGWTSG VEIDIYQMFH SSQTVAGGDN FIAYQNPELD ELIEKARATV DETERMPLWR ACERILYEDQ PYTFLMRRKS LVFVDQRMHN LGITRLGLNL GMTPVEAYVP ESKQKYTY
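The GC content row could be recomputed from the gene sequence. This is a rough sketch, assuming the sequence is pasted as shown here, in space-separated blocks of 10 bases, and that GC content means the percentage of G and C among all bases. The helper name `gc_percent` is hypothetical.

```python
# Sketch: percentage of G and C bases in a nucleotide string.
# Whitespace (the 10-base block separators and line breaks) is ignored.

def gc_percent(sequence: str) -> float:
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Usage: pass the full Gene sequence field; only its first two blocks
# are shown here for brevity.
print(gc_percent("ATGGCGCGAC GATTTACGGT"))
```

Rounding the result for the full sequence to a whole percent should allow a direct comparison with the GC content row.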