Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_0859 |
Symbol | |
ID | 3707164 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 940074 |
End bp | 942104 |
Gene Length | 2031 bp |
Protein Length | 676 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637737361 |
Product | TonB-dependent haemoglobin/transferrin/lactoferrin receptor |
Protein accession | YP_342902 |
Protein GI | 77164377 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4771] Outer membrane receptor for ferrienterochelin and colicins |
TIGRFAM ID | [TIGR01783] TonB-dependent siderophore receptor [TIGR01785] TonB-dependent heme/hemoglobin receptor family protein [TIGR01786] TonB-dependent hemoglobin/transferrin/lactoferrin receptor family protein [TIGR03304] outer membrane insertion C-terminal signal |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.32697 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAAAA CAATAAGATA CGCGGGAATT GCGATCATCC CGACCCTATG GGTCGGGATG ATCGCAAAAG CAGAAACAGA GACGTCTTTG AATGAGGAGA ATCATGCTTT TGAGCCGATG ACCGTCATCG TTACCCGCAC TGCACGCTCC TTGGCGGAGT TACCAGCTTC GGTAAGCGTT CTGGATAGCG AACAAATTAT GCGCCGCCAG GCCCAAAGCA TGGATGATCT CTTACAGGTC TTACCTAACG TCGATTTCAC CTCCGGACCC CGTCCTATCG GCGAAACCCT GACGATTCGC GGTCTGAGCA GCGAGCGAAT CCTGACGACC ATCGATGGCG CCCGGCAGGA TTCCAGCATT GGTCATCTTG GCCGTTTTTT TATCGAACCG GATCTACTGA AACGGGTTGA TGTCTTGCGC GGACCAGCCT CGGCAATTTA CGGCAGCGGC GCCCTCGGCG GAGTTGTTAC CATGACCACC CGGGAGGCAA GCGATTTTCT TGCCCCCGGC CAACGTTTCG GCGCTCGGCT CAAGGGCGGT TACCAAAGCG TTAACAATGA GAGTTCAACC AGCGCCGCCC TGTTCGGACG GGCTTCCGAC TGGGATTTTC TAGGTAATTT TTCCTACCGG GATTCCGACG ATATCACGCT GGCCAGCGGC CAAGAACTAG ACAGCTCGGC TGCGGAAAAC TTTTCCGGAC TGGCCCGCGT GAGTTATAAA CCCGGTGCCC ATCGGCTGCG TATTGGCGGC GATTACTTTA GTACCGAAGG TATTTTTCCT GCCAATCCCC AAACCGTATC GGACGGGGCC AATGAGAATG CCGCAACCAA AATCGAACGT CGCACCTACA CCTTCCAGTA TAGTTATGAT GATCCTGCTC ATCCTTGGTT TAAACCCAAA TTCAATGCTT ACCGCAATGA GTTGCGCGAC AGCCGAAATC GCTTGGAAAG CGGCAGGCAA ACCACCAGCG AGTTTGTCAC CACCGGATTT AACCTCCAAA ACAGCATGGA TTTTGGCGAT CCTCAAGATT TTTTAATGCA AACAATTACT TTAGGGGTAA ATTACTTCAA GGATGAAGAG GAAGGGCGGG AAGATGGCAA TCCTCGGCCC TCCTTTCCCA AGGCGAACAG CGATGTGTGG GGATTTTACA TTCAAGACGA AATCTCCCTT GGGCAATACC TCAGTCTTAT TCCGGCAGTG CGCTATGACC GCTATACGTT AGAGATAGAA GAGAGGGCTG GCGGGAGCAC CACCGATGAG GCCATTTCTC CCCGTATCGG CGGAATGATC CACCTCGCTT CCTGGCTCAA CCTATGGGGC AGTTACGGCA AGGCATTTCG CGCCCCTACC CTGCCCGAGC GTTTTACTGA AGGACTCCAC TTCCGAGGAG TTCCAGGCCG TCTTCCCGAT AACTTCTTCA TCCCCAACCC AGATCTTAAA CCAGAAACGG TTTATACCTG GGAAGCCGGC TTCAAGAGCG CTTGGGAAGA GTTACTGACA GCAACGGATC GGCTAAATCT GGAATTTACC TATTTCGATA CTAAAGCCGA TAATTTTATC GATCTAAAGG TCAATACCCT GGGTGGTACA ACGCAAAACG CCAACCTTGA TAAGGCGCGT CTCCACGGTT TCGAAACGGG CGTTCGCTAT GATAGTGAGT ACTTTTTCGC CGGGGCCAGT TTTGGCCGCA CGTACGGTGA AAACATCAAT ACTGGCTTGC CCCTCACCAA CGTACAACCC GCCAAGGGCG TGGTCAATCT AGGCGGCCGT TTCTCCCCCT GGGGGCTTGT GTTCGGTGGA CGGGGCCGCT TTGTCGCCAA GCAGGATCGC GTTCCTCCTG GTGTGCTGGA AGCGGCTGGC TATAGCGTGT ATGACCTATA CGCCACTTGG CTGCCTTCCT CCGCCGGAGT CAAAGGGTTA CGAATGGATT TTGGTATCGA TAACCTAACT AACAAAGCAT ACAGGCGCTA TCTTTCCGTT ATCGAGGAGG CGGGGCGTAA CTTTAAGGTA GCCCTCACCT ATCAATTTTA A
|
Protein sequence | MRKTIRYAGI AIIPTLWVGM IAKAETETSL NEENHAFEPM TVIVTRTARS LAELPASVSV LDSEQIMRRQ AQSMDDLLQV LPNVDFTSGP RPIGETLTIR GLSSERILTT IDGARQDSSI GHLGRFFIEP DLLKRVDVLR GPASAIYGSG ALGGVVTMTT REASDFLAPG QRFGARLKGG YQSVNNESST SAALFGRASD WDFLGNFSYR DSDDITLASG QELDSSAAEN FSGLARVSYK PGAHRLRIGG DYFSTEGIFP ANPQTVSDGA NENAATKIER RTYTFQYSYD DPAHPWFKPK FNAYRNELRD SRNRLESGRQ TTSEFVTTGF NLQNSMDFGD PQDFLMQTIT LGVNYFKDEE EGREDGNPRP SFPKANSDVW GFYIQDEISL GQYLSLIPAV RYDRYTLEIE ERAGGSTTDE AISPRIGGMI HLASWLNLWG SYGKAFRAPT LPERFTEGLH FRGVPGRLPD NFFIPNPDLK PETVYTWEAG FKSAWEELLT ATDRLNLEFT YFDTKADNFI DLKVNTLGGT TQNANLDKAR LHGFETGVRY DSEYFFAGAS FGRTYGENIN TGLPLTNVQP AKGVVNLGGR FSPWGLVFGG RGRFVAKQDR VPPGVLEAAG YSVYDLYATW LPSSAGVKGL RMDFGIDNLT NKAYRRYLSV IEEAGRNFKV ALTYQF
|
| |