Gene Noc_0859 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0859 
Symbol 
ID3707164 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp940074 
End bp942104 
Gene Length2031 bp 
Protein Length676 aa 
Translation table11 
GC content53% 
IMG OID637737361 
ProductTonB-dependent haemoglobin/transferrin/lactoferrin receptor 
Protein accessionYP_342902 
Protein GI77164377 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4771] Outer membrane receptor for ferrienterochelin and colicins 
TIGRFAM ID[TIGR01783] TonB-dependent siderophore receptor
[TIGR01785] TonB-dependent heme/hemoglobin receptor family protein
[TIGR01786] TonB-dependent hemoglobin/transferrin/lactoferrin receptor family protein
[TIGR03304] outer membrane insertion C-terminal signal 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.32697 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAAAA CAATAAGATA CGCGGGAATT GCGATCATCC CGACCCTATG GGTCGGGATG 
ATCGCAAAAG CAGAAACAGA GACGTCTTTG AATGAGGAGA ATCATGCTTT TGAGCCGATG
ACCGTCATCG TTACCCGCAC TGCACGCTCC TTGGCGGAGT TACCAGCTTC GGTAAGCGTT
CTGGATAGCG AACAAATTAT GCGCCGCCAG GCCCAAAGCA TGGATGATCT CTTACAGGTC
TTACCTAACG TCGATTTCAC CTCCGGACCC CGTCCTATCG GCGAAACCCT GACGATTCGC
GGTCTGAGCA GCGAGCGAAT CCTGACGACC ATCGATGGCG CCCGGCAGGA TTCCAGCATT
GGTCATCTTG GCCGTTTTTT TATCGAACCG GATCTACTGA AACGGGTTGA TGTCTTGCGC
GGACCAGCCT CGGCAATTTA CGGCAGCGGC GCCCTCGGCG GAGTTGTTAC CATGACCACC
CGGGAGGCAA GCGATTTTCT TGCCCCCGGC CAACGTTTCG GCGCTCGGCT CAAGGGCGGT
TACCAAAGCG TTAACAATGA GAGTTCAACC AGCGCCGCCC TGTTCGGACG GGCTTCCGAC
TGGGATTTTC TAGGTAATTT TTCCTACCGG GATTCCGACG ATATCACGCT GGCCAGCGGC
CAAGAACTAG ACAGCTCGGC TGCGGAAAAC TTTTCCGGAC TGGCCCGCGT GAGTTATAAA
CCCGGTGCCC ATCGGCTGCG TATTGGCGGC GATTACTTTA GTACCGAAGG TATTTTTCCT
GCCAATCCCC AAACCGTATC GGACGGGGCC AATGAGAATG CCGCAACCAA AATCGAACGT
CGCACCTACA CCTTCCAGTA TAGTTATGAT GATCCTGCTC ATCCTTGGTT TAAACCCAAA
TTCAATGCTT ACCGCAATGA GTTGCGCGAC AGCCGAAATC GCTTGGAAAG CGGCAGGCAA
ACCACCAGCG AGTTTGTCAC CACCGGATTT AACCTCCAAA ACAGCATGGA TTTTGGCGAT
CCTCAAGATT TTTTAATGCA AACAATTACT TTAGGGGTAA ATTACTTCAA GGATGAAGAG
GAAGGGCGGG AAGATGGCAA TCCTCGGCCC TCCTTTCCCA AGGCGAACAG CGATGTGTGG
GGATTTTACA TTCAAGACGA AATCTCCCTT GGGCAATACC TCAGTCTTAT TCCGGCAGTG
CGCTATGACC GCTATACGTT AGAGATAGAA GAGAGGGCTG GCGGGAGCAC CACCGATGAG
GCCATTTCTC CCCGTATCGG CGGAATGATC CACCTCGCTT CCTGGCTCAA CCTATGGGGC
AGTTACGGCA AGGCATTTCG CGCCCCTACC CTGCCCGAGC GTTTTACTGA AGGACTCCAC
TTCCGAGGAG TTCCAGGCCG TCTTCCCGAT AACTTCTTCA TCCCCAACCC AGATCTTAAA
CCAGAAACGG TTTATACCTG GGAAGCCGGC TTCAAGAGCG CTTGGGAAGA GTTACTGACA
GCAACGGATC GGCTAAATCT GGAATTTACC TATTTCGATA CTAAAGCCGA TAATTTTATC
GATCTAAAGG TCAATACCCT GGGTGGTACA ACGCAAAACG CCAACCTTGA TAAGGCGCGT
CTCCACGGTT TCGAAACGGG CGTTCGCTAT GATAGTGAGT ACTTTTTCGC CGGGGCCAGT
TTTGGCCGCA CGTACGGTGA AAACATCAAT ACTGGCTTGC CCCTCACCAA CGTACAACCC
GCCAAGGGCG TGGTCAATCT AGGCGGCCGT TTCTCCCCCT GGGGGCTTGT GTTCGGTGGA
CGGGGCCGCT TTGTCGCCAA GCAGGATCGC GTTCCTCCTG GTGTGCTGGA AGCGGCTGGC
TATAGCGTGT ATGACCTATA CGCCACTTGG CTGCCTTCCT CCGCCGGAGT CAAAGGGTTA
CGAATGGATT TTGGTATCGA TAACCTAACT AACAAAGCAT ACAGGCGCTA TCTTTCCGTT
ATCGAGGAGG CGGGGCGTAA CTTTAAGGTA GCCCTCACCT ATCAATTTTA A
 
Protein sequence
MRKTIRYAGI AIIPTLWVGM IAKAETETSL NEENHAFEPM TVIVTRTARS LAELPASVSV 
LDSEQIMRRQ AQSMDDLLQV LPNVDFTSGP RPIGETLTIR GLSSERILTT IDGARQDSSI
GHLGRFFIEP DLLKRVDVLR GPASAIYGSG ALGGVVTMTT REASDFLAPG QRFGARLKGG
YQSVNNESST SAALFGRASD WDFLGNFSYR DSDDITLASG QELDSSAAEN FSGLARVSYK
PGAHRLRIGG DYFSTEGIFP ANPQTVSDGA NENAATKIER RTYTFQYSYD DPAHPWFKPK
FNAYRNELRD SRNRLESGRQ TTSEFVTTGF NLQNSMDFGD PQDFLMQTIT LGVNYFKDEE
EGREDGNPRP SFPKANSDVW GFYIQDEISL GQYLSLIPAV RYDRYTLEIE ERAGGSTTDE
AISPRIGGMI HLASWLNLWG SYGKAFRAPT LPERFTEGLH FRGVPGRLPD NFFIPNPDLK
PETVYTWEAG FKSAWEELLT ATDRLNLEFT YFDTKADNFI DLKVNTLGGT TQNANLDKAR
LHGFETGVRY DSEYFFAGAS FGRTYGENIN TGLPLTNVQP AKGVVNLGGR FSPWGLVFGG
RGRFVAKQDR VPPGVLEAAG YSVYDLYATW LPSSAGVKGL RMDFGIDNLT NKAYRRYLSV
IEEAGRNFKV ALTYQF