Gene Noc_1797 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1797 
Symbol 
ID3705314 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2028480 
End bp2029931 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content52% 
IMG OID637738281 
Producthypothetical protein 
Protein accessionYP_343798 
Protein GI77165273 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1858] Cytochrome c peroxidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.636063 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGTCAA AAATAGTTCG CTGGATTATT GTGCTGCTGG CATTAGTCCT GGTGATTGGA 
GCCGTTGGCT GGTACAAACT TTTGCGCGAG GTGGAACAAA CCTCCCTGAA AGAGTCCTCG
GCGGCGGAAT GGTTTAAATA CGGTTCCATT AGCAGTGAAG AGGAGCAAGG CGTTCCCTAT
TGGATCTGGC GAGTTTTGCC TAAGATGTTC CCTGAATATT TGCCTGCCCC CGGTGGTTAT
GCCGCCCTCG GTGTTCCTTG GGAACAGGGG CAGGAATTGC CGGTAGGGTT CTCTAAAAAA
ACCATTGGTT TCCCCCGCGT TGCCTTTAAC TGCGCCTTCT GCCATTCGGC CAGGTACCGG
CTCAAGGCAG ACGAGCCAGC CACTATAGTG GTGCCGGGAC CAGGGAATAC CGTCAGGCCC
CAAGACTATG CCCGCTTTCT TGCGGCATCG GCGAATGATG CCCGCTTTAA TTCCGATAAT
ATTCTGGAGC AGATCAGCTT GATCTACGAA TTGTCGTGGC TAGATAGGCA GCTCTATCGT
TACTTGATTA TTCCCATGAC TAAAAAAGCG CTGATCCAAT ATGGGCAGGA GTTTGCTTGG
GCCCAAGGGA AGCCCCCCTG GGGGACCGGG CGCATCGATC CTTTTAATCC CATCAAGTTT
GGAATCCTGC AGATGGGAAT CGATGCAACC ATTGGCAATT CGGACATGAT GCCTCTATGG
AATTTGAAAG TCCGGGAAGG AGATGCGCTT CATTGGGATG GTCTAAATAC TAATCTTCAT
GAAGTAGTCA TCAGTTCCGC CATTGGCGAT GGCATGACCT ACAAAGCCAT TGCCCATGAT
AGCTTGGATC GTATCGAGGC GTGGTTACAG GAAGTGCCTT CGCCGGCTTC ACCCTTTAAT
GCCAATGAGA ACCCTGCTTC TCCTTACTAC TTGGATGAGC AGCAAGCGGC AATAGGTAAA
GCTATTTATG AGCAGCATTG CGCCACATGC CATGCGCCAG GAGGAGAACG CCATAGAACG
GTAATTCCGG TTGAAGAGGT GGGCACTGAT CGCCACCGGG TGGATATGTG GACAGCCGAA
GCCGCTAAGC GCTACAACGC CTATCAGGAA GATTACGATT GGGGAATGCG TCACTTTAGG
GACGTGGATG GTTATGTGGC GGTGCCCCAT GATGGCTTGT GGTTGCGAGG CCCCTATCTC
CACAATGGCT CCGTGCCTAC CCTGCGGGAT ATGTTGAAAA AACCGGAAGA TCGGCCCCAA
GTATTTTACC GGGGTTACGA TCTTTTTGAT CCCATCAATG TAGGTTTCGT GTCCCAGGGA
GAAGAGGCTG AGCGGATTGG TTTTCGCTAT GACACGGGGG TACCTGGCAA TAGCAACCAA
GGCCATTTAT TTGGCACGGA TCTTCCGGAA GATCGGAAAG AAGCGTTGCT TGAATACTTA
AAAACGCTTT GA
 
Protein sequence
MQSKIVRWII VLLALVLVIG AVGWYKLLRE VEQTSLKESS AAEWFKYGSI SSEEEQGVPY 
WIWRVLPKMF PEYLPAPGGY AALGVPWEQG QELPVGFSKK TIGFPRVAFN CAFCHSARYR
LKADEPATIV VPGPGNTVRP QDYARFLAAS ANDARFNSDN ILEQISLIYE LSWLDRQLYR
YLIIPMTKKA LIQYGQEFAW AQGKPPWGTG RIDPFNPIKF GILQMGIDAT IGNSDMMPLW
NLKVREGDAL HWDGLNTNLH EVVISSAIGD GMTYKAIAHD SLDRIEAWLQ EVPSPASPFN
ANENPASPYY LDEQQAAIGK AIYEQHCATC HAPGGERHRT VIPVEEVGTD RHRVDMWTAE
AAKRYNAYQE DYDWGMRHFR DVDGYVAVPH DGLWLRGPYL HNGSVPTLRD MLKKPEDRPQ
VFYRGYDLFD PINVGFVSQG EEAERIGFRY DTGVPGNSNQ GHLFGTDLPE DRKEALLEYL
KTL