Gene Noc_2722 details

Gene Information

Locus tag: Noc_2722
Symbol:
ID: 3704748
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 3091585
End bp: 3093237
Gene Length: 1653 bp
Protein Length: 550 aa
Translation table: 11
GC content: 55%
IMG OID: 637739204
Product: paraquat-inducible protein B
Protein accession: YP_344705
Protein GI: 77166180
COG category: [R] General function prediction only
COG ID: [COG3008] Paraquat-inducible protein B
TIGRFAM ID:
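
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: the variable names are hypothetical, and it assumes the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields above.
start_bp = 3091585
end_bp = 3093237

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last codon is assumed
# to be the stop codon, which does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1653 0 550
```

Under those assumptions the result agrees with the listed Gene Length (1653 bp) and Protein Length (550 aa).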


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGACT TGCTGACTGA GGCCAAGATA CGCCCGCCGC GGCGGATAAA GCTCTCGCCA 
GTCTGGCTGG TCCCCCTCGC AGCTGCACTG ATCGGCGCGT GGCTGGTGTA TCAGAATATT
GCATCTCAAG GTCCGGAGAT CATATTGCAG CTCGACAACG CCGAAGGTGT GGAAGCTGGT
AAAACAGTGG TTAAACTACA CAATGTGGAT GTGGGACTGG TGGAGAAGGT ACGCTTGTCC
AAGGATTACA CGGGAGCCAT CGCCGAGGTG CGCATGAAGG CGGATATGGA CCCTCTATTG
GTGGAGGATA CCCAATTTTG GGTGGCCAAA CCCCGGGTGG GACGGGAAGG TATTAGCGGC
CTGAGCACAA TCCTCTCTGG CGTCTATATC CAGATGCGCC CAGGCAACGC TCAAGACCCA
GCCCGCCGGT TCCAGGTGCT AGAACGGCCA CCCACGATCC GTATCCACAC CGAGGGATTA
TCGCTGGAGT TAGTCAGTAC GGATGATAAC TCCTTAACCA TCGGAGACCC TGTGGTCTAT
CAAGGCCAGG AAGTGGGTCA GATCGATACC GCAGAATTCA ATGCCTCAGC GCTGGAAATG
CACTACGGCG TATTTATCCG TGCTCCCTTC GATGCGCTGA TCACCGAGAA TGTCCAGTTC
TGGCCACGCT CCGGCATTAT CTTTGAGATC ACTTCCGAGG GATTGCAGGT ACAAACAGGC
ACCCTTGAGA CGATGTTGGC GGGGGGTGTC ACGTTTGGGA TTCCACCGGA TCTGAAGGAT
GGCAAACAAG CTGAACAAGG CGCTATATTT CGACTGTATC CGTCGCGCCA AGCCGCCCAG
CAGGATCGTT ACGACCATCA GTACAAGTAC GTCATTCTGT TCGATGACTC GGTGCGCGGT
TTGTACCCAG GAGCGGCTGT CGAATTCCGT GGCGTCCGGG TAGGCACGGT ACTCAACGTC
CCCTTCTTTG GTGACGACTT TGGTATGGAA TATTCCCAGA CCTTTCGCAT CCCAGTACTG
ATCGCCTTCG AACCGCAGCG GCTGGCAGGC ACTACCTGGG CGCAATTCGA TCAAAAAGCC
TGGCAGCAGC ATCTGAACCG CTTGTTCCCT AGAGGGTTGC GGGCCACTAT AAAGGCGGCT
AATCTGCTCA CGGGCGCAAT GTTTGTGGAT CTCGCTTTCA CGGACCAGAA AGGTGCCCAC
CATGAACCGG CGTTTCAAGG CCAGTATCCG TTGCTCCCCA GCCGCAGCAG CGGGCTGGCC
AGTATCGAGG AAAAGATCAC CCGACTATTG GATAAACTCA ATGAACTGGA ACTCGCGCCA
GTGCTGACTA AGTTGCAACA CACCCTAGAA AGCACTAGCG AAGTCATGAA TAAGTCCCAA
AACACCATGG ATCGGCTGAA CTCACTTCTC GGCAGCGAGG CCATGGGCAA GCTCCCTACT
GAGTTCAATG CAACCCTAGA TGAATTACGG AAGACCCTGA ACAGCTATCA GCAGGGAGCG
CCCGTCTACG ATAAGCTCAA CCGCTCCCTT GACCGGCTGA ATCAGGTTCT GGATGATCTC
GCCCCCTTCG TGGAAACGCT GCACAATGCT CCTTCAGCAC TCTTTTTCGG CGATAATGCC
CCTGAAGACC CTATTCCCCA AGCTGCCAAA TGA
 
Protein sequence
MSDLLTEAKI RPPRRIKLSP VWLVPLAAAL IGAWLVYQNI ASQGPEIILQ LDNAEGVEAG 
KTVVKLHNVD VGLVEKVRLS KDYTGAIAEV RMKADMDPLL VEDTQFWVAK PRVGREGISG
LSTILSGVYI QMRPGNAQDP ARRFQVLERP PTIRIHTEGL SLELVSTDDN SLTIGDPVVY
QGQEVGQIDT AEFNASALEM HYGVFIRAPF DALITENVQF WPRSGIIFEI TSEGLQVQTG
TLETMLAGGV TFGIPPDLKD GKQAEQGAIF RLYPSRQAAQ QDRYDHQYKY VILFDDSVRG
LYPGAAVEFR GVRVGTVLNV PFFGDDFGME YSQTFRIPVL IAFEPQRLAG TTWAQFDQKA
WQQHLNRLFP RGLRATIKAA NLLTGAMFVD LAFTDQKGAH HEPAFQGQYP LLPSRSSGLA
SIEEKITRLL DKLNELELAP VLTKLQHTLE STSEVMNKSQ NTMDRLNSLL GSEAMGKLPT
EFNATLDELR KTLNSYQQGA PVYDKLNRSL DRLNQVLDDL APFVETLHNA PSALFFGDNA
PEDPIPQAAK
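
As a rough consistency check between the two sequences above, the first and last lines of the gene sequence can be translated and compared with the ends of the protein sequence. This is a minimal sketch, not the tool that produced this record: `CODON_TABLE` and `translate` are hypothetical names, and the table is built on the understanding that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). Translation stops at the first stop codon.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks stop codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string (whitespace ignored) up to the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and final 33 bases of the gene sequence listed above.
first_block = "ATGAGCGACT TGCTGACTGA GGCCAAGATA"
last_block = "CCTGAAGACC CTATTCCCCA AGCTGCCAAA TGA"

print(translate(first_block))  # MSDLLTEAKI
print(translate(last_block))   # PEDPIPQAAK
```

Both outputs match the first and last ten residues of the protein sequence, and the gene ends in the stop codon TGA.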