Gene Information |
Locus tag | Noc_2722 |
Symbol | |
ID | 3704748 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 3091585 |
End bp | 3093237 |
Gene Length | 1653 bp |
Protein Length | 550 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637739204 |
Product | paraquat-inducible protein B |
Protein accession | YP_344705 |
Protein GI | 77166180 |
COG category | [R] General function prediction only |
COG ID | [COG3008] Paraquat-inducible protein B |
TIGRFAM ID | |
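The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and that the CDS includes its stop codon; both are assumptions about the record's conventions, not something the record states. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end coordinate must not be smaller than start coordinate")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a multiple of three and, by default,
    that the final codon is a stop codon that is not translated.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of three")
    return codons - 1 if includes_stop else codons


# Values taken from the record: Start bp, End bp.
length = gene_length_bp(3091585, 3093237)
print(length)                     # gene length in bp
print(protein_length_aa(length))  # protein length in aa
```

With the record's values this gives 1653 bp and 550 aa, matching the Gene Length and Protein Length fields.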
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAGCGACT TGCTGACTGA GGCCAAGATA CGCCCGCCGC GGCGGATAAA GCTCTCGCCA GTCTGGCTGG TCCCCCTCGC AGCTGCACTG ATCGGCGCGT GGCTGGTGTA TCAGAATATT GCATCTCAAG GTCCGGAGAT CATATTGCAG CTCGACAACG CCGAAGGTGT GGAAGCTGGT AAAACAGTGG TTAAACTACA CAATGTGGAT GTGGGACTGG TGGAGAAGGT ACGCTTGTCC AAGGATTACA CGGGAGCCAT CGCCGAGGTG CGCATGAAGG CGGATATGGA CCCTCTATTG GTGGAGGATA CCCAATTTTG GGTGGCCAAA CCCCGGGTGG GACGGGAAGG TATTAGCGGC CTGAGCACAA TCCTCTCTGG CGTCTATATC CAGATGCGCC CAGGCAACGC TCAAGACCCA GCCCGCCGGT TCCAGGTGCT AGAACGGCCA CCCACGATCC GTATCCACAC CGAGGGATTA TCGCTGGAGT TAGTCAGTAC GGATGATAAC TCCTTAACCA TCGGAGACCC TGTGGTCTAT CAAGGCCAGG AAGTGGGTCA GATCGATACC GCAGAATTCA ATGCCTCAGC GCTGGAAATG CACTACGGCG TATTTATCCG TGCTCCCTTC GATGCGCTGA TCACCGAGAA TGTCCAGTTC TGGCCACGCT CCGGCATTAT CTTTGAGATC ACTTCCGAGG GATTGCAGGT ACAAACAGGC ACCCTTGAGA CGATGTTGGC GGGGGGTGTC ACGTTTGGGA TTCCACCGGA TCTGAAGGAT GGCAAACAAG CTGAACAAGG CGCTATATTT CGACTGTATC CGTCGCGCCA AGCCGCCCAG CAGGATCGTT ACGACCATCA GTACAAGTAC GTCATTCTGT TCGATGACTC GGTGCGCGGT TTGTACCCAG GAGCGGCTGT CGAATTCCGT GGCGTCCGGG TAGGCACGGT ACTCAACGTC CCCTTCTTTG GTGACGACTT TGGTATGGAA TATTCCCAGA CCTTTCGCAT CCCAGTACTG ATCGCCTTCG AACCGCAGCG GCTGGCAGGC ACTACCTGGG CGCAATTCGA TCAAAAAGCC TGGCAGCAGC ATCTGAACCG CTTGTTCCCT AGAGGGTTGC GGGCCACTAT AAAGGCGGCT AATCTGCTCA CGGGCGCAAT GTTTGTGGAT CTCGCTTTCA CGGACCAGAA AGGTGCCCAC CATGAACCGG CGTTTCAAGG CCAGTATCCG TTGCTCCCCA GCCGCAGCAG CGGGCTGGCC AGTATCGAGG AAAAGATCAC CCGACTATTG GATAAACTCA ATGAACTGGA ACTCGCGCCA GTGCTGACTA AGTTGCAACA CACCCTAGAA AGCACTAGCG AAGTCATGAA TAAGTCCCAA AACACCATGG ATCGGCTGAA CTCACTTCTC GGCAGCGAGG CCATGGGCAA GCTCCCTACT GAGTTCAATG CAACCCTAGA TGAATTACGG AAGACCCTGA ACAGCTATCA GCAGGGAGCG CCCGTCTACG ATAAGCTCAA CCGCTCCCTT GACCGGCTGA ATCAGGTTCT GGATGATCTC GCCCCCTTCG TGGAAACGCT GCACAATGCT CCTTCAGCAC TCTTTTTCGG CGATAATGCC CCTGAAGACC CTATTCCCCA AGCTGCCAAA TGA
Protein sequence | MSDLLTEAKI RPPRRIKLSP VWLVPLAAAL IGAWLVYQNI ASQGPEIILQ LDNAEGVEAG KTVVKLHNVD VGLVEKVRLS KDYTGAIAEV RMKADMDPLL VEDTQFWVAK PRVGREGISG LSTILSGVYI QMRPGNAQDP ARRFQVLERP PTIRIHTEGL SLELVSTDDN SLTIGDPVVY QGQEVGQIDT AEFNASALEM HYGVFIRAPF DALITENVQF WPRSGIIFEI TSEGLQVQTG TLETMLAGGV TFGIPPDLKD GKQAEQGAIF RLYPSRQAAQ QDRYDHQYKY VILFDDSVRG LYPGAAVEFR GVRVGTVLNV PFFGDDFGME YSQTFRIPVL IAFEPQRLAG TTWAQFDQKA WQQHLNRLFP RGLRATIKAA NLLTGAMFVD LAFTDQKGAH HEPAFQGQYP LLPSRSSGLA SIEEKITRLL DKLNELELAP VLTKLQHTLE STSEVMNKSQ NTMDRLNSLL GSEAMGKLPT EFNATLDELR KTLNSYQQGA PVYDKLNRSL DRLNQVLDDL APFVETLHNA PSALFFGDNA PEDPIPQAAK
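The relationship between the gene sequence and the protein sequence can be sketched as a plain codon-by-codon translation. This is a minimal illustration, not the database's own pipeline: it uses the standard codon assignments, which translation table 11 shares for internal codons, and it does not handle alternative start codons (not needed here, since the gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon assignments in TCAG order (shared by translation tables 1 and 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    dna = clean(dna)
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from the record.
print(translate("ATGAGCGACT TGCTGACTGA GGCCAAGATA"))  # MSDLLTEAKI
```

Applied to the full gene sequence, `translate` should reproduce the protein sequence once the spaces are removed from both, and `gc_fraction` gives the value behind the GC content field.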