Gene Information |
Locus tag | Rsph17025_2420 |
Symbol | |
ID | 5083652 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | + |
Start bp | 2469090 |
End bp | 2471498 |
Gene Length | 2409 bp |
Protein Length | 802 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640483982 |
Product | DNA/RNA non-specific endonuclease |
Protein accession | YP_001168613 |
Protein GI | 146278454 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG1864] DNA/RNA endonuclease G, NUC1 |
TIGRFAM ID | |
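The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state the convention, but the listed figures agree under it). The function names are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 2469090
END_BP = 2471498


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, ASSUMING 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS: one per codon, minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                           # 2409, matches "Gene Length"
print(expected_protein_length(length))  # 802, matches "Protein Length"
```

The stop codon is counted in the gene length but not in the protein length, which is why 2409 bp corresponds to 802 aa rather than 803.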
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.352257 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.785952 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGAACG GCGCCGCCGA TCGGTTCGAG AATGTGCTCG ACCTGCAGAC GGCGCATCTG GCGCTGATGC AGCGCCAGCA GGACCGTCGC TCGGCCGGCG GCGGGCTTCT GCCGCAAGAG GAGATCACCG ACTTCCTCGC GCGCGTGGCA CGGACCGGCG CCGTCCTGTC CACCCCGGCG GATCGGAGGA TCGCGCAGCG GGTGCTGGAC TACTGGACCG CGGACCTGCT CGACACGGCG CGCGGCTATT CCGGCCCGGT GGCCACCGAA ACCCTGCTTG CCTGCGAGGC GGAGGACAGC GACGCACGGC CCGCCGGGCT GGCCGAGGGG CACGGCTCGC GCGAGTATAT CCGGCTTGCG GCGCAGGCGC GCCAGTGGCG CGACACCCGC TCGCACGGCT ATCTCCTGTC GGGCAAGGCA CTGCGTTCGG CCGAACGGTT CAGCCGCGAT CCCGAGATCG CCGATCTCAT CGCCGCCAGC CTCGCCGAGG AGCAGCGCGA GGCCCGCCGC GCGCGTCGCC GCAAGCGGAT CGCGGCGGGT CTCACGCTGG CTCTCGTGGC GGCGGTCGCG ATGGCGGGGA TCTTCTTCCT CAAGGTCGAA ACAGCGACGC ATGAGGCCGC CGAGGCGGCC GGCGAGAAGG GCGCGCTGGC GCGCGACGTG GTCTTTCTGG GCGACGAGGA GCGGCTCCGC GCCCAGGAGC GGCAGGTGGC GCTGGAAAAC GCCAATGTCG AGCGCCGGAT CGCACAGGAA CACATGGATG CGCTCTCGGA ACGCCAGAGC CGGCTTGACG CGGCGCAGGG CGCGCTGGCC GATCTGGTGA CGGCCGAAAG GTTGCCGCTG GCGGGCCTCC CCGACGGGGT GGCCGAGGAT GTGCTGCGCA TCCTTGCCCT CCGTCAGGCC GAGGGGCGGC TCGATCCTTC GGTCCTTGCG CCGGATGTGG CCGCGGCTTT GGCGCCGGTT GCCGCCGATA TGGAGGGCTC GGTCTTCGCG CTCGATCTGA AGGGCTATGA TCCGCTCTTT CTGGGCCGGT CCCTGCCGCT GCCCGCGCTT GATCGCGCGG CGCAGGCTGC GGCCTTCCGG GGCGGCGAAG CCGTGCCCTA TGTGCATTTC AGCTTCCTCT ACAACCAGGC GCGGCGGGCG CCGCTGGTTG CGGCTGTCAA CTTCGACCGT GCCGCGCGGC AGGTGCTGCC CGCCACCGGC ACGCCCATCG AGCCCGACCC GCGCCTGCCG CCCGAGCTGC GGCCCGATCC CTCCCGGTTC GAAGGGGGGC TCGTCGCGGC GGACTATGTG GACCGGACCA TGATCTCGTG GGGAGAGCCG CTGGCCGCCG ATCCCTTCCG CACGGCGCGG ATGCTGGACC AGTCGGTCCA GCTGCATCTC AACAAGGCCC CCGTGCATCC CGCCGCCGCC GCGGTCTGGA CCGGCCTCAC CCGCTGGATC CGCGAGCAGC ACAACCGCTC GGCCACCCGC GTCACCTTCT TCACCGGCCC GATCTTCCAG CCCGGCGAAA GCGCGGTGCC GGCCTCGCTC TGGCTGATCG CCGTCTCGCT GCGGGATCCC GTGTGGGTGC CCGCGGGACA GGAGCAGCCC TTCGTGGCCG AAGCCTTCCT GATCCCGAAC CGGCCCGACA CGCTGATGGA AGAGCCCTGG AAACTCGCCA TGACCATCGA GGGGATCGGC CGCGCGACGG GGCTGCGCTT TCTTGACGAG ATCGTCCGTG CGGATCGGGG GAGGACCATC GTGAACGCCA CCGAGGGCGA CCGCCTCGCC GACCGGGCGG GCGCGCTGAA CGACCCGCCC 
TCGGAGGATC AGACGGCGCT GATGGCCGAG CTGGCGCTGG CGCTGCAGGG CGGGCGGCTT CCCGCCTCGG AGCAGGCCAA GATCATCCGC GAACTCGCCG GCCTGCTCGC CGGTCCGCCG GACCTGACCT CGGCGGGACG GGTCAATGTC CTGACGCTGC TGGCCGGGGT GCCCGCCGAA AGCTGGAACC GCCCGGACTG GATCGTGCTC AAGGCCGAAG TCCGCCGCGC GGTGGTGCGG GTGCGGGAGC CCGCTCCCGA ACCCGAGGCA CAGGGCCTTG TGGATCGTCT GGCCGGTGCG CTCGGGCTGG ACGAGCCGCC GCCTCAGAGG GTCTTCATCC AGTTCGCCGA CATGACCCGC GAAAGCGTCC GAAGCCTGGC CGAGCGGATC GCGGCACTCG GCTGGACCGT GCCACCGGAG GAGCGCGTGG CCGATGCGAG CGGCCTCAAC GAGGTGCGTT TCAACCCGGA GAGCGCCGAA GACGCGGCCG CGGCCCGCCT CCTGGCCGCC GATCTGGCGG CGGCGGGCCG ACCGGGGGTC CGAGCGGTGC CCCTGTCCGT GATCCGTCCG CAGGTTCTCG AGGTCTGGAT CGGCGGCCCC ACCCGCTGA
|
Protein sequence | MRNGAADRFE NVLDLQTAHL ALMQRQQDRR SAGGGLLPQE EITDFLARVA RTGAVLSTPA DRRIAQRVLD YWTADLLDTA RGYSGPVATE TLLACEAEDS DARPAGLAEG HGSREYIRLA AQARQWRDTR SHGYLLSGKA LRSAERFSRD PEIADLIAAS LAEEQREARR ARRRKRIAAG LTLALVAAVA MAGIFFLKVE TATHEAAEAA GEKGALARDV VFLGDEERLR AQERQVALEN ANVERRIAQE HMDALSERQS RLDAAQGALA DLVTAERLPL AGLPDGVAED VLRILALRQA EGRLDPSVLA PDVAAALAPV AADMEGSVFA LDLKGYDPLF LGRSLPLPAL DRAAQAAAFR GGEAVPYVHF SFLYNQARRA PLVAAVNFDR AARQVLPATG TPIEPDPRLP PELRPDPSRF EGGLVAADYV DRTMISWGEP LAADPFRTAR MLDQSVQLHL NKAPVHPAAA AVWTGLTRWI REQHNRSATR VTFFTGPIFQ PGESAVPASL WLIAVSLRDP VWVPAGQEQP FVAEAFLIPN RPDTLMEEPW KLAMTIEGIG RATGLRFLDE IVRADRGRTI VNATEGDRLA DRAGALNDPP SEDQTALMAE LALALQGGRL PASEQAKIIR ELAGLLAGPP DLTSAGRVNV LTLLAGVPAE SWNRPDWIVL KAEVRRAVVR VREPAPEPEA QGLVDRLAGA LGLDEPPPQR VFIQFADMTR ESVRSLAERI AALGWTVPPE ERVADASGLN EVRFNPESAE DAAAARLLAA DLAAAGRPGV RAVPLSVIRP QVLEVWIGGP TR
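The gene and protein sequences above are related by translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method. It relies on the fact that table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which start codons are permitted. The helper names (`clean`, `translate`, `gc_content`) are hypothetical. Alternative start codons such as GTG are not special-cased, which does not matter here because this gene begins with ATG.

```python
# Codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks that group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 9 bases of the gene sequence, copied from the record.
print(translate("ATGAGGAACG GCGCCGCCGA TCGGTTCGAG"))  # MRNGAADRFE
print(translate("ACCCGCTGA"))                         # TR
```

Passing the full gene sequence field to `translate` and `gc_content` allows a comparison with the listed protein sequence and GC content values.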