Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_3686 |
Symbol | |
ID | 4898824 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | + |
Start bp | 792308 |
End bp | 795508 |
Gene Length | 3201 bp |
Protein Length | 1066 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640114294 |
Product | parallel beta-helix repeat-containing protein |
Protein accession | YP_001045548 |
Protein GI | 126464435 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3420] Nitrous oxidase accessory protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.346515 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACGT ACTACGTCGC GACGAACGGC AACAACAACG GCAACGGCAG CTCGGGGTCG CCCTGGGCCA GCATCAACTT CGCAATGAGC CACGACCTCC GTCCCGGGGA TGAGGTGGTC GTCCGCGCCG GCACCTACCG CGAGTCCGTC ACGATCAATC ACGGCGGCTC GGCGGCCGGC GACGTGACGC TCCGCTCCGA GGTCGAGGGC GGCGCCCTCA TCCGGCCGCC GGCGGGCGCC TGGAACGGCA TCTATGTCTG GGACAGCTAT GTCACCATCG ACGGGTTCGA CATCGGCGGC GCACCGGGCG ACGGGATCGA GGGCAACGAT GTCCACCACA TCACCGTCCG CAACAACACG GTCCACGGCA GCGGCGAGTC CGGCATCCAG TTCAACTGGT CCGAATTCCT GAAGATCGAA GGCAACACGA CCTACGACAA CGCCAAGAGC GGGTGGTTCT CGGGCATCTC CGTCTACGAG AACCGCAACA TCTCGGGCGA TACCACGACC GAAGGCTACC GCACCATCAT CCGCAACAAC GTGTCCTACG ACAACGTGAC CAAGGGCGGG GCGCATACCG ACGGCAACGG CATCATCATC GACGACTTCC ACAATGGCCA GACCACCGGC CATCCGAACT ACACCTACCC GACGCTGGTC GAGAACAACC TCGTCTATGA CAATGGCGGC AAGGGCATTG CTGTCCACTG GAGCGACAAT GTCACGGTCC GCAACAACAC CGCCTATCAC AACAACCAGG ACAATGCGAA CGACGGCACC TGGCGCGGCG AGCTGAGCAA CCAGGATTCC AACAACACGA TCTGGGTGAA CAACATCGCG GTGGCGGATC CGTCGGTCAA TTCGAACAAT ACGGCCATCG GCTTCTACGG GTCGAACAAG GGCGTCGTCT GGGCGAACAA CCTGACCTAC AACGGGCGCG CGGGCGATGC CTCGCTGAAG CTCGAGGGCG GCAACAACCC CGCGCCGACC GCGGCCAACG GCAACCTGCT GGGCATCAAT CCCGGCTTCG CCAATCCCGC CGCCGGCGAT TTCACCCTGA CCTCCGGGTC GAAGGCGGTG GATGCGGGCA CGACCAAATA CGGCCTCGCC ACCACGGATC TCGACGGCGA CGGGCGCGTT CAGGGCGTGG TCGACCTGGG CGCCTTCGAA TGGGGGTCCG GCTCGGGCTC GCAACCCCAA CCGCAGCCCA ATGCCGCGCC GGTCGCCAAT GACGACACCG GCTTCTCGAC CACCTCGGAT GCGGCGCTGT CCATCGCCAC CTCGCGGCTT CTGTCGAACG ACACGGATGC CAACGGCGAC AGCCTCTCGA TCGCCTCGGT GGGCCAGGCC ACGCACGGCA CGGTCAAGCT GAACACCAAC GGCACGGTGA CCTTCACGCC CGAGGCCGGC TACAAGGGGG CGGCGACCTT CTCCTACACC GTCTCCGACG GCAAGGGCGG CAGCGACGCG GCTCTGGTCA CGCTCGACAT CAAGGCTCCG CCGGTGGTCA CGCCCACGAA CACCGCGCCC GATGCCCGGG ACGATGCGGG CTTCAGCACC GTCACCGGCA AGCCGGTCTC GATCAAGGTG GCGGACCTGC TGAAGAATGA CGTCGATGCC AATGGCGACA GCCTGACGGT CACCGGGGTC GGTTCGGCCA GCCACGGCAC CGTCAAGCTC AACACGGACG GCACCGTGAC CTTCACGCCG GAGGCCGGCT ACAAGGGCGC GGCGAGCTTC ACCTACGACG TGTCCGACGG CAAGGGCGGC ACCGACCGGG CCAATGTGGC CATCGACGTG GCTGCGGCGC CGGTGGCCAA CGATCCCACC ACCTACAGCT TCTGGGATGG GGCCGCCCAG CCCAAGGTGG CCTCGTTCCG GGACTATCAC GCCGTCGAGC TCGGGATGAA GTTCGTGGCG GACGTGCCCT CCGAGATGGA GGCCATGCGC ATCTATGTGG GCTCGCGCTA CAACGGCATC GAGTCCGTCA CGCTGCGGAC GGCCGATGGC AAGGTGGTGG CGACCCAGTC GGTGGACGGG CTGACCGGCA CCGGCTGGCA GGAGATCGCC TTCGACACGC CGGTCCAGCT TCAGGCGGGG CAGACCTACG TCGCCTCCTA CTTCACCTCG ACCGGTCGCT ACTCCTACAG CGACTATTAT TTCACCAAAT CCACGGATGC GGGCCCGATC TCGGTCGGGG CGAACGCGGG TGTCTTCTCC TATGCCGACA AGAGCACCCT GCCCACGTCG AGCTATCACG GGAACAACTA CTGGATCGAC GTGGTGGTGG ATCCGATCGA GGGCGGCACG ACGCTGAAGT CGGACGCGGC GGCAGGCAGC ACGGCGCCGC AGTTCCTCGA ATCCGCGAAT GCGGTGATCG GCGAATCCGG CGTGGTGAAG GTGGATCAGG CGACGGCCGA CGGCTGGCAC AGCGTCCGCT TCGCCGAAGC GCTCGACGCG CCCTCCGTTG TCATGAGCGC CATGTCGGGC GGTGACGAGG CCTTCACGGT GCGCGTCCGC AATGTGACCG ACAAGGGCTT CCAGTATCAG ATCGACGAGT GGGACCATCA GGACGGTCGC CACGGCGTCG AGACGCTGGG CTGGATGGCG GTCGAGAACG GCACGCACCA GCTGGCCGAC GGCCGGACCA TCGTGGCGGG CAGCGGCCAG GCTTCGGGCA CGGCGGGCCG GATCGACTTC GGCGACCACA GCTTCAAGAA GGCGCCCGTC GTCCTCGCGC AGGTGACGGG CGACCGGAAC GATTTCGCGG TCAATGACCG GATCGAGAAG GTGGGCGCCG AGGGCTTCGG CCTCCGGCTG GAGCAGCAGG AGGCCCGGAC GGGCGCCATC GCGGGCGAGT CGGTGGCCTG GATCGCCATC GACCGCGGCG CGGCGGGCGA AGGCACGCCG CTTGCCGGCA CGACCGGCAC CGGCGTGACG CATCAGCCGC ATGCGCTGGA TCTGGGCGCA GCCTTCGCGG GCGAGGAATT CGTGTTCCTC ACCGACATGC AGACCCGCAA TGGCGCGGAC AGCGCGACGG TCGGCGTCAC CGACCTCTCG GGCGAGCTGG CCACGATCCT GATCGCCGAG GAAAGCTCGC GCGACGCAGA GACCCAGCAT GTGGCCGAGG ACGTGGGCTA TCTCGGCCTG CAGATCGGCC AGATCTTCGG CCACGAGGCT AATGACCTGC TGATCGCCTG A
|
Protein sequence | MTTYYVATNG NNNGNGSSGS PWASINFAMS HDLRPGDEVV VRAGTYRESV TINHGGSAAG DVTLRSEVEG GALIRPPAGA WNGIYVWDSY VTIDGFDIGG APGDGIEGND VHHITVRNNT VHGSGESGIQ FNWSEFLKIE GNTTYDNAKS GWFSGISVYE NRNISGDTTT EGYRTIIRNN VSYDNVTKGG AHTDGNGIII DDFHNGQTTG HPNYTYPTLV ENNLVYDNGG KGIAVHWSDN VTVRNNTAYH NNQDNANDGT WRGELSNQDS NNTIWVNNIA VADPSVNSNN TAIGFYGSNK GVVWANNLTY NGRAGDASLK LEGGNNPAPT AANGNLLGIN PGFANPAAGD FTLTSGSKAV DAGTTKYGLA TTDLDGDGRV QGVVDLGAFE WGSGSGSQPQ PQPNAAPVAN DDTGFSTTSD AALSIATSRL LSNDTDANGD SLSIASVGQA THGTVKLNTN GTVTFTPEAG YKGAATFSYT VSDGKGGSDA ALVTLDIKAP PVVTPTNTAP DARDDAGFST VTGKPVSIKV ADLLKNDVDA NGDSLTVTGV GSASHGTVKL NTDGTVTFTP EAGYKGAASF TYDVSDGKGG TDRANVAIDV AAAPVANDPT TYSFWDGAAQ PKVASFRDYH AVELGMKFVA DVPSEMEAMR IYVGSRYNGI ESVTLRTADG KVVATQSVDG LTGTGWQEIA FDTPVQLQAG QTYVASYFTS TGRYSYSDYY FTKSTDAGPI SVGANAGVFS YADKSTLPTS SYHGNNYWID VVVDPIEGGT TLKSDAAAGS TAPQFLESAN AVIGESGVVK VDQATADGWH SVRFAEALDA PSVVMSAMSG GDEAFTVRVR NVTDKGFQYQ IDEWDHQDGR HGVETLGWMA VENGTHQLAD GRTIVAGSGQ ASGTAGRIDF GDHSFKKAPV VLAQVTGDRN DFAVNDRIEK VGAEGFGLRL EQQEARTGAI AGESVAWIAI DRGAAGEGTP LAGTTGTGVT HQPHALDLGA AFAGEEFVFL TDMQTRNGAD SATVGVTDLS GELATILIAE ESSRDAETQH VAEDVGYLGL QIGQIFGHEA NDLLIA
|
| |