Gene Information |
Locus tag | Rsph17029_4166 |
Symbol | |
ID | 4894999 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009040 |
Strand | + |
Start bp | 103346 |
End bp | 104134 |
Gene Length | 789 bp |
Protein Length | 262 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640110557 |
Product | ModE family transcriptional regulator |
Protein accession | YP_001041869 |
Protein GI | 126464893 |
COG category | [H] Coenzyme transport and metabolism; [R] General function prediction only |
COG ID | [COG2005] N-terminal domain of molybdenum-binding protein; [COG3585] Molybdopterin-binding protein |
TIGRFAM ID | [TIGR00637] ModE molybdate transport repressor domain; [TIGR00638] molybdenum-pterin binding domain |
|
|
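The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp):
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length_bp(103346, 104134)
print(length, protein_length_aa(length))  # 789 262
```

Under these assumptions the results agree with the Gene Length (789 bp) and Protein Length (262 aa) rows.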
Plasmid Coverage information |
Num covering plasmid clones | 76 |
Plasmid unclonability p-value | 0.00104797 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 84 |
Fosmid unclonability p-value | 0.104434 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGGAGG GACTGAGGGG GGCGCTGACG GTCGAGCGGC CGGGACAGGC GCGGATGGGG GCGGAGCGGG TGGCGCTGCT CGCAGCCATC GGGCGCACGG GCTCGATCGC GGCGGCGGCG CGCGAGGTGG GCCTGTCCTA CAAGGCGGCC TGGGATGCGG TGCAGGCGAT GAACAACCTC TTCGCCCGCC CCGTCGTCAC CGCAGCACCC GGCGGACGGA CGGGCGGCGG CGCCCTCGTC ACGCCCGAGG GCGAGCAGGT GATCGCAGCC TTCGCCGCCA TCGAGGACGG GCTCTCGCGC GTTCTGTCCA CCCTCGAGAC CCGCTTGGCC CTCCATCCCG CCGACATTCT CTGGAGCCTG ATGATGAAGA CCTCGGCCCG CAACGTCTAC CGCTGCACCG TGACCGCCCT GACCCAGGGC GAGGTGAGCG CCGAAGTGCA GATGGATCTC GGCGGCGGCC AGATCCTCAC CGCCGTCATC ACCGAGCGCA GCCTGGCCGA CATGGGCCTC AGCCCCGGCG CCGAGGTCTT CGCGCTGGTC AAGTCGAGCT TCGTGATCCT CGCCCGCGGC GACGAACTCG CGCAGCTCTC GGTGCGCAAC CGTCTGGGCG GCACGGTCGC GAGCCGCACC GACGGGCAGG TGAACAGCGA GATCGTGCTG GATCTGGGCG GCGGCAAGAC GCTCGCCGCC ACCATCACCC GCGAGAGCGC CGAGAGGCTG GCGCTGCAGC CGGGCGACCG GGCGACCGCG CTCGTGAAGG CGAGCCATGT GATCGTGGCG CTGCCCTGA
|
Protein sequence | MPEGLRGALT VERPGQARMG AERVALLAAI GRTGSIAAAA REVGLSYKAA WDAVQAMNNL FARPVVTAAP GGRTGGGALV TPEGEQVIAA FAAIEDGLSR VLSTLETRLA LHPADILWSL MMKTSARNVY RCTVTALTQG EVSAEVQMDL GGGQILTAVI TERSLADMGL SPGAEVFALV KSSFVILARG DELAQLSVRN RLGGTVASRT DGQVNSEIVL DLGGGKTLAA TITRESAERL ALQPGDRATA LVKASHVIVA LP
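The relationship between the two sequence rows can be illustrated with a short translation routine. This is a sketch only, assuming the amino acid assignments of translation table 11, which are the same as those of the standard code (the tables differ in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here). The names `CODON_TABLE`, `clean`, and `translate` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Codon-to-residue map in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the 10-character block spacing used in the record and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First seven codons of the gene sequence, spaced as in the record.
print(translate("ATGCCGGAGG GACTGAGGGG G"))  # MPEGLRG
```

Passing the full Gene sequence value to `translate` and comparing the result with `clean` applied to the Protein sequence value is one way to check the two rows against each other.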