Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RSP_2234 |
Symbol | |
ID | 3719763 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007493 |
Strand | - |
Start bp | 850877 |
End bp | 852097 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640070406 |
Product | DNA-binding protein |
Protein accession | YP_352290 |
Protein GI | 77462786 |
COG category | [R] General function prediction only |
COG ID | [COG4277] Predicted DNA-binding protein with the Helix-hairpin-helix motif |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.62693 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAAG ACCTCGCCCG CAAGCTGGCT ATCCTGTCGG ACGCCGCGAA ATACGATGCC TCCTGCGCCT CGAGCGGGGG CACACGGCGC GATTCGAAGG ATGGCAAGGG GCTGGGATCC TCGGGCGGAA GCGGCATCTG CCACGCCTAT GCGCCGGACG GGCGCTGCAT CAGCCTTCTG AAGATCCTGA TGACCAATTT CTGCATCTTC GACTGCGCCT ATTGCGTGAA CCGCGTCTCC TCGCGGGTCG AGCGGGCGCG GTTCTCGGTC GAAGAGGTGG TGACGCTCAC CGTCGAATTC TACCGGCGGA ACTATATCGA GGGGCTCTTC CTCTCGTCGG GCATCATCCG CTCGCCCGAT GACACGATGG CCGACATGGT GCGCATCGCC AAGACCCTGC GCGAGCGCGA GCATTTCCGG GGCTATATCC ACCTCAAGAC CATTCCCGAC GCCGCGCCCG AGCTGATCGA GCAGGCGGGC CTCTATGCCG ACCGGCTGTC GATCAATGTG GAGCTGCCGA CCGAAGCCGG GCTCGACCGC TTCGCGCCGG AGAAGTCGGC CACCGGCATC CGCAAGGCGA TGGCCGAGGT GCGGCTGAAG CGCGAGGCCT CGCGCGAGCC GAGCTTCTCC GGCCGCAGAC CCTCGCGCTT CGCGCCCGCG GGCCAGTCCA CGCAGATGAT CGTGGGAGCG GACGGGGCGG ACGATGCAGC CATCCTCGGC AATGCCTCGA CGCTCTATGC CAACTACGGT CTGAGCCGGG TCTATTACTC GGCCTTCTCG CCCATTCCCG ATGCCTCGAA GGCGCTGCCC CTCGTGCGTC CGCCGCTCCT GCGCGAGCAT CGGCTCTATC AGGCGGACTG GCTCCTGCGG TTCTACGGCT TCGAGGTGGG CGAGATCGCG GACAAGGGGA TGCTCGATCT CGAGGTCGAT CCGAAGCTCG CCTGGGCGCT GGCGCATCGC GAGGCCTTTC CGATGGATGT GAACCGCGCC CCGCGCGAGA TGCTGCTGCG CGTGCCGGGC TTCGGCACCA AGACGGTGGG CCGCATCCTT GCCGCGCGGG CGCACGGGGC GGTGCGCTAC GAGCATCTGG TGGCGATGGG CGCGGTGGTG AAACAGGCGC GCCCCTTCAT CGTGGCCCCC GGCTGGCGGC CGCAGGGGCT GGACGACGCC AGCCTGCGCG CGCGCTTCGT GCCGCCGCCG GAACAGTTGA GCCTCTTCTG A
|
Protein sequence | MKKDLARKLA ILSDAAKYDA SCASSGGTRR DSKDGKGLGS SGGSGICHAY APDGRCISLL KILMTNFCIF DCAYCVNRVS SRVERARFSV EEVVTLTVEF YRRNYIEGLF LSSGIIRSPD DTMADMVRIA KTLREREHFR GYIHLKTIPD AAPELIEQAG LYADRLSINV ELPTEAGLDR FAPEKSATGI RKAMAEVRLK REASREPSFS GRRPSRFAPA GQSTQMIVGA DGADDAAILG NASTLYANYG LSRVYYSAFS PIPDASKALP LVRPPLLREH RLYQADWLLR FYGFEVGEIA DKGMLDLEVD PKLAWALAHR EAFPMDVNRA PREMLLRVPG FGTKTVGRIL AARAHGAVRY EHLVAMGAVV KQARPFIVAP GWRPQGLDDA SLRARFVPPP EQLSLF
|
| |