Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_3869 |
Symbol | |
ID | 4898523 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | + |
Start bp | 996359 |
End bp | 997549 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 640114473 |
Product | hypothetical protein |
Protein accession | YP_001045720 |
Protein GI | 126464607 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000457394 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCATGC CCCTCGTTCT GGCGCTGCGC GAGCTGCGGC ACGACTGGAT CTCGGCGCTC TGCTTCGTGG CGGCGCTGGT GGGCGTGCTG GCGCCCATGC TGATCCTGCT CGCGCTGAAG ACGGGTGCGC TCGACACGAT GGTCGAGCGG CTGGTCGACG ATCCGGCGAA CCGCGAACTG CTGGCGGTGG GGGCCGGCGC GTATGACGAG GGCTTCTTCC GCTGGCTGGA GGCGCGGCCC GAGGCGGGGT TCGTCGTGCC CGCCACGCGC AGCATCAACG CCCTTGCGGA TGCGGTCGTG GCCTCCGCTC CCCGCCGCGA GATGGTGCGG GAGGTGCCGC TGGTGGTTTC GGCCGCGGGC GATCCGCTGC TGGCGGGAGA TGTCGGGCCG GGTCGGGTCT GGCTGAGCGC GCCTCTCGCG CGGTCTCTGG AGGTCGCGCC GGGTGGCGCG CTGACGATGG TGATCGGACG GCGCATCGAC GGCCTCGAGC AGACGGCGCG GCGACCGCTG AAGGTGGCGG GGATCGTTCC GGCCGAGCGC TACGGCCGCC CGGCGCTGTT CCTGTCGCTG CCGGACATGC TGGCGATCGA GCGGTTCCGC GACGATCCGG CCGTCACGCC CGGAAGCTGG CTTCAGGCCG CCGCGCCGCC TGCGGCCTTT GCCAGCTTCC GCCTCTATGC GCGGACGCTC GCGGATCTCG GGCCGCTCTC GGCGGCGCTG GAGGGGCGCG GTGTTGCGGT GCGCCCCCGC GCCGAGAATG CGGCGCTGCT GCTGCAGCTG CGGCGGGGCG CGGATCGGCT GTATCTTGCG GTCGCCGCAT TGGCCGCCGC CGGATTCTGG GCCGCGATGA GCGCCAATCT CCGCGGCATG GTGGAGCGGC GGCGGCTGGC CTTCAGCCTG CTGCGGTTGC TGGGCCTGAC GCCCGTCCAG CGCGCGACGG TTCCGCTGAT CCAGAGCCTC GTGCTGATCG CGGCGGGGCT CGGGCTCTCG CTCGCCCTCG TTCTGCCGGC CGTGGCGCTG ATCAACGCGA GCTTTCCCTC CGTGGCCGAA GGGGCGGCGC TCGCGCGCCT CAGGCCGGAC CAGTTGGGGG GGGCGGCTGC GCTTGCCTGC GTGACGGCGC TGACCGCGGC GCTCTGGGCG ATGGCGGCGG TGCTGCGGAT CCCGAGCGAG GAGGTGTTGC GTCATGGCTA G
|
Protein sequence | MPMPLVLALR ELRHDWISAL CFVAALVGVL APMLILLALK TGALDTMVER LVDDPANREL LAVGAGAYDE GFFRWLEARP EAGFVVPATR SINALADAVV ASAPRREMVR EVPLVVSAAG DPLLAGDVGP GRVWLSAPLA RSLEVAPGGA LTMVIGRRID GLEQTARRPL KVAGIVPAER YGRPALFLSL PDMLAIERFR DDPAVTPGSW LQAAAPPAAF ASFRLYARTL ADLGPLSAAL EGRGVAVRPR AENAALLLQL RRGADRLYLA VAALAAAGFW AAMSANLRGM VERRRLAFSL LRLLGLTPVQ RATVPLIQSL VLIAAGLGLS LALVLPAVAL INASFPSVAE GAALARLRPD QLGGAAALAC VTALTAALWA MAAVLRIPSE EVLRHG
|
| |