Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_4088 |
Symbol | |
ID | 4895029 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009040 |
Strand | + |
Start bp | 28841 |
End bp | 29833 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640110490 |
Product | hypothetical protein |
Protein accession | YP_001041802 |
Protein GI | 126464826 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 72 |
Plasmid unclonability p-value | 0.00189441 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 88 |
Fosmid unclonability p-value | 0.318027 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAAGAG TCGTTCTGCA CATCGGAACG CACAAGACGG CAACCACCAC GATCCAGGAC ATGTTCGCGC ATAATGCGGA CCTTCTGCGG CAGCATGGGG TGATCTATCC GCGCCTCAGC CGCGTCGCGG GCCACCATGG GCTCGTGATG GAATGGAACA AGCTGCCCGA CATGTATGCC CTGCCTCAGG GCAGTATCGC GACGCTGAAA CAGCTCACGC GTGACTACGC TCATGTTCCG GGCACGCTGG TTCTCAGTTC CGAAGAGTTC TCACGCGGCA AGCCCGGTGC TCAGGTGGAC TTCCGGGCCG TGCGCGAACT GCTCTCGGAT TTCGAAAGCG TGTCCGTGGT TTGCGTGTTG CGTGAGCAAT GGCAATTCAT GCAGTCGATC TATCTGCAGG CTTCCAAGGA GCGGCAACCG CCCAAGCCCT CCACCATGGT CGATTCCGTT CTCAAACGCG ACATGACGGA TGGGCTGTGG ATCGACTACA ACCTGCTCTA CGATCATCTG CTTTCTGCCT TCGCGCCCGA AGAGATCACT TTCCTCGATT TTGATGCCTG CCGCCGACAT CGGGACGGGG TCGTGGGTGC CATGCTGGAC ACGCTTGGCT GCGGTCTCTC AGCCTCTTCG CTTCAGGTTG TCCATGACGG GCTGTCAAAC GTTTCGCCGC TGGCGCTGCC GACCTTTGCG GCCTGTGTCA TCACGGAACC GGACCAAGCG GCCCCTTGGG TGATCGACTG TGCGACAGGT GCCTTTCGCA TCGAATATGG CGAAGAGGCC AAGTCCTGCC TGTGGACATA CCCGGAGTTC CAGCAGCTCT TCGCATATGC GAAGGAGCGT AACGGCCGTC TGTCGGAACG CCGGAAGGCG GTGCAGCCGG AGTTCCGCAT CTCGGAAAGC GATCCCGAGG AGAATCGTGT CTATCGCGAT GGGCTCAACA CCTCGTTCTG GATCCGGTGC AGCCGCTGGA TCTATCGCGC CCGCCGGGGC TGA
|
Protein sequence | MARVVLHIGT HKTATTTIQD MFAHNADLLR QHGVIYPRLS RVAGHHGLVM EWNKLPDMYA LPQGSIATLK QLTRDYAHVP GTLVLSSEEF SRGKPGAQVD FRAVRELLSD FESVSVVCVL REQWQFMQSI YLQASKERQP PKPSTMVDSV LKRDMTDGLW IDYNLLYDHL LSAFAPEEIT FLDFDACRRH RDGVVGAMLD TLGCGLSASS LQVVHDGLSN VSPLALPTFA ACVITEPDQA APWVIDCATG AFRIEYGEEA KSCLWTYPEF QQLFAYAKER NGRLSERRKA VQPEFRISES DPEENRVYRD GLNTSFWIRC SRWIYRARRG
|
| |