Gene Information |
Locus tag | Rsph17029_0962 |
Symbol | |
ID | 4895250 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 992028 |
End bp | 993023 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640111548 |
Product | hypothetical protein |
Protein accession | YP_001042845 |
Protein GI | 126461731 |
COG category | [S] Function unknown |
COG ID | [COG4093] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0340948 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCAAGT TGCTCGGTCT GGTTCTGGTG CTGGCCCTTC TCTGGGCGGG ATGGTGGGTG GCGGGCTCCG CCCTCGTGCG TCAGGCAGCG GAGGCCTTCT TCGCCCAGCA GCGCGCGGCG GGGCGGCTGG CCGAGCATGA GGGTCTGTCG GTGCGCGGCT TTCCGAACCG GTTCGACCTG ACGGTCGAGA ACCTCCGGCT GGCCGATCCT GCGACGGGCC TCGGCTGGCG CGCGCCCTTC GTCCAGCTCT TCGCGATGAG CTGGAAGCCC TGGCACCTGA TCGCGGCGCT GCCGCCCGAG CAGGAGATCG ACCTCGACGG CCAGACCGTC ACGCTGAATG CCTCGGCGCT CCGGGCGAGC CTCATCGTCT CGCCGAGCGC GACGCTGCCG CTCGACCGGA CCGCTTCGGC CGGTCAGGCG CTGGTCCTGC GCTCTTCCGC GGGCTGGAGC GTGCGGCTCG ACGAGGCGCA GCTTGCGACG CGCCGGATGG GTGACGATGC CACGCGGCAC GAGATCGGTC TCGACTTGGC GGGGGTGGCC CCCGAGGGCG CCTTCGCCGC GGCGGCCGAA CGCGCGGGCC TGCCGCCGCG CCTGTCCCAG CTGCGCCTTC TCGCCGAGGC CGGCTTCTCC GCGCCGATCG ACCGCGAGCT GGGCACGTCG CGGCCGCAGC TGCGGTCACT CACGCTGCGT GAATCGCAAC TCGACTGGGG GTCTCTGCAA CTGACGGGCT CGGGCGAGCT CACCTTCGAC GCCGCGGGGA TGCCCGAGGG GCGCATCGAT CTGGCGCTCG GCAACTGGCG ACAGGCGATC CGCGCGGCGG CCGAGCTCGG CTATCTGGAG GCCGAGTCGG TGCCGGCATG GGAGCGCGGG CTCGGCATCT TCGCGGCCCG CTCGGGCGGC GAGAAGCTTC AGGTGCCGCT GAACTTCAGC AAGGGCTGGA TGAGCCTCGG GCCCCTGCCC CTCGGCCCTG CCCCGCGCCT CGGCCCCGCC GGCTGA
|
Protein sequence | MRKLLGLVLV LALLWAGWWV AGSALVRQAA EAFFAQQRAA GRLAEHEGLS VRGFPNRFDL TVENLRLADP ATGLGWRAPF VQLFAMSWKP WHLIAALPPE QEIDLDGQTV TLNASALRAS LIVSPSATLP LDRTASAGQA LVLRSSAGWS VRLDEAQLAT RRMGDDATRH EIGLDLAGVA PEGAFAAAAE RAGLPPRLSQ LRLLAEAGFS APIDRELGTS RPQLRSLTLR ESQLDWGSLQ LTGSGELTFD AAGMPEGRID LALGNWRQAI RAAAELGYLE AESVPAWERG LGIFAARSGG EKLQVPLNFS KGWMSLGPLP LGPAPRLGPA G
|
| |
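The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 992028
END_BP = 993023


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Amino acids encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring the spaces between 10-base blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 996, matching "Gene Length"
print(protein_length(length_bp))  # 331, matching "Protein Length"
```

Passing the "Gene sequence" field to gc_fraction gives a value that can be compared with the reported GC content; the spaces between the 10-base blocks are stripped before counting.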