Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_3778 |
Symbol | |
ID | 4898584 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | + |
Start bp | 903191 |
End bp | 905170 |
Gene Length | 1980 bp |
Protein Length | 659 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640114382 |
Product | hypothetical protein |
Protein accession | YP_001045630 |
Protein GI | 126464517 |
COG category | [R] General function prediction only |
COG ID | [COG3211] Predicted phosphatase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.913258 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGATC TGCGTCAGAG CCAGATCTTC CGCACCTCGA AGCTGGAGGA GGCCGACGGG CCGGGCCTCA ATCCGACCGC CACTCCCACG ATGGGCGACA TCATCGCCGC GCGCTTCTCG CGCCGCGGCT TCCTCAAGGG CTCGATGGCC TCCGCTGCCA TCGCGGCGAC CGTCTCTCCG GTGGCGCTCC TTGCCGCGGG CGAGGCGCGT GCGCAGGGCA GCTCCGCCTT CAGCTTCCCC GAGGTCGAGG CCGGCGTCGA TGCCGACCAC CATGTGGCCG AGGGCTACGA TGCCGATGTC CTGCTGCGCT GGGGCGACAA GGTCTTCGCC GATGCGCCGG AGTTCGATCC GCGGGCCCAG AGCGAGGCGG CGCAGGAGCG CCAGTTCGGC TACAACAACG ACTTCGTGGG GTTCATTCCG CTCGACGGGG CGACCGACCG CGGCCTTCTG GTCGTGAACC ACGAATATAC CAACGAACAT CTGATGTTCC CGAACGTCGT CACCCTGAAG GACGGCGAGA TGGTGGTGGC CGATGCCACC GCGGACCGGG CCGACATCGA GATGGCGGCC CACGGCGGCA CCGTGATCGA ACTGCGCAAG GTGGACGGCA AATGGGCGCC GGTGCTCGAC GGGCGTCTGA ACCGCCGCAT CACCGCCAAG ACCCGGATGC AGCTCACGGG CCCGGCGGCG GGTCATGACC GGCTGAAGAC CTCGGAGGAT CCATCCGGCG CCGAAGTGCT CGGCACGATC AACAACTGTG CGGGCGGCGT CACCCCGTGG GGCACCTACA TCATGGCCGA GGAGAACATC CACGGTTACT TCCTGGGCGA CCTGCCGGCC GATCATCCGG AGGCGCGCAA CCACGGGCGG CTGGGCGTGC CCGGCGCCTC CTACCAGTGG GGCAGGTTCC ACAAGCGTTT CGACGTGGGT CAGGAGCCCA ACGAGCCGAA CCGCTTCGGC TGGATCGTCG AGGTCGATGT GATGGACCCC ACTTCGGTGC CGAAGAAGCG GACGGCCCTC GGGCGCTTCA AGCACGAGGG CGCGGAAAGC GTCGTGGCGA AGGACGGCCG CGTCGTCTTC TATCTCGGCG ACGACGAGCG CTTCGATTAT GTCTACAAGT TCGTCACCAA CGGCCGCTAC AACCCCGACG ACCGCGCGGC CAACATGGAC CTCCTCGACG AGGGCACGCT CCATGTCGCC CGGTTCGAGG CCGACGGCTC GATGCGGTGG ATCCCGCTCG TCCATGGCGA GGGGCCGCTC ACGGCCGAGA ACGGTTTCGA AAGCCAGGCC GACGTGCTGA TCGAGACGCG CCGCGCGGCG GATCTCCTCG AGGCCACGCC CATGGACCGG CCCGAGGACA TCCAGCCCAA CCCGCAGACC GGCCGCGCCT ATGTCATGCT GACCAACAAC ACCAAGCGCA CCGAGGCCGA TGCCGCCAAC CCGCGCGTGA AGAACGCCTT CGGCCATATC ATCGAGATCC TCGAGGCCGA CGGAGATTTC ACGGCCACGA CCGGCCGGTG GGAGATCCTG CTCCAGTGCG GCGACCCGGC GGTGGCCGAG GTGGGCGCGA CCTTCTCGAC CGAGACCACG AAGAACGGCT GGTTCGGCAT GCCGGACAAT GCCGCGGTGG ATGCCGACGG CCGCCTCTGG GTCTCGACCG ACGGCAACTC GATGGCCGAT ACCGGCCGGA CCGACGGCCT CTGGGCGGTG GACACCGAGG GCGATGCGCG CGGGACCTCG CGCCTCTTCT ACCGGGTGCC GGTCGGGGCC GAACTCTGCG GCCCCTGCCC GACCGAGGAC ATGAGCACCT TCTTCGTTGC GGTCCAGCAT CCGGGCGACG GCGGCGAGGA CTGGGAGGGC CACGGCCGCC TGTCCTACTA CGAGGATCTC TCCACCCGCT GGCCGGATTT CAAGGACGAC ATGCCGGTGC GCCCGGCCGT CGTGGCGATC ACCCGGCAGG GCGGCGGCCG CATCGGCTGA
|
Protein sequence | MTDLRQSQIF RTSKLEEADG PGLNPTATPT MGDIIAARFS RRGFLKGSMA SAAIAATVSP VALLAAGEAR AQGSSAFSFP EVEAGVDADH HVAEGYDADV LLRWGDKVFA DAPEFDPRAQ SEAAQERQFG YNNDFVGFIP LDGATDRGLL VVNHEYTNEH LMFPNVVTLK DGEMVVADAT ADRADIEMAA HGGTVIELRK VDGKWAPVLD GRLNRRITAK TRMQLTGPAA GHDRLKTSED PSGAEVLGTI NNCAGGVTPW GTYIMAEENI HGYFLGDLPA DHPEARNHGR LGVPGASYQW GRFHKRFDVG QEPNEPNRFG WIVEVDVMDP TSVPKKRTAL GRFKHEGAES VVAKDGRVVF YLGDDERFDY VYKFVTNGRY NPDDRAANMD LLDEGTLHVA RFEADGSMRW IPLVHGEGPL TAENGFESQA DVLIETRRAA DLLEATPMDR PEDIQPNPQT GRAYVMLTNN TKRTEADAAN PRVKNAFGHI IEILEADGDF TATTGRWEIL LQCGDPAVAE VGATFSTETT KNGWFGMPDN AAVDADGRLW VSTDGNSMAD TGRTDGLWAV DTEGDARGTS RLFYRVPVGA ELCGPCPTED MSTFFVAVQH PGDGGEDWEG HGRLSYYEDL STRWPDFKDD MPVRPAVVAI TRQGGGRIG
|
| |