Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_4037 |
Symbol | |
ID | 5086210 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009430 |
Strand | + |
Start bp | 72514 |
End bp | 74163 |
Gene Length | 1650 bp |
Protein Length | 549 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640485600 |
Product | hypothetical protein |
Protein accession | YP_001170194 |
Protein GI | 146280037 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.677701 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACAGAGC CCCACGTGCT GATCCTGGGC TCGGGTCCCG GCGGGGCGGC GCTCGCCTGG CGGCTCGCCT CGGCTGGGCT TGCGGTGCGG GTGCTCGAGG CGGGGCCCGC CTTCGATCCG GCCACCGATT ATGCCCAGGA CCGCGCAGAC TGGGAGACTC CCTTCCCCGA GCGGCCCGGC AGCCGCGGCG CCTGCGAGAC GGGACCCCTG CAGGAGCTTG GGTTCGAAAT CGACGACATC CGCTCATGGA ACGCGCTGAC CGGGCCTTAC GTCCCGGGGA CGCGGCGGGC TGACTTCGGC TATCACCATG TACGGGGCGT AGGCGGCAGC TCGCTTCACT TTACCGGCGA GGCGCACCGG CTGCATCCCC GTGCATTCAC GATGAAGAGC ACGTTCGGCG TGGCGGCGGA CTGGCCCGTG ACCTATGCCG AACTCGAGCC CTACTGGCTC GAGGCCGAGC GGCAGTCGGG TGTGGCGGGA CCGGCCGAGG ATGCACAGCG GCCGCGGAGC GCGCCCTATC CGCTGCCCGC GCATCCCTTC AGCCATGCCA GCGACCGCCT CGCCCGCGCC GCGCGGAGCC TGGGCCTCTC GGTGCAGGCC AATGCGCTGG CCGTGCCATC GCGCCCTTAT GACGACCGGC CCGACTGCAA CTATTGCGGC GGCTGCCTGC GCGGTTGCCA GCGGGGCGAC AAGGGCAGCG TGGACCAGAC CTACCTGCGC AAGGCCGTAG AGACCGGGCG CTGCGAAGTG CTGCCGGGGA TTGAGGCGAT GCGGCTCGAG ACGGCGGGGG GACGGGTGAG CGGCGTCCTC TGCGCGACGT CGGCGGGTCC GCGCCTCTTC CGTGCGCCGG TTGTGATCCT GGCCTGCGGC GCGGTGCAGA CGCCGCGGCT CCTGCTGAAC TCGGCCTCGG AGGAGAGCCC CGACGGGCTC TGCAACGAGA GCGGCGAGGT GGGGCACAAC TTCATGGAAA CACTCATATT TACCGCAAGC GCGCTTCATT CCGAGCCCTT GGGCAGCCAC CGCGGCCTGC CCGTCGACTG GATCTGCTGG GACTTCAATG CACCCGACGC GATCCCGGGC GTCACGGGCG GCTGCCGCTT CGGCTGCTCG ATGGCCGAGA GCGATCTGGT GGGCCCCGTA GCCTATGCGA CCCGGGTGGT CGGGGGCTGG GGCCGTGCCC ACAAGCGCGC GCTGCGCGCC AGTTTCGGGC GCGCGCTGTC GGTCACCGGG ATCGGCGAGT GCCTGCCCCA TCCCGAAAGC CGGATCCGCC TCTCGACGCG ACGTGACGCG CATGGGATGC CGATCCCGCG GATCGAGAGC CGCCTCGGGC CCGACGCCTT TGCTCGGCTG CGCTTCATGG CCCGGACCTG CCGGGCTATC CTTGCCGCCG CAGGCTGCGC CGCGCCCTTC GAGGAATTCA GCTCGGCTGA CGCCTTTTCC TCGACCCATG TCTTCGGCAC CTGCCGCATG GGCCATGATC CCATGCGGAA CGTTGTGGAC GGATGGGGCC GCAGCCACCG CTGGCCGAAC CTCTTCGTCG CCGACGCAAG CCTCTTTCCC TCAAGCGGCG GCGGCGAGTC TCCCGGTCTC ACGATCCAGG CACTGGCACT GCGGACGGCC GACCATCTGC TGTCGGAAGC CCGTCCATGA
|
Protein sequence | MTEPHVLILG SGPGGAALAW RLASAGLAVR VLEAGPAFDP ATDYAQDRAD WETPFPERPG SRGACETGPL QELGFEIDDI RSWNALTGPY VPGTRRADFG YHHVRGVGGS SLHFTGEAHR LHPRAFTMKS TFGVAADWPV TYAELEPYWL EAERQSGVAG PAEDAQRPRS APYPLPAHPF SHASDRLARA ARSLGLSVQA NALAVPSRPY DDRPDCNYCG GCLRGCQRGD KGSVDQTYLR KAVETGRCEV LPGIEAMRLE TAGGRVSGVL CATSAGPRLF RAPVVILACG AVQTPRLLLN SASEESPDGL CNESGEVGHN FMETLIFTAS ALHSEPLGSH RGLPVDWICW DFNAPDAIPG VTGGCRFGCS MAESDLVGPV AYATRVVGGW GRAHKRALRA SFGRALSVTG IGECLPHPES RIRLSTRRDA HGMPIPRIES RLGPDAFARL RFMARTCRAI LAAAGCAAPF EEFSSADAFS STHVFGTCRM GHDPMRNVVD GWGRSHRWPN LFVADASLFP SSGGGESPGL TIQALALRTA DHLLSEARP
|
| |