Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_2604 |
Symbol | |
ID | 4897137 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 2743426 |
End bp | 2745051 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640113204 |
Product | protein of unknown function DUF894, DitE |
Protein accession | YP_001044478 |
Protein GI | 126463364 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCCCGACC GTGTCTCTCC CCTCGCGCCC TTCCGCCTGC CCACCTACCG CAACCTCTGG ATGGCGAGCA TCGTCTCGAA CTTCGGCGGG CTGGTGCAGG CGGTGGGCGC AGGCTGGATG ATGACCGAGC TCACCCATTC GGCCGGGATG GTGGCGCTGG TGCAGGCCTC GACGACCCTG CCGATCATGC TCTTCTCGCT GCCCTCGGGC GCGCTGGCCG ACAGCCTCAA CCGGCGGCGC CTGATGCTGA CGGCGCAGCT CTTCATGCTG ACGGCCTCGG CGGCGCTGGC GCTGGCGGCC TTCGCGGATC TGCTGACGCC GTGGCTCTTG CTGACCTTCA CCTTCCTGAT CGGCGCGGGC GTGGCGCTGC ACAACCCTTC GTGGCAGGCG TCTGTCGGCG ACATCGTGCC CCGCAAGGAC CTGCCGTCGG CGGTCGCGCT GAACAGCATG GGCTTCAACC TGATGCGGAG CGTGGGCCCC GCGCTGGGCG GGATCATCGT GGCGGCGGGC GGCGCGGCGG CGGCCTTCGC CATCAATGCG GTGAGCTATC TGCCGCTCGT CCTCACGCTC TTTCTCTGGC GGCCCGATTA TGCACCGCGG CGGCTGCCGC GCGAGACGCT GGGATCGGCG GTGGCCGCGG GCCTGCGCTA TGTCTCGATG TCGCCCATCC TGCTCAAGGT GCTCTTCCGC GGCTTCCTCT TCGGCCTTGC GGCAGTGAGC CTGCTTGCGC TGCTGCCCGT CGTCGCGCGC GACCTCGTGG GCGGGGGCGC CTTCACCTAC GGCGTGCTTC TGGGCTGTTT CGGGGTGGGC GCCATCGGCG GCGCCTTCGC CGGCGCGCGC CTGCGCGAGC GGTTCCAGAA CGAGACCATC GTGCGGGCGG GCTTCGTGAT CTTCGCGCTG GCGCTGACCG GACTCGGCCT GTCGCGCGCG CTGTGGCTGT CGGGGCTCAT GCTCCTGCCG GCGGGGGCGG CCTGGGTGCT GGCCTTGTCG CTGTTCAACG TGAGCGTCCA GCTTGCCACG CCGCGCTGGG TCGTGGGGCG GGCGCTGGCC CTCTATCAGA CCGCCACTTT CGGCGGGATG GCGGCGGGCA GCTTCCTCTG GGGGCAGGCG GCCGAGGCGG GCGGCGTGGA CGGGGCCCTG TTCGGGGCGG CCGGTGTCCT CGTCGTGGGG GCGCTGGTGG GGCTGCGGAT GCCGCTGCCG GCCTTCGGGA TGCAGGATCT CGATCCGCTG GGGCGGTTCG TGGAGCCGAG CCTGCCCGTG GATCTGCGCA TGCGCTCGGG TCCGATCATG GTGACGGTCG AATATGACGT GGTGCCCGAG AATGTGGACG AGTTTCTCGC CGCCATGGCG GACCGGCGCC GGATCCGGAT CCGGGACGGG GCGCGGCAGT GGGTGCTGTT GCGCGATCTG GAGCGGCCGG GGATCTGGGC CGAGAGCTAC CATGTCGCGA CCTGGGCGGA ATATCTGCGC CACCACGAGC GGCGGACGAA GGCGGATGCC GAGGTGACGG ACCGGCTGCT CGCGCTGCAC GCAGGGCCGG GCAAGCCGCG GGTGCGCCGG ATGATCGAGC GCCAGACGGT GCCCTTGCAC GACGATCTGC CGCTGAAGCC CGAGGATCTG ACCTGA
|
Protein sequence | MPDRVSPLAP FRLPTYRNLW MASIVSNFGG LVQAVGAGWM MTELTHSAGM VALVQASTTL PIMLFSLPSG ALADSLNRRR LMLTAQLFML TASAALALAA FADLLTPWLL LTFTFLIGAG VALHNPSWQA SVGDIVPRKD LPSAVALNSM GFNLMRSVGP ALGGIIVAAG GAAAAFAINA VSYLPLVLTL FLWRPDYAPR RLPRETLGSA VAAGLRYVSM SPILLKVLFR GFLFGLAAVS LLALLPVVAR DLVGGGAFTY GVLLGCFGVG AIGGAFAGAR LRERFQNETI VRAGFVIFAL ALTGLGLSRA LWLSGLMLLP AGAAWVLALS LFNVSVQLAT PRWVVGRALA LYQTATFGGM AAGSFLWGQA AEAGGVDGAL FGAAGVLVVG ALVGLRMPLP AFGMQDLDPL GRFVEPSLPV DLRMRSGPIM VTVEYDVVPE NVDEFLAAMA DRRRIRIRDG ARQWVLLRDL ERPGIWAESY HVATWAEYLR HHERRTKADA EVTDRLLALH AGPGKPRVRR MIERQTVPLH DDLPLKPEDL T
|
| |