Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_3969 |
Symbol | |
ID | 4898261 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | + |
Start bp | 1108929 |
End bp | 1110527 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640114572 |
Product | extracellular solute-binding protein |
Protein accession | YP_001045819 |
Protein GI | 126464706 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTCG CACATCTGCT GGCCGCGTCT TCGCTGGCCC TGATGGCTGC TGCCGGAGCC CAAGCCAAGA CGCTGGTCTA TTGCTCGGAA GGGTCGCCCG AGGGGTTCGA CCCCGCGCCC TACACCGCCG GCACCACCTT CGACGCGGCC TCGCAGGCGG TCTACAACCA GCTCGTCGAG TTCAAGCCGG GCACGACCGA GATCGCCCCC GCCCTGGCCG AGAGCTATGA GATCTCGGAC GACGGGCTGG AATACACCTT CCACCTGCGG CCGGGCGTCA AGTTCCACAC GACCGACTTC TTCACGCCCA CGCGCGAGAT GAACGCCGAC GACGTGATCT TCTCGTTCCT GCGTCAGGGC GACGAATCCA GCCCGTGGCA CCAGTATGTG GCCGGGATCA CCTACGAATA TTACAGCGGC ATGGAAATGC CGACCGTGAT CAAGGAGATC CAGAAGGTCG ACGACCTGAC GGTGAAGTTC GTGCTGACCC GTCCCGAGGC GCCCTTCCTC GCCAACCTCG CGATGGACTT CGCCTCGATC CTGTCGAAGG AATATGCCGA CAAGCTGGAG GCCGAGAACC GCAAGGAAGA CCTGAACAAC GCGCCAGTCG GCACCGGCCC GTTCAAGTTC GTGGCCTACC AGAAGGATGC GGTCATCCGC TATCAGGCCA ATGACGACTA CTGGGCCGGG CGCGAGAAGA TCGACGATCT GATCTTCGCC ATCACCCCCG ATCCGGCGGT GCGCATGCAG AAGCTGCAGG CCGGCGAATG CCACATCATG CCCTATCCGG CGCCCGCCGA CATCGAGGCG CTGAAGGCGG ACGAGAACCT GCAGGTGATG GAGCAGCCGG GCCTGAACGT GGCCTATCTC GCCTACAACA CCACCGTGGC GCCCTTCGAC AATCCGAACG TCCGCAAGGC GCTCAACATG GCGATGAACA AGGAGGCCAT CCTCGAGGCG GTCTTCCAGG GCACGGGGCA GGTCGCCAAG AACCCGATCC CGCCGACCAT GTGGAGCTAC AACGACGCGG TCGAGGACAC GGCCTTCGAT CCCGAAGCGG CCAAGAAGCT CCTCGAGGAA GCCGGCGTGT CGGATCTCTC GATGGAGATC TGGGCGATGC CTGTGCAGCG TCCCTACATG CCGAACGCCC GGCGCACCGC TGAGCTGATG CAGGAAGACT TCGCCAAGAT CGGCGTCAAG GTCGAGATCG TCTCCTACGA GTGGGGCGAG TATCTGAAGA AATCGACCGA CCCGTCGCGC AAGGGCGCGG TCATCCTCGG CTGGACGGGC GACAACGGCG ACCCGGACAA CTTCATGGGC GTGCTGCTGG GCTGCTCGGC CACCGGCGAC GGCGGCGCGA ACCGCGCGCA ATGGTGCAAC AAGGAGTTCG ACGACCTGAT CCAGAAGGCG AAGGTCACGG CGGATCAGGC GGAGCGCACC AAGCTCTACG AAGAGGCGCA GGTCGTCTTC AAGCGCGAGA ACCCCTGGGC CACCATCGCC CATTCGACGG TCTTCATGCC GATGTCGAAG AAGGTCTCGG GCTATGTGAT GAACCCGCTG GGCAAGCACA GCTTCTCGGG CGTCGATATC GAAGAGTGA
|
Protein sequence | MKFAHLLAAS SLALMAAAGA QAKTLVYCSE GSPEGFDPAP YTAGTTFDAA SQAVYNQLVE FKPGTTEIAP ALAESYEISD DGLEYTFHLR PGVKFHTTDF FTPTREMNAD DVIFSFLRQG DESSPWHQYV AGITYEYYSG MEMPTVIKEI QKVDDLTVKF VLTRPEAPFL ANLAMDFASI LSKEYADKLE AENRKEDLNN APVGTGPFKF VAYQKDAVIR YQANDDYWAG REKIDDLIFA ITPDPAVRMQ KLQAGECHIM PYPAPADIEA LKADENLQVM EQPGLNVAYL AYNTTVAPFD NPNVRKALNM AMNKEAILEA VFQGTGQVAK NPIPPTMWSY NDAVEDTAFD PEAAKKLLEE AGVSDLSMEI WAMPVQRPYM PNARRTAELM QEDFAKIGVK VEIVSYEWGE YLKKSTDPSR KGAVILGWTG DNGDPDNFMG VLLGCSATGD GGANRAQWCN KEFDDLIQKA KVTADQAERT KLYEEAQVVF KRENPWATIA HSTVFMPMSK KVSGYVMNPL GKHSFSGVDI EE
|
| |