Gene Rsph17029_3969 details


Gene Information

Locus tag: Rsph17029_3969
Symbol:
ID: 4898261
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009050
Strand:
Start bp: 1108929
End bp: 1110527
Gene Length: 1599 bp
Protein Length: 532 aa
Translation table: 11
GC content: 64%
IMG OID: 640114572
Product: extracellular solute-binding protein
Protein accession: YP_001045819
Protein GI: 126464706
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
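
The coordinates and lengths in the Gene Information table above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database tool.

```python
# Values copied from the Gene Information table.
START_BP = 1108929
END_BP = 1110527
GENE_LENGTH_BP = 1599
PROTEIN_LENGTH_AA = 532


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks should agree with the table if the assumptions hold.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the span 1108929 to 1110527 covers 1599 bp, which is 533 codons: 532 amino acids plus the stop codon.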


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATTCG CACATCTGCT GGCCGCGTCT TCGCTGGCCC TGATGGCTGC TGCCGGAGCC 
CAAGCCAAGA CGCTGGTCTA TTGCTCGGAA GGGTCGCCCG AGGGGTTCGA CCCCGCGCCC
TACACCGCCG GCACCACCTT CGACGCGGCC TCGCAGGCGG TCTACAACCA GCTCGTCGAG
TTCAAGCCGG GCACGACCGA GATCGCCCCC GCCCTGGCCG AGAGCTATGA GATCTCGGAC
GACGGGCTGG AATACACCTT CCACCTGCGG CCGGGCGTCA AGTTCCACAC GACCGACTTC
TTCACGCCCA CGCGCGAGAT GAACGCCGAC GACGTGATCT TCTCGTTCCT GCGTCAGGGC
GACGAATCCA GCCCGTGGCA CCAGTATGTG GCCGGGATCA CCTACGAATA TTACAGCGGC
ATGGAAATGC CGACCGTGAT CAAGGAGATC CAGAAGGTCG ACGACCTGAC GGTGAAGTTC
GTGCTGACCC GTCCCGAGGC GCCCTTCCTC GCCAACCTCG CGATGGACTT CGCCTCGATC
CTGTCGAAGG AATATGCCGA CAAGCTGGAG GCCGAGAACC GCAAGGAAGA CCTGAACAAC
GCGCCAGTCG GCACCGGCCC GTTCAAGTTC GTGGCCTACC AGAAGGATGC GGTCATCCGC
TATCAGGCCA ATGACGACTA CTGGGCCGGG CGCGAGAAGA TCGACGATCT GATCTTCGCC
ATCACCCCCG ATCCGGCGGT GCGCATGCAG AAGCTGCAGG CCGGCGAATG CCACATCATG
CCCTATCCGG CGCCCGCCGA CATCGAGGCG CTGAAGGCGG ACGAGAACCT GCAGGTGATG
GAGCAGCCGG GCCTGAACGT GGCCTATCTC GCCTACAACA CCACCGTGGC GCCCTTCGAC
AATCCGAACG TCCGCAAGGC GCTCAACATG GCGATGAACA AGGAGGCCAT CCTCGAGGCG
GTCTTCCAGG GCACGGGGCA GGTCGCCAAG AACCCGATCC CGCCGACCAT GTGGAGCTAC
AACGACGCGG TCGAGGACAC GGCCTTCGAT CCCGAAGCGG CCAAGAAGCT CCTCGAGGAA
GCCGGCGTGT CGGATCTCTC GATGGAGATC TGGGCGATGC CTGTGCAGCG TCCCTACATG
CCGAACGCCC GGCGCACCGC TGAGCTGATG CAGGAAGACT TCGCCAAGAT CGGCGTCAAG
GTCGAGATCG TCTCCTACGA GTGGGGCGAG TATCTGAAGA AATCGACCGA CCCGTCGCGC
AAGGGCGCGG TCATCCTCGG CTGGACGGGC GACAACGGCG ACCCGGACAA CTTCATGGGC
GTGCTGCTGG GCTGCTCGGC CACCGGCGAC GGCGGCGCGA ACCGCGCGCA ATGGTGCAAC
AAGGAGTTCG ACGACCTGAT CCAGAAGGCG AAGGTCACGG CGGATCAGGC GGAGCGCACC
AAGCTCTACG AAGAGGCGCA GGTCGTCTTC AAGCGCGAGA ACCCCTGGGC CACCATCGCC
CATTCGACGG TCTTCATGCC GATGTCGAAG AAGGTCTCGG GCTATGTGAT GAACCCGCTG
GGCAAGCACA GCTTCTCGGG CGTCGATATC GAAGAGTGA
 
Protein sequence
MKFAHLLAAS SLALMAAAGA QAKTLVYCSE GSPEGFDPAP YTAGTTFDAA SQAVYNQLVE 
FKPGTTEIAP ALAESYEISD DGLEYTFHLR PGVKFHTTDF FTPTREMNAD DVIFSFLRQG
DESSPWHQYV AGITYEYYSG MEMPTVIKEI QKVDDLTVKF VLTRPEAPFL ANLAMDFASI
LSKEYADKLE AENRKEDLNN APVGTGPFKF VAYQKDAVIR YQANDDYWAG REKIDDLIFA
ITPDPAVRMQ KLQAGECHIM PYPAPADIEA LKADENLQVM EQPGLNVAYL AYNTTVAPFD
NPNVRKALNM AMNKEAILEA VFQGTGQVAK NPIPPTMWSY NDAVEDTAFD PEAAKKLLEE
AGVSDLSMEI WAMPVQRPYM PNARRTAELM QEDFAKIGVK VEIVSYEWGE YLKKSTDPSR
KGAVILGWTG DNGDPDNFMG VLLGCSATGD GGANRAQWCN KEFDDLIQKA KVTADQAERT
KLYEEAQVVF KRENPWATIA HSTVFMPMSK KVSGYVMNPL GKHSFSGVDI EE
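
The relationship between the gene sequence and the protein sequence listed above can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline used to produce the record: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The codon assignments follow the standard genetic code, which translation table 11 shares for amino acids (the tables differ in permitted start codons, which this sketch ignores).

```python
from itertools import product

# Standard codon-to-amino-acid assignments in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First and last lines of the gene sequence as listed above.
    first = "ATGAAATTCG CACATCTGCT GGCCGCGTCT TCGCTGGCCC TGATGGCTGC TGCCGGAGCC"
    last = "GGCAAGCACA GCTTCTCGGG CGTCGATATC GAAGAGTGA"
    print(translate(first))  # MKFAHLLAASSLALMAAAGA
    print(translate(last))   # GKHSFSGVDIEE
```

The two outputs match the first 20 and last 12 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give the value to compare with the GC content reported in the table.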