Gene Rsph17029_4022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_4022 
Symbol 
ID4898728 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp1167130 
End bp1168425 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content64% 
IMG OID640114625 
Productextracellular solute-binding protein 
Protein accessionYP_001045872 
Protein GI126464759 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGTC GCACATTCCT CACCGCCACG GCGCTTGCGC TGATCGTTCA GGCGGGTGCC 
GCCGCGGCCC AGACCGAGAT CAGCTGGTGG CACGCCATGA CGGGCGCCAA CGCGGAAGTG
GTCGAGAAGA TCGCCGCGGA TTTCAACGCG AGCCAGTCGG ACTACAAGGT GACAGCGGTC
TTCAAGGGCA CCTACCCCGA GACGCTGAAC GCCGGCATCG CAGCCTTCCG CGCCGGTCAG
GCCCCCGATA TCATCCAGGT CTTCGACGTG GGCACCGGCG TCATGATGGC GGCCGAGGGC
GCGATCAAGC CGGTGGCCGA GGTGCTGGGC GACAGTTTCG ACAAGTCGGC CTACCTGCCG
GGGATCGTGG CCTATTATTC CAAGCCCGAC GGCACGATGC TGTCCTTCCC CTACAACTCG
TCCTCGCCGA TCCTCTATTA CAACAAGGAC ATCTTCGAGA AGGCGGGCCT CGATGCAGAC
ACCCCGCCCA AGACCTGGAC CGAGGTCTGG GACATGGCGA AGAAGATCAA GGAGAGCGGC
GCCGCCCCCT GCGGCTACAC CTCGACCTGG CTCACCTGGA TTCATACCGA GAATTTCGCG
GCCTGGAACG ACGTGCCCTT CGCCACGAAC GAGAACGGGC TTGCCGATGT GAATGCCGAG
CTGAAGATCA ACGAGCCGAT CTTCGTCAAC CACTTCCAGG CGCTGGCCGA TCTCGCCAAG
GACGGCACGT TCAAATACGG CGGCCGCACG TCCGAGGCCA AGCAGATCTT CCTTGCGGGC
GAATGCGGGA TCTTCACCGA AAGCTCGGGC GGGCTCGGCG ACATCGTGAA ATCGGGCATG
AACTACGGCA TCGGCCAGCT GCCCTATGAC GAGGCGGGCA ACGGGCCGCA GAACACGGTG
CCGGGCGGCG CGAGCCTCTG GGTGATGGGC GGCAAGTCGG ACGAGACCTA TGAGGGCGTC
GCCGCCTTCT TCAACTATCT CTCGCAGACC GACGTGCAGG AATATCTGCA CCAGACGTCG
GGCTATCTGC CGGTGACGAT GGAGGCCTAC GAGGCGACCA AGGCCTCGGG CTTCTACGAG
AAGAACCCGG GCCGCGAGGT GCCGATCACC CAGATGATGG GCAAGGAGCC GACCGCCAAC
TCCAAGGGCG TGCGCCTCGT GAACCTGCCG CAGGTGCGCG ACATCGAGAA CGAGGAGTTC
GAGAAGATGC TCGCCGGAGA GCAGACCGCA CAGGAAGCGC TCGACGCGGC CGTCTCGCGC
GGCAACGAGG CGATTCGCCA GGCCATCGGC GGCTGA
 
Protein sequence
MKRRTFLTAT ALALIVQAGA AAAQTEISWW HAMTGANAEV VEKIAADFNA SQSDYKVTAV 
FKGTYPETLN AGIAAFRAGQ APDIIQVFDV GTGVMMAAEG AIKPVAEVLG DSFDKSAYLP
GIVAYYSKPD GTMLSFPYNS SSPILYYNKD IFEKAGLDAD TPPKTWTEVW DMAKKIKESG
AAPCGYTSTW LTWIHTENFA AWNDVPFATN ENGLADVNAE LKINEPIFVN HFQALADLAK
DGTFKYGGRT SEAKQIFLAG ECGIFTESSG GLGDIVKSGM NYGIGQLPYD EAGNGPQNTV
PGGASLWVMG GKSDETYEGV AAFFNYLSQT DVQEYLHQTS GYLPVTMEAY EATKASGFYE
KNPGREVPIT QMMGKEPTAN SKGVRLVNLP QVRDIENEEF EKMLAGEQTA QEALDAAVSR
GNEAIRQAIG G