Gene Rsph17029_1321 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1321 
Symbol 
ID4896664 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1368489 
End bp1370213 
Gene Length1725 bp 
Protein Length574 aa 
Translation table11 
GC content66% 
IMG OID640111908 
Productextracellular solute-binding protein 
Protein accessionYP_001043203 
Protein GI126462089 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00160153 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGATACC GTACGACAGC CACCGCGATG GCCCTGTGCC TGATGGCCGT GGCCGCGCCC 
GCCTGGGCCG ACATGGAGGC CGCGAAGAAG TTTCTCGATG CCGAGATCGG GGATCTCTCC
TCGCTCTCGC GCGCCGAGCA GGAAGCCGAG ATGCAATGGT TCATCGACGC GGCCCAGCCC
TTCGCCGGGA TGGAGATCAA GGTCGTCTCC GAGACCATCA CCACCCACGA ATATGAGAGC
AAGGTCCTCG CCCCCGCCTT CTCCGCCATC ACCGGGATCA AGATCAGCCA CGACCTGATC
GGCGAGGGCG ACGTGGTCGA GAAGCTGCAG ACCCAGATGC AGTCGGGCGA GAACATCTAC
GACGCCTATA TCAACGACAG CGACCTGATC GGCACCCACT GGCGCTACAA GCAGGCGCGC
AGCCTCACCG ACTGGATGGC CAATGCGGGC AAGGACGTCA CCAACCCCGG CCTCGACCTG
CAGGATTACA TCGGGCTGAA ATTCACCACC GCCCCCGACG GCGAGCTCTA CCAGCTGCCC
GACCAGCAGT TCGCCAACCT CTACTGGTTC CGCGCCGACT GGTTCGACGA TGCCGACACC
AAGGCCGCGT TCAAGGAGAA GTACGGCTAC GAGCTGGGCG TGCCGCTGAA CTGGTCGGCC
TATGAGGACA TCGCCGAATT CTTCACCGGC CGCGACATGA GCGCGCTCGG CGGGCCGAAG
AGCGCCTTCG GCAGCATGGA TTACGGCAAG AAGGACCCGA GCCTCGGCTG GCGCTACACC
GACGCCTGGA TGTCGATGGC GGGCATGGGC GACAAGGGCG ATCCGAACGG GCTGCCCGTG
GACGAATGGG GCATCCGGGT GGACGAGAAC TCGCGTCCCG TGGGCTCCTG CGTGGCGCGC
GGCGGGGCCA CCAACGATGC GGCCGCGGTC TATGCAATCA CCAAATCGAT CGAATGGCTG
CAGAAATACG CCCCGCCGCA GGCGGCCGGC ATGACCTTCT CGGAATCGGG CCCGGTGCCG
GCGCAGGGCG AGGTGGCCCA GCAGATATTC TGGTACACCG CTTTCACCGC CGACATGGTC
AAGGAGGGCC TGCCGGTGAT GAACGAGGAC GGCACGCCCA AGTGGCGCAT GGCCCCCTCG
CCGCACGGCG CCTACTGGTC GGAGGGCACC AAGGTCGGCT ACCAGGACGT GGGTTCCTGG
ACGCTGCTGA AATCCACCCC CGACGAGCGG GCCAAGGCCG CCTGGCTCTA TGCGCAGTTC
GTCTCCTCGA AGACGGTCGA CGTGAAGAAG AGCCATGTGG GCCTGACCTT CGTGCGCGAA
TCTACCATCC AGCACCAGAG CTTCACCGAC CGTGCGCCGA AGCTCGGCGG ACTCGTGGAA
TTCTACCGCT CGCCCGCCCG GGTGCAGTGG TCGCCCACGG GCACGAACGT GCCGGACTAT
CCCAAGCTCG CCCAGCTCTG GTGGCAGAAC ATCGGCGACG CCATGTCGGG CGCCAAGTCG
CCGCAGGAGG CGCTCGACGC GCTCTGCGCC GAGCAGGAGA AGGTGATGGC CCGGCTCGAG
CGCGCGGGCG TGCAGGGCGA TCTCGGCCCG AAGCTGAACG AGGAGAAGGA CCCGCAGGAA
TGGCTCGACG CCCCCGGAGC CCCGGTGGCC AAGCTCGAGA ACGAGAAGCC ACAGGGCGAG
ACCATCTCCT ATGACGAGCT CATCAAGTCC TGGCAGAAGG GCTGA
 
Protein sequence
MRYRTTATAM ALCLMAVAAP AWADMEAAKK FLDAEIGDLS SLSRAEQEAE MQWFIDAAQP 
FAGMEIKVVS ETITTHEYES KVLAPAFSAI TGIKISHDLI GEGDVVEKLQ TQMQSGENIY
DAYINDSDLI GTHWRYKQAR SLTDWMANAG KDVTNPGLDL QDYIGLKFTT APDGELYQLP
DQQFANLYWF RADWFDDADT KAAFKEKYGY ELGVPLNWSA YEDIAEFFTG RDMSALGGPK
SAFGSMDYGK KDPSLGWRYT DAWMSMAGMG DKGDPNGLPV DEWGIRVDEN SRPVGSCVAR
GGATNDAAAV YAITKSIEWL QKYAPPQAAG MTFSESGPVP AQGEVAQQIF WYTAFTADMV
KEGLPVMNED GTPKWRMAPS PHGAYWSEGT KVGYQDVGSW TLLKSTPDER AKAAWLYAQF
VSSKTVDVKK SHVGLTFVRE STIQHQSFTD RAPKLGGLVE FYRSPARVQW SPTGTNVPDY
PKLAQLWWQN IGDAMSGAKS PQEALDALCA EQEKVMARLE RAGVQGDLGP KLNEEKDPQE
WLDAPGAPVA KLENEKPQGE TISYDELIKS WQKG