Gene RPB_4664 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4664 
Symbol 
ID3912482 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp5275273 
End bp5276619 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content65% 
IMG OID637886569 
Productextracellular solute-binding protein 
Protein accessionYP_488258 
Protein GI86751762 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.593429 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTATCG GGAGTGATCG GCGATGCGGA CGAGCCAGGA TCCTACTGAT ATCCGCGGTT 
GTTCTGCTTG CAATCGCCGT GGGCAACGCC CCCGCCCAGG CGGCGACCGA GATCGCATGG
TGGCACGCGA TGTCCGGCCA ACTCGGCCGG GAGCTCGAAA AGCTCGCCGC GGACTTCAAC
ACGTCGCAAT CCGACTACCG TGTGGTGCCC ACCTACAAGG GCAACTACAC CGAGGCGGTG
ACTGCCGCGA TTTTCGCCTT CCGCTCGTCG AGCCAGCCGG CGATCGTGCA GGTCAACGAG
ATCGCGACCG CCACGATGAT GGCGGCGAAA GGCGCGGTCT ATCCGGTCTA CGAATTGATG
CGCGACGAAA AGGAAGCGTT CTCTCCGTCG GACTACCTTC CCGCGGTCGC CGGCTATTAT
ACCGATCTGG CCGGCAACAT GCTGTCGTTT CCGTTCAACG CCTCGACCCC GATGCTGTAC
TACAACAAGT CGATGTTCAG AAAGGTCGGC CTCGACCCCG AGACGCCGCC GGCGACATGG
CCTGACGTCG GCGCCGCGGC GAAGCGGCTG GTCGCCGCCG GGGTGCCGTG CGGACTCACC
ACGTCGTGGC CGTCCTGGGT CAATGTCGAG AATTTCTCCG CCTATCACAA CCTCCCGCTC
GCGACCCGGG CGAACGGCCT CGGCGGGATG GATGCAGTAC TGGTCTTCAA CAATCCCGTC
CTGGTTCGGC ACATCGCCGA ACTGGCGGAA TGGCAGAAGA CCAGGGTATT CGACTATGGT
GGCCGCGCCA CCGCCACGGA GCCGCGATTC CAGCGGGGCG ATTGCGGTAT CTTCGTCGGC
TCCTCGGCGA CCCGCGCCGA TATCATCGCC AATTCCAAAT TCGAGGTCGG TTACGGCCGG
CTGCCGTTCT GGCCGGACGT CGCCGGCGCG CCGCAAAACA CCATTATCGG CGGCGCGACG
CTGTGGGTGC TGCGCGGCCG GCCGGCCGAC GAATACAAAG GCGTCGCCAA GTTCTTCGCC
TATCTGTCGC GCGCCGACGT GCAGGCCGCC TGGCATCAAA ACACGGGCTA TCTGCCGGTG
ACGCGCGCCG CCTACGAACT GACGCGCGCG CAGGGATTCT ACGAACGCAA TCCCGGCACG
GCGATCTCGA TCGAGCAGAT GACCCTGAAG CCGCCGACCG ACAATTCGCG CGGATTGCGA
CTGGGCTCCT TCGTCCTGAT CCGCGACGTC ATTGACGACG AGCTCGAACA GGCGTTCAGC
GGCCGAAAGC CGGCGCAGGC GGCAATGGAT TCCGCGGTCG AGCGCGGCAA CAAGCTGCTG
CGTCAGTTCG AACGGACCCA ACCATGA
 
Protein sequence
MAIGSDRRCG RARILLISAV VLLAIAVGNA PAQAATEIAW WHAMSGQLGR ELEKLAADFN 
TSQSDYRVVP TYKGNYTEAV TAAIFAFRSS SQPAIVQVNE IATATMMAAK GAVYPVYELM
RDEKEAFSPS DYLPAVAGYY TDLAGNMLSF PFNASTPMLY YNKSMFRKVG LDPETPPATW
PDVGAAAKRL VAAGVPCGLT TSWPSWVNVE NFSAYHNLPL ATRANGLGGM DAVLVFNNPV
LVRHIAELAE WQKTRVFDYG GRATATEPRF QRGDCGIFVG SSATRADIIA NSKFEVGYGR
LPFWPDVAGA PQNTIIGGAT LWVLRGRPAD EYKGVAKFFA YLSRADVQAA WHQNTGYLPV
TRAAYELTRA QGFYERNPGT AISIEQMTLK PPTDNSRGLR LGSFVLIRDV IDDELEQAFS
GRKPAQAAMD SAVERGNKLL RQFERTQP