Gene RPB_1010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1010 
Symbol 
ID3909134 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1157846 
End bp1159168 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content68% 
IMG OID637882903 
Productglycoside hydrolase family protein 
Protein accessionYP_484631 
Protein GI86748135 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2730] Endoglucanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGCCG CGGTGCCGAG CCATCTCGTC GATCCGCCGG CGATCGTCCG GGACGCCGCC 
GATCTGCGGC CGGCGGGGCG CGGCGGCCAG TTGCTGCCGC CCGGCTATCT CGGCACGCGG
GGCAGCCAGA TCGTCGACGT GACCGGCCGG CCGGTGCGGA TCGCCTCGAT CGGCTGGAAC
GGCACCGAGG GCCCGCCCGG CGCAGCACCC TCGGGGATCT GGAAGGTCAG CTACCGGACC
GTTCTCGACT CGATCGTCGC CGCGGGCTTC AACACCGTGC GGATTCCATG GACGGATATC
GGCCTCGACA CGCCGCTGAA CGGCTACAGC GACCGGCTCG GCTGGATCAA CACCACGCTC
AATCCCGACC TGCTGGCATC CGACACGCCC GACGCCAACG GGCGCTATCG CTACGTCACC
ACGCTGGTGG CGTTTCAGCG CATCGTCGAC TACGCCGGCG ACATCGGCCT GAAGGTGATT
TTCAATCACC ACACCAATCA GGGCACCGCG GGGCAGCAGC GCAACGGGCT GTGGTTCGAT
CTCGGCCCAG GCACCGACAA CACCGACGGC ATCAAGCCGG GCCGGTTCAC CGCGCAGGAC
TTCAAGCAGA ACTGGCTGCG GGTGGCGCGG ACCTTCGCCG GCAATCCGAC CGTGATCGGC
TACGATCTGC ACAACGAGCC CAACGGCGAC CGCGGCGCCA TCACCTGGGG CGGCGGCGGG
CCGACCGACA TCAAGGCGAT GTGCGAGGAC GTCGGCTCGG CGATCCAGGA CGTCAGCCCC
GACGTGCTGA TCATCTGCGA GGGGCCGGAG ACCTACAAGC CGCCGCCGGA ATCGTCGGGG
ATGGACCCGC GCCACGCCGC GCCCGCGGGC AATCTCACCG CGGCGGGCGC CAATCCGGTG
CGGCTCAAGA TCCCGCACAA GCTGGTGTAT TCGATCCATG AATATCCGGA GGAGATCGCC
GACACCAAGC GCTGGGGCAT TCCGGAGACC GGCAAGGGCT TCATCGACCG GATGAACACC
ACCTGGGGCT ATCTGGTGCG CGACGACATC GCGCCGGTGT GGATCGGCGA GATGGGCGCA
TCATTGCGGA CGCCCGAGAC GCGCGAATGG GCGCGCAATC TGATCGACTA CATGAACGGC
AAATACGGCA GCGAGGGCGG CCCGGCCTTT TCGGGCGATC AGCAGCCGAT CAGCGGCAGC
TGGTGGCTGA TCGGTCCGTC GAACGATCCG CCCTATGGGC TGCAGACGGA CTGGGGCGTC
GGCCACTATC GACCGGACCA GATCGCGATC ACCGACCAGA TGCTGTTTCG GCCGCGCAAG
TAG
 
Protein sequence
MDAAVPSHLV DPPAIVRDAA DLRPAGRGGQ LLPPGYLGTR GSQIVDVTGR PVRIASIGWN 
GTEGPPGAAP SGIWKVSYRT VLDSIVAAGF NTVRIPWTDI GLDTPLNGYS DRLGWINTTL
NPDLLASDTP DANGRYRYVT TLVAFQRIVD YAGDIGLKVI FNHHTNQGTA GQQRNGLWFD
LGPGTDNTDG IKPGRFTAQD FKQNWLRVAR TFAGNPTVIG YDLHNEPNGD RGAITWGGGG
PTDIKAMCED VGSAIQDVSP DVLIICEGPE TYKPPPESSG MDPRHAAPAG NLTAAGANPV
RLKIPHKLVY SIHEYPEEIA DTKRWGIPET GKGFIDRMNT TWGYLVRDDI APVWIGEMGA
SLRTPETREW ARNLIDYMNG KYGSEGGPAF SGDQQPISGS WWLIGPSNDP PYGLQTDWGV
GHYRPDQIAI TDQMLFRPRK