Gene RPB_0445 details


Gene Information

Locus tag: RPB_0445
Symbol:
ID: 3910001
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 491316
End bp: 492596
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 67%
IMG OID: 637882331
Product: glycoside hydrolase family protein
Protein accession: YP_484067
Protein GI: 86747571
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases
TIGRFAM ID:
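
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship in Python. It assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. Neither assumption is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature from 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(491316, 492596)
print(length)                     # 1281
print(protein_length_aa(length))  # 426
```

Both results agree with the Gene Length and Protein Length fields of this record.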


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.482345
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAGAA CGACCAGGAT CGTGTTGCTC GGCGCCAGCA GTGCGTCATT CGGCCTCAGC 
ATGTTGCGCG ATCTGTTCGC CACGCCGGAG CTGCGCGGGT CGACGCTGGT GATGGTCGGG
CTCGATGCGG CGAGGCTCGC GACCATGGCC GAGCTGGCGA AGCTGCTGAA CGCGACGACC
GGCGCCGGCT TCGTCATCGA ACACACCACC GACCGCCGCG CCGCGCTGGA CGGCGCAAGC
TTCGTCATCA ACGCCACCGC GATCGATCGC AACCGGCTGT GGAAGATGGA TTTCGAGGTG
CCGAAGAAGC ACGGCATCCG GCATCCGCTG GGTGAGAACG GTGGCCCCGG CGGATTGTTC
TTCACGTTGC GGACGCTGCC GCTGGTGTTC GATTTCATCC GCGACATCGA GGAGCTTTGC
CCCGAGGCGC TGTTTCTCAA CTACTCCAAT CCGGAAAGCC GCATCGTGCT GGCGCTCGGG
CGCTATTCGA AGGTGCGCTG CATCGGCCTG TGTCACGGCA TCTTCATGGG CCGCGACGCC
GTCGCCGACA TCATGGGACT GCCGCGCGAG CGCGTCGAGG TGTGGGGCGC GGGGCTCAAT
CACTTCCAGT GCCTGCTGCA GATCCGCGAC CGCCTCACCG GCGAAGACCT CGCGCCGCGG
CTGCGCGCGG CGGAGCAGAG CTTCGATCCC AATGCCTGGC GCTTCACCCG GCGGCTGTAT
CGCGCCTTCG GCCACTGGCT GACCTGCAGC GACGATCATC TCGGCGAGTA TCTGGCTTAC
GGCTGGGAGG CCGGCGAGCG CGGCTATGAT TTCGCCGGCG ACGACCGCAG CCGCGTCGAG
ACCCTGGCGC AGATCGACGC CGTGCTGGCC GGGACGATGC CGATCCCACA TTGGTGGACC
GAGCCCTCGG GCGAGCGCGG CGCCGCGGTG ATCGCCGCGA TGCTGCACGA CCAGAAGCGC
TTCATCGAAT CCGGCATCGT GATGAACCGC GGCGTCATCC CCAATCTGCC GGCGGAGCTC
GCCGTCGAGG TGCCGGTGAC GGTCGACGCC GCCGGGGTGC ATCCGGTGTC GCTCGGTCCA
CTACCAGATC CGATCGCCAA GCTGATGCTG ATGCAGGCCA GCGTGCAGCA ACTCGCGGTC
GAGGCGGCGG TGCACGCCTC GAAAGAACTC GCGCTGCAGG CGCTGCTGAT CGACCCGGTG
GTCAACTCAG CGGTCGCCGC GGAAAAGATC CTCGACGAGC TGTGGGAGAT CAACCGGCCC
TATATCAGGG CGTGCGTGTA G
 
Protein sequence
MARTTRIVLL GASSASFGLS MLRDLFATPE LRGSTLVMVG LDAARLATMA ELAKLLNATT 
GAGFVIEHTT DRRAALDGAS FVINATAIDR NRLWKMDFEV PKKHGIRHPL GENGGPGGLF
FTLRTLPLVF DFIRDIEELC PEALFLNYSN PESRIVLALG RYSKVRCIGL CHGIFMGRDA
VADIMGLPRE RVEVWGAGLN HFQCLLQIRD RLTGEDLAPR LRAAEQSFDP NAWRFTRRLY
RAFGHWLTCS DDHLGEYLAY GWEAGERGYD FAGDDRSRVE TLAQIDAVLA GTMPIPHWWT
EPSGERGAAV IAAMLHDQKR FIESGIVMNR GVIPNLPAEL AVEVPVTVDA AGVHPVSLGP
LPDPIAKLML MQASVQQLAV EAAVHASKEL ALQALLIDPV VNSAVAAEKI LDELWEINRP
YIRACV
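
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this in plain Python. It relies on the fact that NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two tables differ only in permitted start codons, which this sketch does not handle). The names `CODON_TABLE` and `translate` are illustrative, and only the first and last fragments of the gene sequence are used as input.

```python
# Codon table built in the conventional TCAG ordering; the amino acid
# assignments are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence
print(translate("ATGGCCAGAA CGACCAGGAT CGTGTTGCTC"))  # MARTTRIVLL
# Last 21 bases of the gene sequence, ending in the TAG stop codon
print(translate("TATATCAGGG CGTGCGTGTA G"))           # YIRACV
```

The two outputs match the first ten and last six residues of the protein sequence in this record.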