Gene Rpal_0382 details

Gene Information

Locus tag: Rpal_0382
Symbol:
ID: 6408029
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 405710
End bp: 406990
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 64%
IMG OID: 642710293
Product: glycoside hydrolase family 4
Protein accession: YP_001989418
Protein GI: 192288813
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases
TIGRFAM ID:
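
The length fields above agree with the coordinates if the start and end positions are read as inclusive, 1-based coordinates and the gene length includes the stop codon. Both conventions are assumptions here, not something the record states. A minimal sketch of that arithmetic, with hypothetical helper names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for inclusive, 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Coordinates taken from the record above.
length = gene_length_bp(405710, 406990)
print(length, protein_length_aa(length))  # 1281 426
```

Under these assumptions the computed values match the listed gene length (1281 bp) and protein length (426 aa).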


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.126645
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCAACA CCACCAAGAT CGTGTTTCTC GGCGCCTCCA GTGCGTCGTT CGGTCTCAGC 
ATGTTTCGTG ACCTGTTCGC TTCGCCGGTG CTGGCCGGTT CGACGCTGAC GCTGGTCGGA
CGCAATGCGG AAACGCTCGG CAGAATGACC GAGCTGGCGA AGCTGATGAA CGCAAGGTCC
GGTGCCGGAC TGATCATCGA GCAGACCACC GACCGCCACG CCGCGCTGGA CGGCGCCGGC
TTCGTCATCA ACGCCACCGC GATCGATCGC AACCGGCTGT GGAAGCAGGA CTTCGAGGTG
CCGAAGAAGC ACGGCATCCG TCACACCCTC GGCGAGAACG GCGGCCCGGG CGGATTGTTC
TTCACGCTGC GCACGCTCCC CTTGGTGTTC GACTTCATCC GCGACATCGA GGAGCTGTGC
CCGAACGCGC TGTTTCTCAA CTACTCCAAT CCCGAAAGCC GGATCATTCT GGCGCTCGGC
CGCTACACCA AGGTGCGCCA TATCGGACTG TGTCACGGCA TCTTCATGGG CCGCGACGCG
GTCGCCTACA TCATGCAGAT GCCGCGCGAA GAGATCGAAG TGTGGGGCGC CGGGCTCAAT
CACTTCCAGT GCTTGACCGA GATCCGCCAC CGTGACACCG GCGAGGATCT GTATCCGCGG
TTTCGCGCCG CCGAGCAGAG CTTTGATCCG GATGCGTGGC GGTTCACGCG ACGGCTGTAT
CGCGCGTTCG GCTATTGGCT GACCTGCAGC GATGATCATC TCGGCGAGTA TCTGCCGTAT
GGCTGGGAAG CCGGCGAGAA GGGCTACGAT TTCGACCAGG ACGAACGCTG GCGCGGCGAA
TTCCTCACCC AGCTGAATGG CGTGCTCGGC GGAACCATGC CGATCCCGCG GTGGTGGACC
GAACCGTCGG GCGAGCGCGG CGCCGCCGTG ATCGCCGCGA TGCTGCACAA CCAGAAGCGT
TTCATCGAAT CCGGCATCGT GCTCAATCGC GGCGTGATCC CCAACCTGCC GGCGGAGCTC
GCGGTCGAAG TCCCGGTGAC TGTAGATGCG GCCGGCGTGC ATCCGGTGTC GCTCGGCCCG
TTGCCCGACC CGATCGCCAA GCTGATGCTG ATGCAGGCCA GCGTGCAGCA GCTCGCGGTC
GAGGCGGCCG TCCACGCCTC GAAGGAACTG GCCCTGCAGG CGCTGCTGAT CGATCCGGTG
GTCAACTCGG CGGTCGCGGC CGAAAAGATC CTGGACGAAT TGTGGGAGAT CAACCGGCCG
TATATTCGGA AGTGTGTGTA G
 
Protein sequence
MTNTTKIVFL GASSASFGLS MFRDLFASPV LAGSTLTLVG RNAETLGRMT ELAKLMNARS 
GAGLIIEQTT DRHAALDGAG FVINATAIDR NRLWKQDFEV PKKHGIRHTL GENGGPGGLF
FTLRTLPLVF DFIRDIEELC PNALFLNYSN PESRIILALG RYTKVRHIGL CHGIFMGRDA
VAYIMQMPRE EIEVWGAGLN HFQCLTEIRH RDTGEDLYPR FRAAEQSFDP DAWRFTRRLY
RAFGYWLTCS DDHLGEYLPY GWEAGEKGYD FDQDERWRGE FLTQLNGVLG GTMPIPRWWT
EPSGERGAAV IAAMLHNQKR FIESGIVLNR GVIPNLPAEL AVEVPVTVDA AGVHPVSLGP
LPDPIAKLML MQASVQQLAV EAAVHASKEL ALQALLIDPV VNSAVAAEKI LDELWEINRP
YIRKCV
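
The record lists translation table 11 and a GC content of 64%. As a rough sketch of how the protein sequence and GC fraction could be recomputed from the gene sequence, the Python below strips the 10-base grouping, translates codon by codon, and counts G and C bases. The function names are hypothetical. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in its permitted start codons, which this sketch does not model (the gene here starts with ATG in any case).

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 shares these
# and differs only in its set of alternative start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence above.
first_row = "ATGACCAACA CCACCAAGAT CGTGTTTCTC GGCGCCTCCA GTGCGTCGTT CGGTCTCAGC"
print(translate(first_row))  # MTNTTKIVFLGASSASFGLS
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison against the protein sequence and GC content listed in the record.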