Gene Rpal_5242 details

Gene Information

Locus tag: Rpal_5242
Symbol:
ID: 6412942
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 5653057
End bp: 5654673
Gene Length: 1617 bp
Protein Length: 538 aa
Translation table: 11
GC content: 66%
IMG OID: 642715132
Product: alpha amylase catalytic region
Protein accession: YP_001994205
Protein GI: 192293600
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
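
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TAG). The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 5653057
end_bp = 5654673

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and encodes no amino acid.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, "bp")     # matches the "Gene Length" field
print(protein_length_aa, "aa")  # matches the "Protein Length" field
```

Under these assumptions the result is 1617 bp and 538 aa, in agreement with the record.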


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAAGCG TGACGAGCGT GTCGTGGTGG GCGGCGGGTG TGCTGTACCA GATCTATCCG 
CGCTCGTTTC AGGACAGCAA CAACGACGGC ATCGGTGATC TGCGCGGGAT CATCGATCGG
CTCGGCTATC TGTCCGATCT CGGCGTCGAC GCGATCTGGC TGTCTCCGAT CTTTCCGTCA
CCGATGGCGG ACTTCGGCTA CGATGTCGCC GACTATGTCG GCATCGATCC AATCTTCGGC
ACGATGGACG ATTTCGACGC GCTGGTGCTC ACGGCCCATG CCCGCGGGCT CAAGGTCATT
CTCGATCTGG TGCCGAACCA TTCATCCGAA CAGCATCCCT GGTTCGTCGA GAGCCGGTCG
TCGCGGCACA ATCCCAAGCG CGATTGGTAC ATCTGGCGCG ACCCAGCCCC GGACGGCGGA
CCGCCGACGA ATTGGCTGTC GGAATTCGGC GGCAGCGGCT GGGAGTACGA CGACGCCACC
GGCCAATACT ACTATCACGC CTTTCTGAAG CAGCAGCCGG ACCTCAACTG GCGTAATCCG
GAAGTGCGGG CCGCGATCTA TGACGCGATG CGGTTCTGGC TGCGGAAGGG CGTCGATGGC
TTCCGGGTCG ACGTGATCTG GCACCTGATC AAGGACGACC AATTTCGCGA CAACCCGCCC
AATCCGGATT TTCGCCCCGG CATGCCGCCG CACGCCGCGC TGATCCCGAT TTACTCCGCG
GATCGGCCGG AGACGTTGGA GGTGGTTGCC GAACTGCGCC GTGTCGTCGA CGAATTCGAT
CATCGGCTGC TGATCGGCGA GATCTATCTG CCGGTGGAGC GGCTGGTCGC GTATTACGGA
GCGGAGCTGA AGGGCGCGCA GCTGCCGTTC AACTTCGCGC TGCTGTCGAC GCCGTGGCGT
GCCCGGGAGA TTGCGGCGCT GATCGATCGC TACGAGCAGG CATTGCCGGC CGGCGCGTGG
CCGAACTGGG TGCTGGGCAA TCACGACCGG CCGCGGGTGG CGAGCCGGGT CGGGCCGGCA
CAGGCCCGCG TCGCCGCGAT GCTGTTGCTC ACACTGCGGG GCACCCCGAC GATGTATTAC
GGCGACGAGC TTGGCATGGA GCAGGTCGAG ATCGCGCCCG AGGACGTGCA GGATCCGTTC
GAGAAGAACG TGCCCGGCAT CGGTGTCGGC CGCGATGGCT GCCGCACGCC GATGCAGTGG
GACGCGTCGG ACAACGCCGG GTTCTCGGAT GTCCGGCCGT GGCTCCCGCT GGCCCCGAAC
GCCACCCAGG ACAACGTCGC TAATCTGCGC GCTGACGCGC AGTCGATCCT AAACCTCTAT
CGCGCTCTGC TGCGGCTACG GCGCGCACTG CCCCAGCTCG CGCTTGGCGA GTATCAGCCA
CTCGCCGCGG AGGGCGAGCT GTTGCTGTAT CGTCGGCATC ATCAGGGGCG GTCGGTCTTG
GTGGCGCTCA ATCTCGGCCC GGATCCGATC TCGGCGGCGT CCGACGCGAT CGGGCTAGAC
GGCGAGGTGC TGCTGTCGAC CATGCTCGAC CGCGCCGGCG AACGCATCGG TGCCACGCTC
GACCTGCGCG GCAGCGAAGG CATCATCGTC GGCCGGGCGC CGGAGGACCT CGTTTAG
 
Protein sequence
MESVTSVSWW AAGVLYQIYP RSFQDSNNDG IGDLRGIIDR LGYLSDLGVD AIWLSPIFPS 
PMADFGYDVA DYVGIDPIFG TMDDFDALVL TAHARGLKVI LDLVPNHSSE QHPWFVESRS
SRHNPKRDWY IWRDPAPDGG PPTNWLSEFG GSGWEYDDAT GQYYYHAFLK QQPDLNWRNP
EVRAAIYDAM RFWLRKGVDG FRVDVIWHLI KDDQFRDNPP NPDFRPGMPP HAALIPIYSA
DRPETLEVVA ELRRVVDEFD HRLLIGEIYL PVERLVAYYG AELKGAQLPF NFALLSTPWR
AREIAALIDR YEQALPAGAW PNWVLGNHDR PRVASRVGPA QARVAAMLLL TLRGTPTMYY
GDELGMEQVE IAPEDVQDPF EKNVPGIGVG RDGCRTPMQW DASDNAGFSD VRPWLPLAPN
ATQDNVANLR ADAQSILNLY RALLRLRRAL PQLALGEYQP LAAEGELLLY RRHHQGRSVL
VALNLGPDPI SAASDAIGLD GEVLLSTMLD RAGERIGATL DLRGSEGIIV GRAPEDLV
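
The relationship between the two sequence blocks and the GC content field can be explored with a short script. This is a minimal sketch, not the database's own method: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard one, whose amino acid assignments translation table 11 shares (table 11 differs in the start codons it permits, which does not matter here because this gene starts with ATG).

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the 10-character block spacing and line breaks used above."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied with their block spacing.
first_blocks = "ATGGAAAGCG TGACGAGCGT GTCGTGGTGG"
print(translate(first_blocks))  # MESVTSVSWW, the first protein block above
```

Pasting the full gene sequence into `translate` allows a comparison with the protein sequence, and `gc_fraction` on the same input can be compared with the GC content field.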