Gene RPB_1717 details

Gene Information

Locus tag: RPB_1717
Symbol:
ID: 3908242
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 1956442
End bp: 1957824
Gene Length: 1383 bp
Protein Length: 460 aa
Translation table: 11
GC content: 66%
IMG OID: 637883611
Product: ethanolamine ammonia lyase large subunit
Protein accession: YP_485336
Protein GI: 86748840
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4303] Ethanolamine ammonia-lyase, large subunit
TIGRFAM ID:
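
The length fields in the record above follow from the coordinates by simple arithmetic. The sketch below reproduces that check, assuming the Start and End positions are 1-based and inclusive and that the gene length includes the stop codon (the record does not state either convention). The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above.
START_BP = 1956442
END_BP = 1957824


def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1383
print(protein_length(length_bp))  # 460
```

Both values agree with the Gene Length and Protein Length fields listed above.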


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.133607
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.24055
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCTATC GTCACGCTAT CGGTAACGTC GCTTACGTCT TCGACAATCT GCGCGACCTG 
CTCGCCAGAG CCACGCCCCC TCGATCCGGT GACCGGCTCG CCGGCGTCGC CGCCGACAGT
GCCGAGCAGA TGGTCGCGGC GCGGATGGCG CTCGCCGAGG TGCCGCTGCG GCAATTTCTC
AATGAGACCG TCATCCCCTA TGAAGACGAC GAGGTCACAA GGCTGATCGT CGACAGCCAC
GACGCGCAAA GCTTTGCTCC GATTGCCGCG CTCACCGTCG GAGGTTTCCG CGACTGGCTG
CTGTCGGATG CGGCGACGCC CGCGACGCTC GCCGCGATCG CGCGCGGCGT CACTCCCGAA
ATGGCCGCCG CGGTCAGCAA GCTGATGCGC AACCAGGATC TGATCCTGGT CGCCAAGAAG
TGCAGCGTCG TCACCCGTTT CCGCAACACG ATCGGCCTGC CGGGCCGGAT GAGCGTGCGG
TTGCAGCCCA ATCACCCGTT CGACGATGTT CGCGGCATCA CCGCCTCGAC GCTGGACGGC
CTGCTGCTCG GCGCCGGCGA TGCCTGTATC GGCATCAACC CGGCGAGCGA CGATCCGGCG
GTGCTTGGGC AATTGGTGCG GCTGCTCGAC GACGTCATCA CGCGGCTGGC GATCCCCACC
CAGAGTTGCG TGCTGACCCA CGTCACCACC TCGCTGCGAT TGATGGAGGA GGGAGTGCCC
GTCGATCTGG TGTTCCAGTC GATTGCCGGC ACCGAAGCCG CCAACCGCAG CTTCGGCATC
GACCTGTCGA TCCTGAAGGA GGCGCACGAC GCCGGGCTCT CGTTGAAGCG GGGCACCGTC
GGCGAAAATG TGATGTATTT CGAGACCGGG CAGGGCTCCG CGCTGTCGGC CGACGCCCAT
CACGGCGTCG ATCAGCAGAC CTGCGAGGCG CGAGCCTATG CGGTGGCGCG GGCCTATGCG
CCGCTGCTGG TCAACAGCGT CGTCGGATTC ATCGGCCCCG AATATCTCTA CGACGGCAAG
GAGATCATCC GCGCCGGGCT GGAGGACCAT TTTTGCGGCA AGCTGCTAGG CCTGCCGCTC
GGCGTCGACA TCTGCTATAC CAACCACGCC GAAGCCGACC AGGACGACAT GGACACGCTG
CTGACGCTGC TGGCCACCGC CGGCGTCAGC TTCATCATGG GCGTGCCCGG CGCCGACGAC
GTCATGCTGA ACTACCAGTC GACCTCGTTT CACGACGCGC TCTACGTCCG CGAACTTCTC
GGTCTGAAGC GAGCCCCGGA GTTCGACGAC TGGCTCGTTC GCACCGGGCT CTCCCAGGCC
GACCTCCGTC TGACGGCCGC CGATGGGCGG CTGCCGGACT TCGCCGCCCG GCTGATCGCC
TGA
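
The GC content field in this record can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over the full gene length; the helper name `gc_percent` is hypothetical. It strips the spaces and line breaks used in the 10-base display blocks before counting.

```python
def gc_percent(seq):
    """Percentage of G and C bases in a DNA string; whitespace is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# Only the first display line of the gene is used here for illustration;
# paste the full sequence to compare the result with the GC content field.
first_line = "ATGCTCTATC GTCACGCTAT CGGTAACGTC GCTTACGTCT TCGACAATCT GCGCGACCTG"
print(round(gc_percent(first_line), 1))
```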
 
Protein sequence
MLYRHAIGNV AYVFDNLRDL LARATPPRSG DRLAGVAADS AEQMVAARMA LAEVPLRQFL 
NETVIPYEDD EVTRLIVDSH DAQSFAPIAA LTVGGFRDWL LSDAATPATL AAIARGVTPE
MAAAVSKLMR NQDLILVAKK CSVVTRFRNT IGLPGRMSVR LQPNHPFDDV RGITASTLDG
LLLGAGDACI GINPASDDPA VLGQLVRLLD DVITRLAIPT QSCVLTHVTT SLRLMEEGVP
VDLVFQSIAG TEAANRSFGI DLSILKEAHD AGLSLKRGTV GENVMYFETG QGSALSADAH
HGVDQQTCEA RAYAVARAYA PLLVNSVVGF IGPEYLYDGK EIIRAGLEDH FCGKLLGLPL
GVDICYTNHA EADQDDMDTL LTLLATAGVS FIMGVPGADD VMLNYQSTSF HDALYVRELL
GLKRAPEFDD WLVRTGLSQA DLRLTAADGR LPDFAARLIA
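
The protein sequence above corresponds to a translation of the gene sequence under translation table 11. The sketch below shows one way to check this, assuming the standard codon-to-amino-acid assignments (which table 11 shares with the standard code) and ignoring alternative start codons, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spaces and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from this record.
print(translate("ATGCTCTATC GTCACGCTAT CGGTAACGTC"))  # MLYRHAIGNV
```

The output matches the first ten residues of the protein sequence listed in this record.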