Gene RPB_2102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2102 
Symbol 
ID3908516 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2390044 
End bp2391570 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content68% 
IMG OID637883995 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_485719 
Protein GI86749223 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0103698 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00738947 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATCTGT CCGAATGGCT CGCCGCGAGC GCACGGCTGC GGCCGTCCGC GCCGGCCTTG 
CTCACCGGCA CCACGATCGA GGCGGACTAC GCGACGTTCG CGCAGCGCGC CGCCTCGTTC
GCTGCAGCGC TGCAGCGCGA CTACGGCATC GTCTCCGGCG ACCGCGTCGC GCTGTTCGCG
CATAATTGCA CGCAATATCT CGAGGCACTG TACGGCATCT GGTGGGCGGG CGCGGTGGCG
GTGCCGATCA ACGCCAAGCT GCACGGCAAG GAAGCGGCGT GGATCTGCAG CAATTCCGGC
GCCAAGCTGG CGCTGATCTG CGACGACACC GCGGACACTT TCAACGAGGC CGCGGGCGAA
TTGCCGGCCC GCATGGCGAC GCTGGCGCTC GACAGCGACG CCTACATTCG CGCCCGTAGC
GGCGACGGGC CGGCGGCGCC GGCGGCGCGC GAGGACGGCG ATCTCGCCTG GCTGTTCTAC
ACCTCCGGCA CCACCGGCCG GCCGAAGGGG GTGATGCTCA GCCACGGCAA TCTGATCGCG
ATGTCGCTGT GCTATTTGGC CGATGTCGAC ACGGTGTCGT CCGATGACGC CGCGCTCTAT
GCCGCGCCGA TCTCGCACGG TGCCGGGCTC TACAACATGA TCCACACCCG GTTCGGCGCG
CGTCACGTCG TGCCCGCCTC CAAGGGCTTC GACCCCGACG AGGTGCTGAC GCTCGGCAAG
CAGCTCGGCA ACGTCGCGAT GTTCGCCGCG CCCACCATGG TGAAGCGGCT GGTCGAGGCC
GCAAGGCGCC GCGGCGAGCG CGGCGAGGGA CTGCGCACCA TCGTCTACGG CGGCGGCCCG
ATGTATCTCG CCGACATCCG CGACGCGCTC GACGTGATGG GCCAGCGCTT CGTGCAGATC
TACGGCCAGG GCGAATCGCC GATGGCGATC ACGTCGCTGA AGCGCGAGTT GCACGCCGAT
GTCGATCATC CGCGCTATCT GCAGCGGCTG GCCTCGGTCG GCACCGCGCA GAGCGCGCTG
TCGGTGCGGA TCACCGGGCC TGACGGCGAG GTGCTGCCGG CCGGCGAGAC CGGCGAGATC
GAGGCCAAGG GCCCGACCGT GATGCTCGGC TACTGGAACA ATTCGGACGC CAACGCCGAG
ACGCTGAAAG ACGGCTGGCT GCGCACCGGC GATGTCGGGC GCCTGGACGA GGACGGCTTT
CTCACGCTGT CGGACCGCTC CAAGGACGTG ATCATCTCCG GCGGCACCAA CATCTATCCG
CGCGAAGTGG AAGAAGCGCT GCTGACGCAT CCCGCGGTGC GCGAGGTCTC GGCGATCGGC
GTCGCCGATC CGGAATGGGG CGAGACCGTG GTCGCCTGTG TGGTGCTGGC GGACGGATCG
GAGCCCAGCG ACACTGCGCT CGACGCGCAT TGCCTCGCCG CCATCGCCCG CTTCAAGCGG
CCGAAGCGCT ACGTCTATCT GGAAGCGTTG CCGAAGAACA ATTACGGCAA GGTGCTGAAG
ACCGAGCTGC GCAAGATGGT GACTTAG
 
Protein sequence
MNLSEWLAAS ARLRPSAPAL LTGTTIEADY ATFAQRAASF AAALQRDYGI VSGDRVALFA 
HNCTQYLEAL YGIWWAGAVA VPINAKLHGK EAAWICSNSG AKLALICDDT ADTFNEAAGE
LPARMATLAL DSDAYIRARS GDGPAAPAAR EDGDLAWLFY TSGTTGRPKG VMLSHGNLIA
MSLCYLADVD TVSSDDAALY AAPISHGAGL YNMIHTRFGA RHVVPASKGF DPDEVLTLGK
QLGNVAMFAA PTMVKRLVEA ARRRGERGEG LRTIVYGGGP MYLADIRDAL DVMGQRFVQI
YGQGESPMAI TSLKRELHAD VDHPRYLQRL ASVGTAQSAL SVRITGPDGE VLPAGETGEI
EAKGPTVMLG YWNNSDANAE TLKDGWLRTG DVGRLDEDGF LTLSDRSKDV IISGGTNIYP
REVEEALLTH PAVREVSAIG VADPEWGETV VACVVLADGS EPSDTALDAH CLAAIARFKR
PKRYVYLEAL PKNNYGKVLK TELRKMVT